Genalog is an open-source, cross-platform Python package that generates document images with synthetic noise that mimics scanned analog documents. It takes page layouts from templates written in HTML and applies various degradations to the rendered images, which makes generating synthetic documents fast and efficient.
Genalog’s capabilities include flexible format image generation, custom image degradation, extracting text from images using a cognitive search pipeline, and getting OCR performance metrics.
This package provides a comprehensive solution for generating synthetic images from any text data rich in natural language, and it imitates most of the OCR noise found in scanned documents.
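To make the idea of image degradation concrete, here is a minimal sketch of one common kind of scan noise, salt-and-pepper speckle, written in plain Python. It does not use Genalog's API; the function name `add_salt_and_pepper`, its parameters, and the list-of-rows image representation are all assumptions made for illustration.

```python
import random

def add_salt_and_pepper(image, amount=0.05, seed=None):
    """Return a copy of a grayscale image with some pixels replaced by speckle.

    `image` is a list of rows, each a list of ints in 0..255.
    `amount` is the probability that any given pixel is replaced by
    pure black (0) or pure white (255).
    """
    rng = random.Random(seed)
    noisy = []
    for row in image:
        new_row = []
        for pixel in row:
            if rng.random() < amount:
                # Replace the pixel with "pepper" (0) or "salt" (255).
                new_row.append(rng.choice((0, 255)))
            else:
                new_row.append(pixel)
        noisy.append(new_row)
    return noisy

# A blank "page" (hypothetical data): 20 rows by 40 columns of light gray pixels.
page = [[200] * 40 for _ in range(20)]
degraded = add_salt_and_pepper(page, amount=0.1, seed=0)
```

A real pipeline would typically chain several such effects (blur, bleed-through, morphological operations) on image arrays rather than nested lists; the sketch only shows the shape of a single degradation step.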
Genalog provides you with several document templates to use as a starting point. The document’s layout can be altered using standard CSS properties like font-family, font-size, text-align, etc.
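As a rough illustration of how CSS properties control a document's layout, the sketch below fills a small HTML page with text and style values using only the Python standard library. It is not Genalog's template format or API; `PAGE_TEMPLATE` and `render_page` are hypothetical names introduced here.

```python
from html import escape
from string import Template

# Hypothetical single-page template; the CSS values are filled in at render time.
PAGE_TEMPLATE = Template("""<!DOCTYPE html>
<html>
<head>
<style>
  body {
    font-family: $font_family;
    font-size: $font_size;
    text-align: $text_align;
  }
</style>
</head>
<body>
<p>$text</p>
</body>
</html>
""")

def render_page(text, font_family="Times", font_size="12px", text_align="justify"):
    """Fill the template with HTML-escaped text and the chosen CSS values."""
    return PAGE_TEMPLATE.substitute(
        text=escape(text),
        font_family=font_family,
        font_size=font_size,
        text_align=text_align,
    )

html_page = render_page("Scanned documents & OCR", font_family="Courier", font_size="14px")
```

Varying the style arguments across many renders is one simple way to get visually diverse pages from the same source text.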
Apart from document generation and degradation, Genalog also provides an efficient implementation of text alignment between the source text and the noisy text.
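Text alignment here means padding the clean source text and the noisy OCR text so that they line up character by character. The sketch below shows the idea with the standard library's `difflib`; it is a stand-in technique, not Genalog's implementation, and the `align` function and the `@` gap character are assumptions chosen for illustration.

```python
from difflib import SequenceMatcher

GAP_CHAR = "@"  # assumed placeholder; pick a character absent from both texts

def align(source, noisy, gap_char=GAP_CHAR):
    """Pad two strings with gap characters so they have equal length and line up."""
    aligned_source, aligned_noisy = [], []
    matcher = SequenceMatcher(None, source, noisy, autojunk=False)
    for _tag, i1, i2, j1, j2 in matcher.get_opcodes():
        src_piece = source[i1:i2]
        noisy_piece = noisy[j1:j2]
        # Pad the shorter piece so both sides of this edit span the same width.
        width = max(len(src_piece), len(noisy_piece))
        aligned_source.append(src_piece.ljust(width, gap_char))
        aligned_noisy.append(noisy_piece.ljust(width, gap_char))
    return "".join(aligned_source), "".join(aligned_noisy)

aligned_src, aligned_ocr = align("New York is big", "Nev York s big")
```

Once the two strings have equal length, labels attached to character positions in the source text (for example, named-entity tags) can be carried over to the corresponding positions in the noisy text.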
Related Paper: https://arxiv.org/pdf/2108.02899.pdf
Asif Razzaq is an AI Journalist and Cofounder of Marktechpost, LLC. He is a visionary, entrepreneur and engineer who aspires to use the power of Artificial Intelligence for good.
Asif's latest venture is the development of an Artificial Intelligence Media Platform (Marktechpost) that will revolutionize how people can find relevant news related to Artificial Intelligence, Data Science and Machine Learning.
Asif was featured by Onalytica in its ‘Who’s Who in AI? (Influential Voices & Brands)’ as one of the 'Influential Journalists in AI' (https://onalytica.com/wp-content/uploads/2021/09/Whos-Who-In-AI.pdf). His interview was also featured by Onalytica (https://onalytica.com/blog/posts/interview-with-asif-razzaq/).