Text Post-processing on Optical Character Recognition output using Natural Language Processing Methods
Image credit: UnsplashAbstract
The technique of turning images of printed or written text from scanned documents, images of documents, or simple photos into machine-encoded text is known as optical character recognition (OCR). OCR has proven to be very useful in terms of digitizing documents and making them easier to analyze. Despite the advancement in the technology since it was introduced, there are still areas OCR falls short. If either the written text is illegible, or the OCR software isn’t powerful enough, it results in inaccurate translations. This research work aims at addressing this shortcoming by performing postprocessing on OCR outputs primarily using Transformers such as BERT in a two-step pipeline to correct these mistakes and improve the quality of the document.
Type
Publication
2023 IEEE 3rd Mysore Sub Section International Conference (MysuruCon)
Click the Cite button above to demo the feature to enable visitors to import publication metadata into their reference management software.