Text Post-processing on Optical Character Recognition output using Natural Language Processing Methods

Dec 1, 2023·

Sneha Balasubramoni

· 1 min read

Image credit: Unsplash

Abstract

The technique of turning images of printed or written text from scanned documents, images of documents, or simple photos into machine-encoded text is known as optical character recognition (OCR). OCR has proven to be very useful in terms of digitizing documents and making them easier to analyze. Despite the advancement in the technology since it was introduced, there are still areas OCR falls short. If either the written text is illegible, or the OCR software isn’t powerful enough, it results in inaccurate translations. This research work aims at addressing this shortcoming by performing postprocessing on OCR outputs primarily using Transformers such as BERT in a two-step pipeline to correct these mistakes and improve the quality of the document.

Type

Conference paper

Publication

2023 IEEE 3rd Mysore Sub Section International Conference (MysuruCon)

Click the Cite button above to demo the feature to enable visitors to import publication metadata into their reference management software.

Last updated on Dec 1, 2023

OCR Document Processing Transformers BERT

Natural Question Generation using Transformers and Reinforcement Learning Dec 1, 2022 →