Efficient document management with a minimum of errors requires storing electronic versions of documents and quickly digitizing printed ones. To improve the quality and speed of this process, the proposed approach automatically recognizes the key information in images of a printed document and uses it to create an electronic document template. The solution relies on neural network algorithms to recognize information from images.
Context: when processing a large flow of documents, a lot of time is spent on routine work, and the processing is hard to automate because a single document type can come in many different forms. Manual markup is slow.
Solution: a document recognition system built on in-house OCR and text detection models.
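As an illustration only, a two-stage pipeline of this kind (text detection followed by OCR on each detected region) could be wired together roughly as below. This is a minimal sketch, not the system's actual code: the names `Field`, `crop`, `recognize_document`, and `to_template` are hypothetical, the image is a plain list of pixel rows, and the detection and OCR models are passed in as ordinary callables so that real TensorFlow models could be substituted.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# Assumption: a text region is an axis-aligned box (left, top, right, bottom) in pixels.
Box = Tuple[int, int, int, int]
# Assumption: a grayscale image is represented as rows of pixel values.
Image = List[List[int]]


@dataclass
class Field:
    """One recognized piece of text and where it was found."""
    box: Box
    text: str


def crop(image: Image, box: Box) -> Image:
    """Cut the region described by `box` out of the image."""
    left, top, right, bottom = box
    return [row[left:right] for row in image[top:bottom]]


def recognize_document(
    image: Image,
    detect: Callable[[Image], List[Box]],   # hypothetical text detection model
    recognize: Callable[[Image], str],      # hypothetical OCR model
) -> List[Field]:
    """Run detection, then OCR on each region, in reading order."""
    fields = []
    # Sort boxes top to bottom, then left to right.
    for box in sorted(detect(image), key=lambda b: (b[1], b[0])):
        text = recognize(crop(image, box)).strip()
        if text:  # skip regions where OCR returned nothing
            fields.append(Field(box, text))
    return fields


def to_template(fields: List[Field]) -> Dict[str, object]:
    """Convert recognized fields into a JSON-serializable document template."""
    return {"fields": [{"box": list(f.box), "text": f.text} for f in fields]}
```

Passing the models in as callables keeps the pipeline logic testable without loading any weights; the resulting dictionary could then be returned as JSON from a web endpoint.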
Results:
The solution is integrated into customer business processes in the following sectors:
Telecom, Finance
Technology stack: TensorFlow, Python, Flask.