13Deep Learning Framework for Detecting, Classifying, and Recognizing Invoice Metadata

Nhat Quang Doan1, Van Tang Nguyen2*, Anh Tuan Giang1, Van Trung Doan3 and Dang Bui Hai4

1University of Science and Technology of Hanoi, Vietnam Academy of Science and Technology, Hanoi, Vietnam

2Faculty of Technology and Data Science Hanoi Foreign Trade University, Hanoi, Vietnam

3Faculty of Information Technology, Hanoi University of Industry, Hanoi, Vietnam

4KSE Software JSC, Hanoi, Vietnam

Abstract

In general, images or invoice images, are rich information sources; however, processing large amounts of invoices in finance requires extensive human work and material resources. The demand for a technological tool to process invoices is essential. Therefore, this study proposes a framework based on deep learning techniques for recognizing specific invoice information (or metadata). The deep learning model used in this research includes the classic architecture of Convolutional Neural Networks and its other variations, such as VGG-16 and Faster R-CNN. The complete process includes image processing series, model training for field detection and classification, and optical character recognition (OCR) for the detected fields. The proposed framework collects relevant information from invoice images and converts them into a text format to facilitate invoice management and transform them into structured data. The framework performance shows a significant value in recognizing invoice metadata compared ...

Get Creative Approaches Towards Development of Computing and Multidisciplinary IT Solutions for Society now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.