Davarocr：用于OCR和多模式文档理解的工具箱

论文标题

Davarocr：用于OCR和多模式文档理解的工具箱

DavarOCR: A Toolbox for OCR and Multi-Modal Document Understanding

论文作者

Qiao, Liang, Jiang, Hui, Chen, Ying, Li, Can, Li, Pengfei, Li, Zaisheng, Zou, Baorui, Guo, Dashan, Xu, Yingda, Xu, Yunlu, Cheng, Zhanzhan, Niu, Yi

论文摘要

本文介绍了Davarocr，这是一种用于OCR和文档理解任务的开源工具箱。 Davarocr目前实施了19种高级算法，涵盖了9个不同的任务表。 Davarocr为每种算法提供了详细的使用说明和经过训练的模型。与以前的OpenSource OCR工具箱相比，Davarocr对文档理解的最先进技术的子任务具有相对完整的支持。为了促进OCR技术在学术界和行业中的开发和应用，我们更加关注使用不同的技术可以共享的模块的使用。 Davarocr在https://github.com/hikopensource/davar-lab-ocr上公开发行。

This paper presents DavarOCR, an open-source toolbox for OCR and document understanding tasks. DavarOCR currently implements 19 advanced algorithms, covering 9 different task forms. DavarOCR provides detailed usage instructions and the trained models for each algorithm. Compared with the previous opensource OCR toolbox, DavarOCR has relatively more complete support for the sub-tasks of the cutting-edge technology of document understanding. In order to promote the development and application of OCR technology in academia and industry, we pay more attention to the use of modules that different sub-domains of technology can share. DavarOCR is publicly released at https://github.com/hikopensource/Davar-Lab-OCR.

下载PDF全文

下载文献需遵守相关版权规定

论文标题