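Document AI: Datasets, Models and Applications (文档智能: 数据集、模型和应用)
Cite this article: CUI Lei, XU Yiheng, LYU Tengchao, WEI Furu. Document AI: Datasets, Models and Applications[J]. Journal of Chinese Information Processing (中文信息学报), 2022, 36(6): 1-19.
Authors: CUI Lei (崔磊)  XU Yiheng (徐毅恒)  LYU Tengchao (吕腾超)  WEI Furu (韦福如)
Affiliation: Natural Language Computing Group, Microsoft Research Asia, Beijing 100080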
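Abstract: Document AI refers to the process of automatically reading, understanding, and analyzing business documents by computer, and is an important research direction at the intersection of natural language processing and computer vision. In recent years, the spread of deep learning has greatly advanced the field, with significant performance gains on Document AI tasks such as document layout analysis, document information extraction, document visual question answering, and document image classification. This paper briefly introduces early document analysis techniques based on heuristic rules, algorithms based on statistical machine learning, and recent methods based on deep learning and pre-training, and looks ahead to future directions for Document AI technology.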

Keywords: Document AI  deep learning  multimodal natural language processing

Document AI: Benchmarks, Models and Applications
CUI Lei, XU Yiheng, LYU Tengchao, WEI Furu. Document AI: Benchmarks, Models and Applications[J]. Journal of Chinese Information Processing, 2022, 36(6): 1-19.
Authors: CUI Lei  XU Yiheng  LYU Tengchao  WEI Furu
Affiliation: Natural Language Computing Group, Microsoft Research Asia, Beijing 100080, China
Abstract: Document AI, or Document Intelligence, is a relatively new research topic that covers techniques for automatically reading, understanding, and analyzing business documents. It is an important interdisciplinary field spanning natural language processing and computer vision. In recent years, the widespread adoption of deep learning has greatly advanced Document AI tasks such as document layout analysis, document information extraction, document visual question answering, and document image classification. This paper briefly introduces early heuristic rule-based document analysis, algorithms based on statistical machine learning, and deep learning-based approaches, especially pre-training approaches. Finally, we look at future directions for Document AI.
Keywords: Document AI  deep learning  multimodal NLP