首页 | 本学科首页   官方微博 | 高级检索  
     

基于句法语义依存分析的中文金融事件抽取
引用本文:万齐智,万常选,胡蓉,刘德喜. 基于句法语义依存分析的中文金融事件抽取[J]. 计算机学报, 2021, 44(3): 508-530. DOI: 10.11897/SP.J.1016.2021.00508
作者姓名:万齐智  万常选  胡蓉  刘德喜
作者单位:江西财经大学信息管理学院 南昌 330032;江西财经大学数据与知识工程江西省高校重点实验室 南昌 330013;江西财经大学信息管理学院 南昌 330032;江西财经大学数据与知识工程江西省高校重点实验室 南昌 330013;江西财经大学软件与物联网工程学院 南昌 330032;江西财经大学数据与知识工程江西省高校重点实验室 南昌 330013;江西财经大学信息管理学院 南昌 330032;江西财经大学数据与知识工程江西省高校重点实验室 南昌 330013
基金项目:江西省教育厅科学技术研究项目;本课题得到国家自然科学基金项目
摘    要:事件抽取在自然语言处理应用中扮演着重要的角色,如股票市场趋势预测.传统事件抽取较为关注触发词和论元所属类型的正确性,较少地结合应用需求去分析研究事件抽取效果及使用价值.在财经领域,事件作用对象及动作是关注的重点.因此,本文聚焦于金融事件,抽取三元组事件ET(Sub,Pred,Obj).在中文财经新闻中,存在大量事件嵌套...

关 键 词:中文事件抽取  核心动词链  句法语义依存分析图  事件语义关联  缺省补全

Chinese Financial Event Extraction Base on Syntactic and Semantic Dependency Parsing
WAN Qi-Zhi,WAN Chang-Xuan,HU Rong,LIU De--Xi. Chinese Financial Event Extraction Base on Syntactic and Semantic Dependency Parsing[J]. Chinese Journal of Computers, 2021, 44(3): 508-530. DOI: 10.11897/SP.J.1016.2021.00508
Authors:WAN Qi-Zhi  WAN Chang-Xuan  HU Rong  LIU De--Xi
Affiliation:(School of Information Technology,Jiangxi University of Finance and Economics,Nanchang 330032;School of Software and lnternet of Things Engineering,Jiangxi University of Finance and Ecomomics,Nanchang330032;Jiangri Key Labroratory of Data and Knowledge Enginering,Jiangxi University of Finance and Economics,Nanchang0330013)
Abstract:As a sub-task of information extraction,event extraction plays an important role in nature language process applications,such as stock market trend forecast,which can provide strong clues for events users,e.g.investors,managers and government,to analyze the market and make decisions.At present,most of the studies about event extraction pay more attention to the type correctness of triggers and arguments,and not consider the effect and value of event extraction based on application requirements.We call this type of event extraction traditional event extraction.The event types and standards in traditional event extraction are derived from ACE2005 containing 8 categories and 33 sub-categories,KBP2015 and ERE,et al.However,there are some limitations in application of them to event extraction in specific financial domain.For example,there is not the overweight event type in ACE2005,which is a special behavior in the financial domain.In this paper,we focus on the financial news and extract open events without types.In the field of finance and economics,most event users are more concerned with the objects and actions that events affect.Therefore,combined with the application requirement,we propose to extract the financial event ET(Sub,Pred,Obj),where Sub,Pred and Obj represent subject,predicate and object respectively.However,Chinese financial news generally suffers from the event nesting and component default problem,which result in event omission and key element missing of events.To tackle this issue,with the expression habits and characteristics of Chinese linguistics,we build a Chinese event extraction framework based on syntactic and semantic dependency parsing.Then summarize four common default structures and design corresponding completion rules.In particular,at the beginning of this paper,we summarize four prominent phenomena in the extraction of events from the headlines of financial news,and explore the cause of these problems,no in-depth analyzing the relevance of syntactic and semantic structure or lack of it.After that,we employ the syntactic dependency parsing tree and lexical structure,and propose the core verb chains,which make sure that each core verb corresponds to an event solving event leakage problem.Thirdly,we add semantic dependency relation between events on the basis of syntactic dependency tree,which is called Syntactic Semantic Dependency Parsing(SSDP)tree.In order to better separate the detected events and their properties,we adjust and optimize SSDP tree to form the SSDP graph,where the word nodes of the same syntactic structure are at the same level,providing a way for subsequent event extraction.Fourthly,with the division of default structure in linguistic,we summarize four common default structures and propose ten corresponding completion rules to solve the problem of component default.Meanwhile,the whole Chinese event extraction algorithm based SSDP graph is shown at the end of the section.Finally,this paper depicts a detailed experimental situation.The experimental dataset,labeling standard and evaluation index are given.Subsequently,the method in this paper is verified on two datasets,financial news titles and common field news titles.At the end,we conduct comprehensive benchmarks on Chinese financial news titles and CoNLL2009 Chinese Corpus.The experimental results show that the proposed methods are effective.
Keywords:Chinese event extraction  core verb chain  syntactic semantic dependency parsing graph  event semantics relevance  default complement
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号