Automatic extraction of citation information in Japanese patent applications
Authors: Hidetsugu Nanba, Natsumi Anzen, Manabu Okumura
Affiliation: (1) Faculty of Information Sciences, Hiroshima City University, 3-4-1 Ozukahigashi, Asaminamiku, Hiroshima 731-3194, Japan; (2) NEC System Technologies, 1-40-1 Tomo-minami, Asaminamiku, Hiroshima 731-3168, Japan; (3) Precision and Intelligence Laboratory, Tokyo Institute of Technology, 4259 Nagatsuta, Yokohama 226-8503, Japan
Abstract: Academic researchers increasingly need to retrieve patents as well as research papers, because applying for patents is now considered an important research activity. However, retrieving patents by keyword is laborious for researchers, because patents, in order to enlarge the scope of their claims, generally use more abstract terms than research papers do. We have therefore constructed a framework that facilitates patent retrieval for researchers, and have integrated research papers and patents by analysing the citation relationships between them. We obtained the research papers cited in patents in two steps: (1) detection of sentences containing bibliographic information, and (2) extraction of bibliographic information from those sentences. To investigate the effectiveness of our method, we conducted two experiments. For Step 1, we prepared 42,073 sentences, among which a human subject manually identified 1,476 sentences containing citations of papers. For Step 2, we prepared 3,000 sentences, in which the titles, authors, and other bibliographic information were manually identified. We obtained a precision of 91.6% and a recall of 86.9% in Step 1, and a precision of 86.2% and a recall of 85.1% in Step 2. Finally, we constructed an information retrieval system that provides two methods of retrieving research papers and patents: one is retrieval by query, and the other follows the citation relationships between research papers and patents.
Keywords: Citation relationships; Information retrieval; Invalidity search; Scientometrics; Research paper; Patent
This article is indexed in SpringerLink and other databases.
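
The abstract reports precision and recall for both steps. As a reminder of how these two measures are usually computed against a manually labelled set, here is a minimal sketch. The function name `precision_recall` and the toy identifiers are hypothetical illustrations; they are not taken from the paper and do not reproduce its figures.

```python
def precision_recall(predicted, gold):
    """Return (precision, recall) for two collections of item identifiers.

    predicted: items the system flagged (e.g. sentence IDs judged to cite a paper)
    gold:      items a human annotator flagged
    """
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    # Precision: fraction of flagged items that are correct.
    precision = true_positives / len(predicted) if predicted else 0.0
    # Recall: fraction of correct items that were flagged.
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical toy data (NOT from the paper): sentence IDs only.
gold_ids = {1, 2, 3, 4}
predicted_ids = {2, 3, 4, 5, 6}
p, r = precision_recall(predicted_ids, gold_ids)
print(f"precision={p:.2f} recall={r:.2f}")
```

The same function applies to either step: sentence IDs for Step 1, or extracted bibliographic fields for Step 2, provided each item can be given a comparable identifier.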