Automatic extraction of citation information in Japanese patent applications |
| |
Authors: | Hidetsugu Nanba, Natsumi Anzen, Manabu Okumura |
| |
Affiliation: | (1) Faculty of Information Sciences, Hiroshima City University, 3-4-1 Ozukahigashi, Asaminamiku, Hiroshima 731-3194, Japan; (2) NEC System Technologies, 1-40-1 Tomo-minami, Asaminamiku, Hiroshima 731-3168, Japan; (3) Precision and Intelligence Laboratory, Tokyo Institute of Technology, 4259 Nagatsuta, Yokohama 226-8503, Japan |
| |
Abstract: | The need for academic researchers to retrieve patents and research papers is increasing, because applying for patents is now
considered an important research activity. However, retrieving patents using keywords is a laborious task for researchers,
because the terms used in patents, which are chosen to enlarge the scope of the claims, are generally more abstract than those
used in research papers. Therefore, we have constructed a framework that facilitates patent retrieval for researchers, and
have integrated research papers and patents by analysing the citation relationships between them. We obtained cited research
papers in patents using two steps: (1) detection of sentences containing bibliographic information, and (2) extraction of
bibliographic information from those sentences. To investigate the effectiveness of our method, we conducted two experiments.
In the experiment involving Step 1, we prepared 42,073 sentences, among which a human subject manually identified 1,476 sentences
containing citations of papers. For Step 2, we prepared 3,000 sentences, in which the titles, authors, and other bibliographic
information were manually identified. We obtained a precision of 91.6% and a recall of 86.9% in Step 1, and a precision of
86.2% and a recall of 85.1% in Step 2. Finally, we constructed an information retrieval system that provided two methods of
retrieving research papers and patents. One method was retrieval by query, and the other was retrieval through the citation relationships
between research papers and patents. |
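The two-step procedure and its evaluation can be illustrated with a minimal Python sketch. The abstract does not describe the actual detector or extractor, so the cue words, the regular expressions, the function names, and the English toy sentences below are all hypothetical stand-ins, not the authors' method; only the overall structure (detect citing sentences, extract bibliographic fields, score with precision and recall) follows the text.

```python
import re

# Hypothetical cue pattern standing in for Step 1 (sentence detection).
# The real system's features are not given in the abstract.
CITATION_CUE = re.compile(r"\b(?:Vol\.|pp\.|Proceedings|Journal)", re.IGNORECASE)

# Hypothetical pattern standing in for Step 2 (field extraction).
# Assumes the toy layout: A. Author, "Title", Venue, ..., Year
BIBLIO = re.compile(
    r'(?P<authors>[A-Z]\. [A-Z][a-z]+),\s*'
    r'"(?P<title>[^"]+)",\s*'
    r'(?P<venue>[^,]+),.*?'
    r'(?P<year>(?:19|20)\d{2})'
)


def detect_citation_sentences(sentences):
    """Step 1: keep sentences that contain a citation cue."""
    return [s for s in sentences if CITATION_CUE.search(s)]


def extract_bibliography(sentence):
    """Step 2: return bibliographic fields as a dict, or None if no match."""
    match = BIBLIO.search(sentence)
    return match.groupdict() if match else None


def precision_recall(predicted, gold):
    """Precision = TP / |predicted|, recall = TP / |gold|."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Toy data (invented for illustration only).
s1 = 'This is described in A. Smith, "Fast parsing", Journal of Examples, Vol. 3, pp. 1-10, 2001.'
s2 = "The device includes a housing and a lid."
s3 = "See the Proceedings volume for details."

sentences = [s1, s2, s3]
gold_citing = [s1]  # manually labelled citing sentences

detected = detect_citation_sentences(sentences)
precision, recall = precision_recall(detected, gold_citing)
fields = extract_bibliography(s1)
```

In this toy run the cue pattern also fires on `s3`, which contains a cue word but no actual citation, so precision falls below recall. This is the kind of trade-off that the reported Step 1 figures quantify on the 42,073-sentence test set.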
| |
Keywords: | Citation relationships; Information retrieval; Invalidity search; Scientometrics; Research paper patent |
This article is indexed in SpringerLink and other databases. |
|