首页 | 本学科首页   官方微博 | 高级检索  
     


KP-Miner: A keyphrase extraction system for English and Arabic documents
Authors:Samhaa R. El-Beltagy  Ahmed Rafea
Affiliation:1. Faculty of Computers and Information, Computer Science Department, Cairo University, 5 Dr. Ahmed Zewail Street, 12613 Orman, Giza, Egypt;2. Computer Science Department, American University in Cairo, 113 Kasr El Aini St., PO Box 2511, Cairo 11511, Egypt
Abstract:Automatic keyphrase extraction has many important applications including but not limited to summarization, cataloging/indexing, feature extraction for clustering and classification, and data mining. This paper presents the KP-Miner system, and demonstrates through experimentation and comparison with widely used systems that it is effective and efficient in extracting keyphrases from both English and Arabic documents of varied length. Unlike other existing keyphrase extraction systems, the KP-Miner system does not need to be trained on a particular document set in order to achieve its task. It also has the advantage of being configurable as the rules and heuristics adopted by the system are related to the general nature of documents and keyphrases. This implies that the users of this system can use their understanding of the document(s) being input into the system to fine-tune it to their particular needs.
Keywords:Keyphrase extraction   Heuristic rules   Automatic indexing
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号