首页 | 本学科首页   官方微博 | 高级检索  
     

基于用户自描述标签的层次分类体系构建方法
引用本文:刘苏祺,白光伟,沈航.基于用户自描述标签的层次分类体系构建方法[J].计算机科学,2016,43(7):224-229, 239.
作者姓名:刘苏祺  白光伟  沈航
作者单位:南京工业大学计算机科学与技术学院 南京211816,南京工业大学计算机科学与技术学院 南京211816;南京理工大学高维信息智能感知与系统教育部重点实验室 南京210094,南京工业大学计算机科学与技术学院 南京211816
基金项目:本文受国家自然科学基金(60673185,7),江苏省自然科学基金(BK2010548),江苏省科技支撑计划(工业)(BE2011186),南京邮电大学宽带无线通信与传感网技术教育部重点实验室开放研究基金资助
摘    要:模式层知识对于语义万维网的发展非常重要,然而当前开放链接数据(LOD)中模式层知识的数量十分有限,为突破这一局限,提出一种基于社交网络中用户自描述标签的层次分类体系构建方法。该方法首先设计基于搜索引擎的标签分块算法,将描述相同话题的标签划分到同一标签块中,然后采用基于半监督学习的标签传播算法挖掘相同标签块中标签间的上下位关系,最后运用基于启发式规则的贪心算法来构建层次分类体系,从而在社交站点中构建出大规模且高质量的层次分类体系。实验结果表明,该构建方法与现有相关工作相比在准确率、召回率以及F值上均有明显提高。

关 键 词:模式层知识  用户自描述标签  层次分类体系  标签传播
收稿时间:2015/4/10 0:00:00
修稿时间:9/1/2015 12:00:00 AM

Taxonomy Construction Based on User Self-describing Tags
LIU Su-qi,BAI Guang-wei and SHEN Hang.Taxonomy Construction Based on User Self-describing Tags[J].Computer Science,2016,43(7):224-229, 239.
Authors:LIU Su-qi  BAI Guang-wei and SHEN Hang
Affiliation:School of Computer Science and Technology,Nanjing Tech University,Nanjing 211816,China,School of Computer Science and Technology,Nanjing Tech University,Nanjing 211816,China;Key Laboratory of Intelligent Perception and System for High-Dimensional Information of Ministry of Education of China,Nanjing University of Science and Technology,Nanjing 210094,China and School of Computer Science and Technology,Nanjing Tech University,Nanjing 211816,China
Abstract:Knowledge on schema level is vital for the development of semantic Web.However,the number of schema knowledge is limited in current linking open data (LOD).To optimize the issue,this paper proposed an approach for constructing a taxonomy using user self-describing tags in social network.This approach first designs a tag blocking algorithm based on search engine to partition tags into the same block,which describes the same topic.Then,it uses a label propagation algorithm based on the semi-supervised learning to detect hypernym relation between tags in the same block.Finally,it applies a greedy algorithm based on heuristic rules to construct a taxonomy.A large scale and high-quality taxonomy can be constructed after applying the proposed approach in social Web sites.The experimental results show that,compared with the existing related work,the proposed approach performs better in terms of precision,recall and F-score.
Keywords:Knowledge on schema level  User self-describing tags  Taxonomy  Label propagation
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号