Topic categorization and representation of health community generated data
Authors: Maofu Liu, He Zhang, Huijun Hu, Wei Wei
Affiliation: 1. College of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan, China; 2. Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System, Wuhan, China; 3. School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan, China
Abstract: The representation and categorization of data released by professional health providers have been well investigated and implemented in practice, which has facilitated browsing, search, and high-order learning of health information. By contrast, there have been few corresponding studies on the representation and categorization of health community generated data. Such data are usually more complex, inconsistent, and ambiguous, and consequently raise challenges for data access and analytics. This paper explores various representations for health community generated data and categorizes the data by health topic. In addition, the work uses pseudo-labeled data to train the supervised topic categorization models, which makes the whole categorization process unsupervised and extensible to large-scale data. Extensive experiments on two real-world datasets yield findings on which representation approaches are informative and which categorization models are effective for health community generated data.
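The abstract does not say how the pseudo-labels are produced or which models are trained, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes pseudo-labels come from matching hypothetical seed keywords (`SEED_KEYWORDS`), and it swaps in a small bag-of-words Naive Bayes classifier as a stand-in for the supervised model. The topic names, keywords, and example posts are all invented for illustration.

```python
import math
from collections import Counter, defaultdict

# Hypothetical seed keywords per health topic (illustrative, not from the paper).
SEED_KEYWORDS = {
    "diabetes": {"insulin", "glucose", "diabetes"},
    "cardiology": {"heart", "chest", "cholesterol"},
}


def tokenize(text):
    """Lowercase and split on whitespace; a deliberately simple tokenizer."""
    return text.lower().split()


def pseudo_label(text):
    """Return the topic with the most seed-keyword hits, or None if no unique best topic."""
    tokens = tokenize(text)
    scores = {t: sum(tok in kws for tok in tokens) for t, kws in SEED_KEYWORDS.items()}
    best = max(scores, key=scores.get)
    top = scores[best]
    if top == 0 or list(scores.values()).count(top) > 1:
        return None  # no hit, or a tie: leave the post unlabeled
    return best


def train_naive_bayes(labeled):
    """Fit a multinomial Naive Bayes model on (text, topic) pairs."""
    doc_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, topic in labeled:
        doc_counts[topic] += 1
        for tok in tokenize(text):
            word_counts[topic][tok] += 1
            vocab.add(tok)
    return doc_counts, word_counts, vocab


def predict(model, text):
    """Return the most probable topic, using add-one smoothing over the training vocabulary."""
    doc_counts, word_counts, vocab = model
    total_docs = sum(doc_counts.values())
    best_topic, best_score = None, -math.inf
    for topic in doc_counts:
        score = math.log(doc_counts[topic] / total_docs)
        denom = sum(word_counts[topic].values()) + len(vocab)
        for tok in tokenize(text):
            if tok in vocab:  # words never seen in training are ignored
                score += math.log((word_counts[topic][tok] + 1) / denom)
        if score > best_score:
            best_topic, best_score = topic, score
    return best_topic


# Invented community posts; no human-assigned labels are used anywhere.
posts = [
    "my glucose is high even after insulin shots and a strict diet",
    "doctor raised my insulin dose because of my diet and glucose",
    "sharp chest pain and a racing pulse after climbing stairs",
    "my heart feels like it skips and my pulse is racing",
    "what is a good diet plan",  # no seed keyword, so it stays unlabeled
]

pseudo_labeled = [(p, pseudo_label(p)) for p in posts]
training_set = [(p, t) for p, t in pseudo_labeled if t is not None]
model = train_naive_bayes(training_set)
print(predict(model, "racing pulse at night"))
```

In this sketch the trained model can assign a topic to a post that contains no seed keyword at all, because it has learned co-occurring words such as "pulse" from the pseudo-labeled posts. That is the sense in which a supervised model trained on pseudo-labels can extend coverage without manual annotation.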
Keywords:
This article is indexed in SpringerLink and other databases.