Using structural similarity for clustering XML documents
Authors: Ali Aïtelhadj, Mohand Boughanem, Mohamed Mezghiche, Fatiha Souam
Affiliation: 1. Mouloud Mammeri University of Tizi-Ouzou, Tizi-Ouzou, Algeria
2. IRIT, Paul Sabatier University of Toulouse, Toulouse, France
3. M'hamed Bougara University of Boumerdes, Boumerdes, Algeria
Abstract: In this paper, we describe a method for clustering XML documents. Its goal is to group documents that share similar structures. Our approach has two steps. We first automatically extract the structure of each XML document to be clustered. The extracted structure then serves as the representation model used to assign the document to a cluster. The idea behind the clustering is that XML documents sharing similar structures are more likely to match the structural part of the same query. Finally, we tested our algorithms on both real data (the ACM SIGMOD Record corpus) and synthetic data. The results demonstrate the value of our approach.
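The two steps described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the structure is represented or how similarity is measured, so everything here is an assumption made for illustration: the structure is taken to be the set of root-to-element tag paths, similarity is the Jaccard index over those sets, and the grouping is a simple greedy single-pass scheme. The names `tag_paths`, `jaccard`, `cluster`, and the `threshold` parameter are hypothetical and are not taken from the paper.

```python
import xml.etree.ElementTree as ET


def tag_paths(xml_text):
    """Step 1 (assumed representation): the set of root-to-element tag paths.

    Text content and attributes are ignored; only element nesting is kept.
    """
    root = ET.fromstring(xml_text)
    paths = set()

    def walk(node, prefix):
        path = prefix + "/" + node.tag
        paths.add(path)
        for child in node:
            walk(child, path)

    walk(root, "")
    return paths


def jaccard(a, b):
    """Assumed similarity: Jaccard index of two path sets (1.0 if both empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def cluster(docs, threshold=0.5):
    """Step 2 (assumed scheme): greedy single-pass clustering.

    Each document joins the first cluster whose representative (the path set
    of its first member) is at least `threshold` similar; otherwise it starts
    a new cluster. Returns lists of document indices.
    """
    clusters = []  # list of (representative_paths, member_indices)
    for i, text in enumerate(docs):
        paths = tag_paths(text)
        for rep, members in clusters:
            if jaccard(paths, rep) >= threshold:
                members.append(i)
                break
        else:
            clusters.append((paths, [i]))
    return [members for _, members in clusters]


# Toy documents invented for this example (not from the SIGMOD Record corpus).
docs = [
    "<article><title>A</title><author>X</author></article>",
    "<article><title>B</title><author>Y</author><author>Z</author></article>",
    "<issue><volume>1</volume><number>2</number></issue>",
]
groups = cluster(docs)
```

In this toy input the two `article` documents have identical path sets and end up in the same group, while the `issue` document forms its own group. A set of paths discards repetition and sibling order, which a tree-based representation would preserve; the choice here only keeps the sketch short.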
This article is indexed in SpringerLink and other databases.