首页 | 本学科首页   官方微博 | 高级检索  
     


HamleDT: Harmonized multi-language dependency treebank
Authors:Email author" target="_blank">Daniel?ZemanEmail author  Ond?ej?Du?ek  David?Mare?ek  Martin?Popel  Loganathan?Ramasamy  Jan??těpánek  Zdeněk??abokrtsky  Jan?Haji?
Affiliation:1.Faculty of Mathematics and Physics, úFAL,Charles University in Prague,Prague,Czech Republic
Abstract:We present HamleDT—a HArmonized Multi-LanguagE Dependency Treebank. HamleDT is a compilation of existing dependency treebanks (or dependency conversions of other treebanks), transformed so that they all conform to the same annotation style. In the present article, we provide a thorough investigation and discussion of a number of phenomena that are comparable across languages, though their annotation in treebanks often differs. We claim that transformation procedures can be designed to automatically identify most such phenomena and convert them to a unified annotation style. This unification is beneficial both to comparative corpus linguistics and to machine learning of syntactic parsing.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号