A New Multiword Expression Metric and Its Applications |
| |
Authors: | Fan Bu, Xiao-Yan Zhu, Ming Li |
| |
Affiliation: | 1. State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University, Beijing, 100084, China; 2. David R. Cheriton School of Computer Science, University of Waterloo, Waterloo, N2L 3G1, Canada |
| |
Abstract: | Multiword expressions (MWEs) appear frequently, and often ungrammatically, in natural languages. Identifying MWEs in free text is a very challenging problem. This paper proposes a knowledge-free, unsupervised, and language-independent metric, the Multiword Expression Distance (MED). The new metric is derived from an accepted physical principle, measures the distance from an n-gram to its semantics, and outperforms other state-of-the-art methods on MWEs in two applications: question answering and named entity extraction. |
| |
Keywords: | |
This article is indexed in CNKI, Wanfang Data, SpringerLink, and other databases. |
|