Language resources for Hebrew |
| |
Authors: | Alon Itai Shuly Wintner |
| |
Affiliation: | (1) Department of Computer Science, Technion, Israel Institute of Technology, 32000 Haifa, Israel;(2) Department of Computer Science, University of Haifa, 31905 Haifa, Israel |
| |
Abstract: | We describe a suite of standards, resources and tools for computational encoding and processing of Modern Hebrew texts. These
include an array of XML schemas for representing linguistic resources; a variety of text corpora, raw, automatically processed
and manually annotated; lexical databases, including a broad-coverage monolingual lexicon, a bilingual dictionary and a WordNet;
and morphological processors which can analyze, generate and disambiguate Hebrew word forms. The resources are developed under
centralized supervision, so that they are compatible with each other. They are freely available and many of them have already
been used for several applications, both academic and industrial.
|
| |
Keywords: | Language resources Hebrew Corpora Lexicon Morphological processing WordNet |
本文献已被 SpringerLink 等数据库收录! |
|