Bitext Generation Through Rich Markup |
| |
Authors: | Arantza Casillas Raquel Martí nez |
| |
Affiliation: | (1) Departamento Electridad y Electrónica, Facultad de Ciencia y Tecnología, UPV-EHU, Spain |
| |
Abstract: | This paper reports on a method for exploiting a bitext as the primary linguistic information source for the design of a generation environment for specialized bilingual documentation. The paper discusses such issues as Text Encoding Initiative (TEI), proposals for specialized corpus tagging, text segmentation and alignment of translation units and their allocation into translation memories, Document Type Definition (DTD), abstraction from tagged texts, and DTD deployment for bilingual text generation. The parallel corpus used for experimentation has two main features: |
| |
Keywords: | alignment bilingual document generation bitext parallel corpus segmentation SGML TEI translation memories |
本文献已被 SpringerLink 等数据库收录! |
|