Bitext Generation Through Rich Markup
Authors: Arantza Casillas, Raquel Martínez
Affiliation: (1) Departamento de Electricidad y Electrónica, Facultad de Ciencia y Tecnología, UPV-EHU, Spain
Abstract: This paper reports on a method for exploiting a bitext as the primary linguistic information source for the design of a generation environment for specialized bilingual documentation. The paper discusses such issues as Text Encoding Initiative (TEI), proposals for specialized corpus tagging, text segmentation and alignment of translation units and their allocation into translation memories, Document Type Definition (DTD), abstraction from tagged texts, and DTD deployment for bilingual text generation. The parallel corpus used for experimentation has two main features:
Keywords: alignment, bilingual document generation, bitext, parallel corpus, segmentation, SGML, TEI, translation memories
This article is indexed in SpringerLink and other databases.