共查询到20条相似文献,搜索用时 0 毫秒
1.
2.
张翔;何世柱;张元哲;刘康;赵军 《中文信息学报》2024,38(12):1-17
语义是自然语言理解的核心研究对象,让机器掌握语义有多种途径,因此产生了不同的语义表示方法。但是,这些不同的途径之间往往缺乏联系,分别散落于不同的研究领域和研究任务中,如知识库问答中的查询图和SparQL、表格问答中的SQL、句子语义分析中的框架语义和AMR图等。虽然形式相近,但相关研究却缺乏协同。随着研究深入,语义表示之间缺少对比、具体任务中语义表示难以选择且性能差异大等缺点也显露出来。为了缓解这个问题,该文综述了常见于各类任务中的语义表示,并以世界和语言的关系为主线将它们重新划分为“外延世界语义表示”和“语言内部语义表示”两大类。总结了前者的技术研究热点及新型语义表示的设计和对比方法,并简单探讨了近期围绕后者是否真正含有语义的辩论。最后,该文综述了结合外延和内部两类语义表示的研究,发现这类方法有较强潜力。 相似文献
3.
The Proper Place of Men and Machines in Language Translation 总被引:1,自引:0,他引:1
Martin Kay 《Machine Translation》1997,12(1-2):3-23
The only way in which the power of computers has been brought to bear on the problem of language translation is machine translation, that is, the automation of the entire process. Machine translation is an excellent research vehicle but stands no chance of filling actual needs for translation which are growing at a great rate. In the quarter century during which work on machine translation has been going on, there has been considerable progress in relevant areas of computer science. However, advances in linguistics, important though they may have been, have not touched the core of this problem. The proper thing to do is therefore to adopt the kinds of solution that have proved successful in other domains, namely to develop cooperative man–machine systems. This paper proposes a translator's amanuensis, incorporating into a word processor some simple facilities peculiar to translation. Gradual enhancements of such a system could eventually lead to the original goal of machine translation. 相似文献
4.
5.
6.
7.
In knowledge-based machine translation (KBMT), the lexicon can be specified and acquired only in close connection with the specification and acquisition of the world model (ontology) and the specification of the text meaning representation (interlingua) language. The former supplies the atoms for the specification of text meaning and provides world knowledge to support the inference processes necessary for a variety of disambiguation and meaning assignment operations. The latter is necessary for the formulation of the semantic zone of the lexicon entries, which can be viewed as containing the static building blocks of the text meaning representation. This is the view taken in the Mikrokosmos KBMT project. 相似文献
8.
In the 1960s, the researchers of Harbin Institute of Technology (HIT) attempted to do relevant research on natural language
processing. With more than 40-year’s effort, HIT has already established three research laboratories for Chinese information
processing, i.e. the Machine Intelligence and Translation Laboratory (MI&T Lab), the Intelligent Technology and Natural Language
Processing Laboratory (ITNLP) and the Information Retrieval Laboratory (IR-Lab). At present, it has a well-balanced research
team of over 200 persons, and the research interests have extended to language processing, machine translation, text retrieval
and other fields. Harbin Institute of Technology has accumulated a batch of key techniques and data resources, won many prizes
in the technical evaluations at home and abroad. Harbin Institute of Technology has become one of the most important natural
language processing bases for teaching and scientific research in China now. This paper gives an introduction to the achievements
on NLP in HIT. 相似文献
9.
陈潇;杨雅婷;董瑞;时现伟;马博;吐尔洪·吾司曼 《中文信息学报》2025,39(2):80-88
针对当前多语言零样本翻译的标签策略难以提供丰富的翻译方向信息和模型对语言建模能力的不足,进而导致翻译脱靶的问题,该文提出了使用标签句子重构的多语言零样本神经机器翻译方法。具体而言,首先提出了一种词级别的标签策略,在编码器端增加目标语言标签嵌入,在源语言句子每个词的嵌入表示中注入目标语言信息;其次设计了标签句子重构任务,对句子添加噪声和使用词级别的标签策略后进行重构操作,以达到增强模型语言建模能力的目的。在MultiUN数据集和Europarl数据集上的零样本实验结果表明,所提方法在零样本翻译上的平均BLEU值分别超过强基线0.7和0.3。消融实验结果表明,所提出的语言标签嵌入策略和标签句子重构均能有效提升模型的零样本翻译性能。 相似文献
10.
一种基于实例的汉英机器翻译策略 总被引:3,自引:0,他引:3
介绍了一种基于实例的汉英机器翻译策略,重点讨论了汉英双语语料库的设计和基于该语料库的汉语句子的匹配算法。在进行汉语句子的匹配时,根据汉语的特点直接采用汉字的匹配,而没有进行汉语句子的分词。另外,匹配时确定匹配片断的边界也是基于实例机器翻译的难点之一,在这方面也采取了相应的解决方法。没有对翻译句子的连接装配进行更深入的研究,这是因为该翻译策略是用于多翻译引擎系统的,它要与其它翻译策略配合使用,以提高翻译结果的正确率。基于实例的机器翻译需要大量的双语语料库作为翻译时的依据,而人工建设大型语料库费时费力,所以尝试采用计算机进行汉英双语语料库的自动建立,包括篇章对齐和单词级的对齐。 相似文献
11.
Per-Kristian Halvorsen 《Computational Intelligence》1986,2(1):54-62
This paper reevaluates some of the contributions of Montague grammar in view of the increasing importance of computational considerations in linguistic theory and the demand for linguistic theories that can provide support in the design of natural language systems. It also considers Montague grammar in relation to work on lexical semantics and semantic nets. In this perspective the techniques of Montague grammar for systematically linking syntactic form and a model-theoretic semantics emerge as the most significant feature of the theory, while a number of the specific semantic assumptions recede in importance. Yet, with different ways of thinking about the structure mapping between levels of linguistic form and interpretations (e.g., constraint systems), we can also implement this connection using different techniques from what Montague had at his disposal. 相似文献
12.
AMR(抽象语义表示)是国际上一种新的句子语义表示方法,有着接近于中间语言的表示能力,其研发者已经建立了英文《小王子》等AMR语料库。AMR与以往的句法语义表示方法的最大不同在于两个方面,首先采用图结构来表示句子的语义;其次允许添加原句之外的概念节点来表示隐含的语义。该文针对汉语特点,在制定中文AMR标注规范的基础上,标注完成了中文版《小王子》的AMR语料库,标注一致性的Smatch值为0.83。统计结果显示,英汉双语含图结构句子具有很高的相关性,且含有图的句子比例高达40%左右,额外添加的概念节点则存在较大差异。最后讨论了AMR在汉语句子语义表示以及跨语言对比方面的优势。 相似文献
13.
汉英机器翻译中基于实例的歧义结构消解 总被引:1,自引:0,他引:1
歧义是自然语言特别是汉语的显著特点和普遍现象,也是当前汉英机器翻译系统的主要处理难点之一。通过对其中一些常见汉语歧义结构的分析,提出一种基于实例的歧义结构消解方法。由于歧义结构的对应实例具有较高的“结构”上的代表性,通过与这些实例的相似性比较可以较准确地把握待消歧语段的内部结构。 相似文献
14.
15.
16.
17.
18.
机器译文自动评价是机器翻译中的一个重要任务。针对目前译文自动评价中完全忽略源语言句子信息,仅利用人工参考译文度量翻译质量的不足,该文提出了引入源语言句子信息的机器译文自动评价方法: 从机器译文与其源语言句子组成的二元组中提取描述翻译质量的质量向量,并将其与基于语境词向量的译文自动评价方法利用深度神经网络进行融合。在WMT-19译文自动评价任务数据集上的实验结果表明,该文所提出的方法能有效增强机器译文自动评价与人工评价的相关性。深入的实验分析进一步揭示了源语言句子信息在译文自动评价中发挥着重要作用。 相似文献
19.
20.
基于语义神经网络的汉语表层语义分析 总被引:1,自引:0,他引:1
本文提出一种基于语义神经网络的汉语表层语义分析方法。该方法将符号主义和连接主义结合在一起。文中给出了语义神经网络的定义、构造,以及应用于汉语表层语义分析的算法。 相似文献