共查询到20条相似文献,搜索用时 0 毫秒
1.
基于转换的汉语基本名词短语识别模型 总被引:28,自引:10,他引:18
基本名词短语的识别在自然语言信息处理领域具有重要作用。本文首先从语言学的角度提出了汉语基本名词短语的概念,然后从语言信息处理的角度将用于基本名词短语识别的知识分为两部分,即表示基本名词短语句法组成的基本结构模板(静态知识)与表示基本名词短语出现的上下文环境特征的转换规则(动态知识)。在此基础上设计了一种基于转换的基本名词短语识别模型,该模型可同时结合这两类知识识别基本名词短语。实验结果显示了较高的识别正确率。 相似文献
2.
N-gram统计模型在机器翻译系统中的应用 总被引:2,自引:1,他引:2
文章提出了N-gram模型在机器翻译系统中的几个应用。模型是在语料库的基础上统计连续几个词的出现概率,以此来筛选翻译过程中的侯选元素,并可以对译文的语序进行纠正。由于此种方法是建立在语料库的基础之上的,从而具有真实可靠和实时等特点。实验表明,这种方法具有良好的性能,且与被处理的语言无关。 相似文献
3.
数据库自然语言查询接口可以使用户直接以日常生活中使用的自然语言提出查询请求,获取数据库中的信息。这是一个具有重要理论价值和巨大实用价值的研究领域。本文在总结数据库汉语查询的语言规律基础上,重点介绍了我们设计实现的一个数据库汉语查询接口系统WTCDIS,最后给出了对该系统进行初步测试的结果。 相似文献
4.
《中文信息学报》1999,13(6):27-34,61
Natural Language Query Interfaces to Database give users a more natural method to access databases. This research has significantly theoretical and practical value.In this paper , we first summarize some principles of Chinese language used in database query. Then we mainly explain the design and implementation of a natural language interface to database we have developed. Finally , preliminary empirical results from tests of this system are presented. 相似文献
5.
电子文本病历语料库可提供相关医学影像的定性诊断结果,但缺乏直观影像和文本标注信息,不利于有效管理医学数据和医科学生自主学习相关医学知识.针对此问题,文中提出基于深度水平集算法的医学影像分割方法,对医学影像进行自动分割,给出感兴趣区域的轮廓结果及相关定量指标,并结合自然语言处理方法实现电子病历文本的标注,增强影像与文本病历多模态语料库的信息表征能力.在青光眼影像数据上的实验表明,文中方法可精准分割眼底图像中视盘和视杯,有效构建具有直观影像标记与对应病历文本的多模态语料库. 相似文献
6.
《Expert systems with applications》2014,41(11):5495-5508
This paper describes an approach to assessing semantic annotation activities based on formal concept analysis (FCA). In this approach, annotators use taxonomical ontologies created by domain experts to annotate digital resources. Then, using FCA, domain experts are provided with concept lattices that graphically display how their ontologies were used during the semantic annotation process. In consequence, they can advise annotators on how to better use the ontologies, as well as how to refine these ontologies to better suit the needs of the semantic annotators. To illustrate the approach, we describe its implementation in @note, a Rich Internet Application (RIA) for the collaborative annotation of digitized literary texts, we exemplify its use with a case study, and we provide some evaluation results using the method. 相似文献
7.
如何在VB中利用API函数操作注册表 总被引:1,自引:0,他引:1
孔德川 《数字社区&智能家居》2005,(9):77-78
介绍了在VB程序设计中如何利用API函数对系统注册表进行访问和操作,并给出了相应的范例。 相似文献
8.
作为一项重要的基础资源,双语平行语料库在人工智能领域的研究起着举足轻重的作用。我国涉及少数民族语言的双语语料库的探索工作相对较少,本文介绍了汉维双语语料库的建立过程,并对语料的收集整理和组织做了初步的设想。 相似文献
9.
10.
旨在探索利用语言学手段来提高句法分析精度的可能性.采用MaltParser和自建的汉语依存树库进行相关汉语依存句法分析实验.通过对句法分析结果的分析,找出影响句法分析精度的主要因素,并据此对树库中处理某些语言结构的方式进行修改.然后再对得到的句法分析数据进行进一步分析,以确定所用方法的有效性.结果表明,无标记依存句法分析精度提高了5.5%,有标记依存句法分析精度提高了7.5%. 相似文献
11.
论述了一个多智能体并行设计环境中自然语言通信模块的设计和实现方法。通过建立一个受限词汇的综合信息库,记录词汇的各种语法、语义信息;采用扩充短语结构文法对句子结构进行分析,定义了句子生成模板,智能体借助该模板提供的接口函数生成自然语言句子。该设计方法建立在受限语言的基础上,突出了在协商式多智能体之间进行自然语言通信时的准确性。此外,为智能体提供了灵活、友善的接口,提高了多智能体并行设计环境的性能。 相似文献
12.
13.
14.
15.
本文所作的工作足将中医方剂进行自然语言处理,将文本形式的医案转换成结构化数据。然后,在通过合理而有效的算法将其转换成图的结构。从而为图挖掘技术提供可供使用的图集。 相似文献
16.
A Semantic Network of English: The Mother of All WordNets 总被引:1,自引:0,他引:1
Christiane Fellbaum 《Computers and the Humanities》1998,32(2-3):209-220
We give a brief outline of the design and contents of the English lexical database WordNet, which serves as a model for similarly conceived wordnets in several European languages. WordNet is a semantic network, in which the meanings of nouns, verbs, adjectives, and adverbs are represented in terms of their links to other (groups of) words via conceptual-semantic and lexical relations. Each part of speech is treated differently reflecting different semantic properties. We briefly discuss polysemy in WordNet, and focus on the case of meaning extensions in the verb lexicon. Finally, we outline the potential uses of WordNet not only for applications in natural language processing, but also for research in stylistic analyses in conjunction with a semantic concordance. 相似文献
17.
随着印尼语、马来语互联网普及率的上升,对海量印尼语、马来语文本进行信息处理存在重大需求.虽然研究人员对印尼语、马来语展开较广泛的研究,但是作为低资源语言,受到的关注远不及通用语,未能较好利用前沿的深度学习方法.文中梳理总结包括词法分析、句法分析、机器翻译、拼写检查等印尼语、马来语相关的自然语言处理技术.对比分析相关的研究成果发现,大多数研究因语料规模及评测标准不同难以客观对比各种算法的差异.最后结合印尼语、马来语现有的各领域语言资源开放情况,指出印尼语、马来语的自然语言处理研究面临的问题,并展望未来发展趋势. 相似文献
18.
This paper presents a framework for collecting and analysing large volume social media content. The real-time analytics framework comprises semantic annotation, Linked Open Data, semantic search, and dynamic result aggregation components. In addition, exploratory search and sense-making are supported through information visualisation interfaces, such as co-occurrence matrices, term clouds, treemaps, and choropleths. There is also an interactive semantic search interface (Prospector), where users can save, refine, and analyse the results of semantic search queries over time. Practical use of the framework is exemplified through three case studies: a general scenario analysing tweets from UK politicians and the public’s response to them in the run up to the 2015 UK general election, an investigation of attitudes towards climate change expressed by these politicians and the public, via their engagement with environmental topics, and an analysis of public tweets leading up to the UK’s referendum on leaving the EU (Brexit) in 2016. The paper also presents a brief evaluation and discussion of some of the key text analysis components, which are specifically adapted to the domain and task, and demonstrate scalability and efficiency of our toolkit in the case studies. 相似文献
19.
《Journal of Visual Languages and Computing》2014,25(6):695-702
The MADCOW annotation system supports a notion of group, facilitating focused annotations with respect to a domain. In previous work, we adopted ontologies to represent knowledge about domains, thus allowing more refined annotations to a group, and discussed how the use of ontologies facilitates the formulation of semantically significant queries for retrieving annotations on specific topics. We now expand on previous results and study two new types of measures to identify matches between users׳ interests and groups: Degree Centrality, developed for social networks to assess the quality of concepts in an ontology, and URL concordance, indicating the similarity of interests among users who annotate the same pages. 相似文献
20.
《Advanced Engineering Informatics》2014,28(1):28-36
In this article we report on validated research for the construction of design constraints by automated interpretation of natural language input. We show how our approach of dynamic reconfigurations of parsed syntax trees using a number of production rules is used to formalize and transform natural language constructs into computable constraints that are applied to concrete building information models. The calibration and validation of the proposed algorithms and production rules is based on two test data sets: The verbatim text of Dutch building code regulations for dormer extensions on existing roofs and constraints formulated ad hoc by design students based upon a series of example designs. We show how a prototypical implementation of our approach can be used to interpret 44% of the test data without human interference and how the remaining sentences can be interpreted with minimal additional effort or further development. 相似文献