共查询到20条相似文献,搜索用时 578 毫秒
1.
We briefly discuss the origin and development of WordNet, a large lexical database for English. We outline its design and
contents as well as its usefulness for Natural Language Processing. Finally, we discuss crosslinguistic WordNets and complementary
lexical resources.
相似文献
2.
We have constructed a large scale and detailed database of lexical types in Japanese from a treebank that includes detailed
linguistic information. The database helps treebank annotators and grammar developers to share precise knowledge about the
grammatical status of words that constitute the treebank, allowing for consistent large-scale treebanking and grammar development.
In addition, it clarifies what lexical types are needed for precise Japanese NLP on the basis of the treebank. In this paper,
we report on the motivation and methodology of the database construction.
相似文献
3.
As part of a project to construct an interactive program which would encourage children to play with language by building
jokes, we developed a lexical database, starting from WordNet. To the existing information about part of speech, synonymy,
hyponymy, etc., we have added phonetic representations and phonetic similarity ratings for pairs of words/phrases.
相似文献
4.
Optimizing the production, maintenance and extension of lexical resources is one the crucial aspects impacting natural language
processing (NLP). A second aspect involves optimizing the process leading to their integration in applications. With this
respect, we believe that a consensual specification on monolingual, bilingual and multilingual lexicons can be a useful aid
for the various NLP actors. Within ISO, one purpose of Lexical Markup Framework (LMF, ISO-24613) is to define a standard for
lexicons that covers multilingual lexical data.
相似文献
5.
Thesauri and controlled vocabularies facilitate access to digital collections by explicitly representing the underlying principles
of organization. Translation of such resources into multiple languages is an important component for providing multilingual
access. However, the specificity of vocabulary terms in most thesauri precludes fully-automatic translation using general-domain
lexical resources. In this paper, we present an efficient process for leveraging human translations to construct domain-specific
lexical resources. This process is illustrated on a thesaurus of 56,000 concepts used to catalog a large archive of oral histories.
We elicited human translations on a small subset of concepts, induced a probabilistic phrase dictionary from these translations,
and used the resulting resource to automatically translate the rest of the thesaurus. Two separate evaluations demonstrate
the acceptability of the automatic translations and the cost-effectiveness of our approach.
相似文献
6.
Since the inception of the S enseval series there has been a great deal of debate in the word sense disambiguation (WSD) community on what the right sense distinctions
are for evaluation, with the consensus of opinion being that the distinctions should be relevant to the intended application.
A solution to the above issue is lexical substitution, i.e. the replacement of a target word in context with a suitable alternative
substitute. In this paper, we describe the English lexical substitution task and report an exhaustive evaluation of the systems
participating in the task organized at SemEval-2007. The aim of this task is to provide an evaluation where the sense inventory
is not predefined and where performance on the task would bode well for applications. The task not only reflects WSD capabilities,
but also can be used to compare lexical resources, whether man-made or automatically created, and has the potential to benefit
several natural-language applications.
相似文献
8.
Lexical cohesion refers to the reader-perceived unity of text achieved by the author’s usage of words with related meanings
(Halliday and Hasan, 1976). This article reports on an experiment with 22 readers aimed at finding lexical cohesive patterns in 10 texts. Although
there was much diversity in peoples’ answers, we identified a common core of the phenomenon, using statistical analysis of
agreement patterns and a validation experiment. The core data may now be used as a minimal test set for models of lexical
cohesion; we present an example suggesting that models based on mutually exclusive lexical chains will not suffice. In addition,
we believe that procedures for revealing and analyzing sub-group patterns of agreement described here may be applied to data
collected in other studies of comparable size.
相似文献
9.
Lexical cohesion refers to the reader-perceived unity of text achieved by the author’s usage of words with related meanings
(Halliday and Hasan, 1976). This article reports on an experiment with 22 readers aimed at finding lexical cohesive patterns in 10 texts. Although
there was much diversity in peoples’ answers, we identified a common core of the phenomenon, using statistical analysis of
agreement patterns and a validation experiment. The core data may now be used as a minimal test set for models of lexical
cohesion; we present an example suggesting that models based on mutually exclusive lexical chains will not suffice. In addition,
we believe that procedures for revealing and analyzing sub-group patterns of agreement described here may be applied to data
collected in other studies of comparable size.
相似文献
10.
Highly frequent and highly polysemous verbs, such as give, take, and make, pose a challenge to automatic lexical acquisition methods. These verbs widely participate in multiword predicates (such
as light verb constructions, or LVCs), in which they contribute a broad range of figurative meanings that must be recognized.
Here we focus on two properties that are key to the computational treatment of LVCs. First, we consider the degree of figurativeness
of the semantic contribution of such a verb to the various LVCs it participates in. Second, we explore the patterns of acceptability
of LVCs, and their productivity over semantically related combinations. To assess these properties, we develop statistical
measures of figurativeness and acceptability that draw on linguistic properties of LVCs. We demonstrate that these corpus-based
measures correlate well with human judgments of the relevant property. We also use the acceptability measure to estimate the
degree to which a semantic class of nouns can productively form LVCs with a given verb. The linguistically-motivated measures
outperform a standard measure for capturing the strength of collocation of these multiword expressions.
相似文献
11.
We describe a suite of standards, resources and tools for computational encoding and processing of Modern Hebrew texts. These
include an array of XML schemas for representing linguistic resources; a variety of text corpora, raw, automatically processed
and manually annotated; lexical databases, including a broad-coverage monolingual lexicon, a bilingual dictionary and a WordNet;
and morphological processors which can analyze, generate and disambiguate Hebrew word forms. The resources are developed under
centralized supervision, so that they are compatible with each other. They are freely available and many of them have already
been used for several applications, both academic and industrial.
相似文献
12.
PECOLE (Peer-to-pEer COLlaborative Environment) is a fully decentralized multimedia collaborative environment that supports
a wide range of collaborative multimedia applications, including chat, shared browsing, shared telepointer, multipoint-to-multipoint
audio/video conferencing and multilingual collaboration. PECOLE can intelligently run on very constrained resources, is highly
resilient, scalable and does not rely on dedicated servers. Instead, PECOLE is built upon a Peer-to-Peer (P2P) overlay network,
using SUN’s JXTA framework and SWT technology. In this paper, we present the architecture and implementation of PECOLE with
the performance results of the tests we conducted.
相似文献
13.
In this paper we present an application fostering the integration and interoperability of computational lexicons, focusing
on the particular case of mutual linking and cross-lingual enrichment of two wordnets, the ItalWordNet and Sinica BOW lexicons.
This is intended as a case-study investigating the needs and requirements of semi-automatic integration and interoperability
of lexical resources, in the view of developing a prototype web application to support the GlobalWordNet Grid initiative.
相似文献
14.
We present a study of using camera-phones and visual-tags to access mobile services. Firstly, a user-experience study is described in which participants were both observed learning to interact with a prototype mobile service and interviewed
about their experiences. Secondly, a pointing-device task is presented in which quantitative data was gathered regarding the speed and accuracy with which participants aimed and clicked
on visual-tags using camera-phones. We found that participants’ attitudes to visual-tag-based applications were broadly positive,
although they had several important reservations about camera-phone technology more generally. Data from our pointing-device
task demonstrated that novice users were able to aim and click on visual-tags quickly (well under 3 s per pointing-device
trial on average) and accurately (almost all meeting our defined speed/accuracy tradeoff of 6% error-rate). Based on our findings,
design lessons for camera-phone and visual-tag applications are presented.
相似文献
15.
We provide the complete record of methodology that let us evolve BrilliAnt, the winner of the Ant Wars contest. Ant Wars contestants are virtual ants collecting food on a grid board in the presence
of a competing ant. BrilliAnt has been evolved through a competitive one-population coevolution using genetic programming
and fitnessless selection. In this paper, we detail the evolutionary setup that lead to BrilliAnt’s emergence, assess its
direct and indirect human-competitiveness, and describe the behavioral patterns observed in its strategy.
相似文献
16.
This paper describes the simulated car racing competition that was arranged as part of the 2007 IEEE Congress on Evolutionary
Computation. Both the game that was used as the domain for the competition, the controllers submitted as entries to the competition
and its results are presented. With this paper, we hope to provide some insight into the efficacy of various computational
intelligence methods on a well-defined game task, as well as an example of one way of running a competition. In the process,
we provide a set of reference results for those who wish to use the simplerace game to benchmark their own algorithms. The paper is co-authored by the organizers and participants of the competition.
相似文献
17.
Listening to music on personal, digital devices whilst mobile is an enjoyable, everyday activity. We explore a scheme for
exploiting this practice to immerse listeners in navigation cues. Our prototype, ONTRACK, continuously adapts audio, modifying
the spatial balance and volume to lead listeners to their target destination. First we report on an initial lab-based evaluation
that demonstrated the approach’s efficacy: users were able to complete tasks within a reasonable time and their subjective
feedback was positive. Encouraged by these results we constructed a handheld prototype. Here, we discuss this implementation
and the results of field-trials. These indicate that even with a low-fidelity realisation of the concept, users can quite
effectively navigate complicated routes.
相似文献
18.
This article introduces the topic of “Multilingual language resources and interoperability”. We start with a taxonomy and
parameters for classifying language resources. Later we provide examples and issues of interoperatability, and resource architectures
to solve such issues. Finally we discuss aspects of linguistic formalisms and interoperability.
相似文献
19.
There are only a few ethical regulations that deal explicitly with robots, in contrast to a vast number of regulations, which
may be applied. We will focus on ethical issues with regard to “responsibility and autonomous robots”, “machines as a replacement
for humans”, and “tele-presence”. Furthermore we will examine examples from special fields of application (medicine and healthcare,
armed forces, and entertainment). We do not claim to present a complete list of ethical issue nor of regulations in the field
of robotics, but we will demonstrate that there are legal challenges with regard to these issues.
相似文献
20.
In this paper, we present an Inverse Multi-Objective Robust Evolutionary (IMORE) design methodology that handles the presence
of uncertainty without making assumptions about the uncertainty structure. We model the clustering of uncertain events in
families of nested sets using a multi-level optimization search. To reduce the high computational costs of the proposed methodology
we proposed schemes for (1) adapting the step-size in estimating the uncertainty, and (2) trimming down the number of calls
to the objective function in the nested search. Both offline and online adaptation strategies are considered in conjunction
with the IMORE design algorithm. Design of Experiments (DOE) approaches further reduce the number of objective function calls
in the online adaptive IMORE algorithm. Empirical studies conducted on a series of test functions having diverse complexities
show that the proposed algorithms converge to a set of Pareto-optimal design solutions with non-dominated nominal and robustness
performances efficiently.
相似文献
|