Implementing NLP projects for noncentral languages: instructions for funding bodies, strategies for developers |
| |
Authors: | Oliver Streiter Kevin P Scannell Mathias Stuflesser |
| |
Affiliation: | (1) Department of Western Languages and Literature, National University of Kaohsiung, 700 Kaohsiung University Road, Kaohsiung, 811, Taiwan, ROC;(2) Department of Mathematics and Computer Science, Saint Louis University, 220 N Grand Blvd, Saint Louis, MO 63103-2007, USA;(3) Institute for Specialised Communication and Multilingualism, European Academy Bozen/Bolzano, Drususallee/Viale Druso 1, 39100 Bolzano, Italy |
| |
Abstract: | This research begins by distinguishing a small number of “central” languages from the “noncentral languages”, where centrality
is measured by the extent to which a given language is supported by natural language processing tools and research. We analyse
the conditions under which noncentral language projects (NCLPs) and central language projects are conducted. We establish
a number of important differences which have far-reaching consequences for NCLPs. In order to overcome the difficulties inherent
in NCLPs, traditional research strategies have to be reconsidered. Successful styles of scientific cooperation, such as those
found in open-source software development or in the development of the Wikipedia, provide alternative views of how NCLPs might
be designed. We elaborate the concepts of free software and software pools and argue that NCLPs, in their own interests, should
embrace an open-source approach for the resources they develop and pool these resources together with other similar open-source
resources. The expected advantages of this approach are so important that we suggest that funding organizations put it as
sine qua non condition into project contracts.
All trademarks are hereby acknowledged. |
| |
Keywords: | Minority languages Open-source Free software Software pools |
本文献已被 SpringerLink 等数据库收录! |
|