
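Research on evaluation of semantic accessibility scale in Japanese text
Cite this article:DU Jia-li (杜家利), YU Ping-fang (于屏方). Research on evaluation of semantic accessibility scale in Japanese text[J]. Computer Engineering and Applications (计算机工程与应用), 2009, 45(23): 137-139. DOI: 10.3778/j.issn.1002-8331.2009.23.038
Authors:DU Jia-li (杜家利)  YU Ping-fang (于屏方)
Affiliation:School of Foreign Languages and School of Chinese Language and Literature, Ludong University, Yantai, Shandong 264025, China
Funding:National Social Science Fund of China project; Major Project of the Key Research Institutes of Humanities and Social Sciences, Ministry of Education; Shandong Provincial Social Science Planning project
Abstract:This corpus-based study of the Semantic Accessibility Scale (SAS) of agglutinative-language text, using Japanese, proceeds in three steps. First, 『ゆきぐに』 is taken as the text for analysis, and six comparison groups are obtained by equidistant systematic random sampling. Second, building on SAS research for inflecting languages, a definition of word length suited to agglutinative-language text is proposed: the number of words of five or more morae per 100 words is taken as the count of uncommon words. Finally, two conclusions are drawn. As the sampling interval decreases, the Sampling Ratio (SR) increases. Plots of the steadily rising SR against an SAS that fluctuates around its mean show that the two are unrelated, which supports, with a worked case, the applicability of the SAS evaluation formula for inflecting languages to agglutinative-language text.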

Keywords:agglutinative language  information retrieval  corpus  semantic accessibility scale  sampling ratio
Received:2008-12-16
Revised:2009-02-23

Research on evaluation of semantic accessibility scale in Japanese text
DU Jia-li, YU Ping-fang. Research on evaluation of semantic accessibility scale in Japanese text[J]. Computer Engineering and Applications, 2009, 45(23): 137-139. DOI: 10.3778/j.issn.1002-8331.2009.23.038
Authors:DU Jia-li  YU Ping-fang
Affiliation:School of Chinese Language and Literature, School of Foreign Languages, Ludong University, Yantai, Shandong 264025, China
Abstract:This study of the Semantic Accessibility Scale (SAS) for agglutinative-language text, based on a Japanese corpus, proceeds in three steps. First, 『ゆきぐに』 is extracted from the corpus as the text for analysis, and six comparison groups are obtained by equidistant systematic random sampling. Second, the definition of word length used in the SAS formula already verified for inflecting languages is adapted to agglutinative languages: a word of five or more morae is treated as an uncommon word, and the number of such words per 100 words is taken as the word-length measure. Finally, two conclusions are drawn: reducing the sampling interval raises the Sampling Ratio (SR), and plots of the steadily rising SR against an SAS that fluctuates around its mean show that SR and SAS are unrelated. In short, the case supports the applicability of the SAS evaluation formula, developed for inflecting-language text, to agglutinative-language text.
Keywords:agglutinative language  information processing  corpus  Semantic Accessibility Scale(SAS)  Sampling Ratio(SR)
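
The two measurable quantities named in the abstract, the number of words of five or more morae per 100 words and the sampling ratio under equidistant systematic sampling, can be illustrated with a minimal Python sketch. It rests on several assumptions that the abstract does not state: words are already segmented and written in kana, SR is taken to be sampled units divided by total units, and all function names and the sample intervals are hypothetical. The SAS formula itself is not given in the abstract, so it is not reproduced here.

```python
import random

# Small kana merge with the preceding kana and add no mora of their own
# (e.g. きょ is one mora). っ, ん and ー each count as a full mora.
SMALL_KANA = set("ゃゅょぁぃぅぇぉゎャュョァィゥェォヮ")


def count_morae(kana_word):
    """Count morae in a word given in kana (hiragana or katakana)."""
    return sum(1 for ch in kana_word if ch not in SMALL_KANA)


def long_words_per_hundred(kana_words, threshold=5):
    """Words with `threshold` or more morae, scaled to a 100-word basis."""
    if not kana_words:
        raise ValueError("kana_words must not be empty")
    long_count = sum(1 for w in kana_words if count_morae(w) >= threshold)
    return 100 * long_count / len(kana_words)


def systematic_sample(units, interval, start=None):
    """Equidistant systematic sampling: take every `interval`-th unit.

    If `start` is None, a random offset in [0, interval) is drawn.
    """
    if interval < 1:
        raise ValueError("interval must be >= 1")
    if start is None:
        start = random.randrange(interval)
    return units[start::interval]


def sampling_ratio(units, interval, start=0):
    """Assumed definition of SR: sampled units divided by total units."""
    return len(systematic_sample(units, interval, start)) / len(units)


if __name__ == "__main__":
    words = ["ゆきぐに", "コンピューター", "きょう", "がっこう"]
    print([count_morae(w) for w in words])  # [4, 6, 2, 4]
    print(long_words_per_hundred(words))    # 25.0

    units = list(range(60))  # stand-in for 60 text units (hypothetical)
    for interval in (6, 5, 4, 3, 2, 1):     # shrinking interval, rising SR
        print(interval, sampling_ratio(units, interval))
```

Real Japanese text mixes kanji with kana, so a morphological analyzer that returns kana readings would be needed before `count_morae` could be applied; that step is outside this sketch.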
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.