首页 | 本学科首页   官方微博 | 高级检索  
     


Multivariate Analysis of Finnish Dialect Data An Overview of Lexical Variation
Authors:Hyvonen  Saara; Leino  Antti; Salmenkivi  Marko
Affiliation:Department of Computer Science, Helsinki Institute for Information Technology, University of Helsinki, P.O. Box 68, FI–00014, Finland
Abstract:During the process of writing a comprehensive dictionary ofFinnish dialects, a large set of maps describing the regionaldistribution of the dialect words have been compiled in electronicform. In this article, we set out to analyse this corpus ofdata in order to gain new insight on the variation of Finnishdialects. We use a wide range of multivariate data analysismethods, including principal components analysis, independentcomponents analysis, clustering, and multidimensional scaling.We explain how to preprocess the data to overcome the problemof uneven sampling caused by the way the data has been collected.We discuss the results obtained by these methods and comparethem to the traditional view of Finnish dialect groups.
Keywords:
本文献已被 Oxford 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号