Multivariate Analysis of Finnish Dialect Data An Overview of Lexical Variation |
| |
Authors: | Hyvonen Saara; Leino Antti; Salmenkivi Marko |
| |
Affiliation: | Department of Computer Science, Helsinki Institute for Information Technology, University of Helsinki, P.O. Box 68, FI–00014, Finland |
| |
Abstract: | During the process of writing a comprehensive dictionary ofFinnish dialects, a large set of maps describing the regionaldistribution of the dialect words have been compiled in electronicform. In this article, we set out to analyse this corpus ofdata in order to gain new insight on the variation of Finnishdialects. We use a wide range of multivariate data analysismethods, including principal components analysis, independentcomponents analysis, clustering, and multidimensional scaling.We explain how to preprocess the data to overcome the problemof uneven sampling caused by the way the data has been collected.We discuss the results obtained by these methods and comparethem to the traditional view of Finnish dialect groups. |
| |
Keywords: | |
本文献已被 Oxford 等数据库收录! |
|