Another Perspective on Vocabulary Richness |
| |
Authors: | David L Hoover |
| |
Affiliation: | (1) New York University, 19 University Place, New York, NY 10003, USA |
| |
Abstract: | This article examines the usefulness ofvocabulary richness for authorship attributionand tests the assumption that appropriatemeasures of vocabulary richness can capture anauthor's distinctive style or identity. Afterbriefly discussing perceived and actualvocabulary richness, I show that doubling andcombining texts affects some measures incomputationally predictable but conceptuallysurprising ways. I discuss some theoretical andempirical problems with some measures anddevelop simple methods to test how wellvocabulary richness distinguishes texts bydifferent authors. These methods show thatvocabulary richness is ineffective for largegroups of texts because of the extremevariability within and among them. I concludethat vocabulary richness is of marginal valuein stylistic and authorship studies because thebasic assumption that it constitutes awordprint for authors is false. |
| |
Keywords: | authorship attribution lexical statistics stylistics vocabulary richness |
本文献已被 SpringerLink 等数据库收录! |