Random indexing of multidimensional data
Authors: Fredrik Sandin, Blerim Emruli, Magnus Sahlgren
Affiliation: 1. EISLAB, Luleå University of Technology, Luleå, Sweden; 2. SICS Swedish ICT, Västerås, Sweden; 3. SICS Swedish ICT, Kista, Sweden
Abstract:Random indexing (RI) is a lightweight dimension reduction method, which is used, for example, to approximate vector semantic relationships in online natural language processing systems. Here we generalise RI to multidimensional arrays and therefore enable approximation of higher-order statistical relationships in data. The generalised method is a sparse implementation of random projections, which is the theoretical basis also for ordinary RI and other randomisation approaches to dimensionality reduction and data representation. We present numerical experiments which demonstrate that a multidimensional generalisation of RI is feasible, including comparisons with ordinary RI and principal component analysis. The RI method is well suited for online processing of data streams because relationship weights can be updated incrementally in a fixed-size distributed representation, and inner products can be approximated on the fly at low computational cost. An open source implementation of generalised RI is provided.
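To make the idea in the abstract concrete, the following is a minimal sketch of a two-dimensional variant, assuming sparse ternary index vectors (a few entries set to +1 or -1, the rest zero) and a fixed-size state array updated by outer products. It is not the authors' open source implementation; the names `index_vector`, `RandomIndex2D`, `add` and `get`, as well as the default sizes, are hypothetical and chosen only for illustration.

```python
import numpy as np


def index_vector(key, dim, nonzeros, seed=0):
    """Sparse ternary index vector for a non-negative integer key.

    Exactly `nonzeros` entries are +1 or -1; all others are 0.
    The vector is regenerated from (seed, key), so it never needs to be stored.
    """
    rng = np.random.default_rng([seed, key])
    positions = rng.choice(dim, size=nonzeros, replace=False)
    signs = rng.choice([-1.0, 1.0], size=nonzeros)
    vec = np.zeros(dim)
    vec[positions] = signs
    return vec


class RandomIndex2D:
    """Fixed-size state array approximating a large matrix of weights (assumed design)."""

    def __init__(self, dims=(200, 200), nonzeros=8, seed=0):
        self.dims = dims
        self.nonzeros = nonzeros
        self.seed = seed
        self.state = np.zeros(dims)

    def _outer(self, i, j):
        # Outer product of the index vectors of the two indices.
        r_i = index_vector(i, self.dims[0], self.nonzeros, self.seed)
        r_j = index_vector(j, self.dims[1], self.nonzeros, self.seed + 1)
        return np.outer(r_i, r_j)

    def add(self, i, j, weight=1.0):
        # Incremental update: the state size does not depend on i or j.
        self.state += weight * self._outer(i, j)

    def get(self, i, j):
        # Approximate decoding: inner product with the outer product,
        # normalised by the number of nonzero entries it contains.
        return float(np.sum(self.state * self._outer(i, j))) / self.nonzeros ** 2


ri = RandomIndex2D()
ri.add(3, 7, weight=3.0)
print(ri.get(3, 7))
```

With a single stored weight the decoded value is exact; as more weights are added, the overlapping index vectors introduce noise, so decoded values become approximations whose accuracy depends on the state size and the number of nonzero entries.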
This article is indexed in SpringerLink and other databases.