Querying on large and complex databases by content: Challenges on variety and veracity regarding real applications期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Querying on large and complex databases by content: Challenges on variety and veracity regarding real applications

Abstract:	The amount and variety of digital data currently being generated, stored and analyzed, including images, videos, and time series, have brought challenges to data administrators, analysts and developers, who struggle to comply with the expectations of both data owners and end users. The majority of the applications demand searching complex data by taking advantage of queries that analyze different aspects of the data, and need the answers in a timely manner. Content-based similarity retrieval techniques are well-suited to handle large databases, because they enable performing queries and analyses using features automatically extracted from the data, without users’ intervention. In this paper, we review and discuss the challenges posed to the database and related communities in order to provide techniques and tools that can meet the variety and veracity characteristics of big and complex data, while also considering the aspects of semantical preservation and completeness of the data. Examples and results obtained over a two-decade-long experience with real applications are presented and discussed.

Keywords:	Similarity search Content-based image retrieval Feature extraction methods Bags-of-visual-words Missing data Big-data characteristics Variety Veracity
本文献已被 ScienceDirect 等数据库收录！