共查询到20条相似文献,搜索用时 15 毫秒
2.
This paper proposes a technique for the detection of head nod and shake gestures based on eye tracking and head motion decision. The eye tracking step is divided into face detection and eye location. Here, we apply a motion segmentation algorithm that examines differences in moving people’s faces. This system utilizes a Hidden Markov Model-based head detection module that carries out complete detection in the input images, followed by the eye tracking module that refines the search based on a candidate list provided by the preprocessing module. The novelty of this paper is derived from differences in real-time input images, preprocessing to remove noises (morphological operators and so on), detecting edge lines and restoration, finding the face area, and cutting the head candidate. Moreover, we adopt a K-means algorithm for finding the head region. Real-time eye tracking extracts the location of eyes from the detected face region and is performed at close to a pair of eyes. After eye tracking, the coordinates of the detected eyes are transformed into a normalized vector of x-coordinate and y-coordinate. Head nod and shake detector uses three hidden Markov models (HMMs). HMM representation of the head detection can estimate the underlying HMM states from a sequence of face images. Head nod and shake can be detected by three HMMs that are adapted by a directional vector. The directional vector represents the direction of the head movement. The vector is HMMs for determining neutral as well as head nod and shake. These techniques are implemented on images, and notable success is notified. 相似文献
3.
Wireless Visual Sensor Networks (WVSNs) have gained significant importance in the last few years and have emerged in several distinctive applications. The main aim is to design low power WVSN surveillance application using adaptive Compressive Sensing (CS) which is expected to overcome the WVSN resource constraints such as memory limitation, communication bandwidth and battery constraints. In this paper, an adaptive block CS technique is proposed and implemented to represent the high volume of captured images in a way for energy efficient wireless transmission and minimum storage. Furthermore, to achieve energy-efficient target detection and tracking with high detection reliability and robust tracking, to maximize the lifetime of sensor nodes as they can be left for months without any human interactions. Adaptive CS is expected to dynamically achieve higher compression rates depending on the sparsity nature of different datasets, while only compressing relative blocks in the image that contain the target to be tracked instead of compressing the whole image. Hence, saving power and increasing compression rates. Least mean square adaptive filter is used to predicts target’s next location to investigate the effect of CS on the tracking performance. The tracking is achieved in both indoor and outdoor environments for single/multi targets. Results have shown that with adaptive block CS up to 20 % measurements of data are required to be transmitted while preserving the required performance for target detection and tracking. 相似文献
4.
Multimedia Tools and Applications - Although much progress has been made in multi-object tracking in recent decades due to its variety of applications including visual surveillance, traffic... 相似文献
5.
Cross impact analysis (CIA) consists of a set of related methodologies that predict the occurrence probability of a specific event and that also predict the conditional probability of a first event given a second event. The conditional probability can be interpreted as the impact of the second event on the first. Most of the CIA methodologies are qualitative that means the occurrence and conditional probabilities are calculated based on estimations of human experts. In recent years, an increased number of quantitative methodologies can be seen that use a large number of data from databases and the internet. Nearly 80% of all data available in the internet are textual information and thus, knowledge structure based approaches on textual information for calculating the conditional probabilities are proposed in literature. In contrast to related methodologies, this work proposes a new quantitative CIA methodology to predict the conditional probability based on the semantic structure of given textual information. Latent semantic indexing is used to identify the hidden semantic patterns standing behind an event and to calculate the impact of the patterns on other semantic textual patterns representing a different event. This enables to calculate the conditional probabilities semantically. A case study shows that this semantic approach can be used to predict the conditional probability of a technology on a different technology. 相似文献
6.
Multimedia Tools and Applications - Robust and accurate visual tracking is a challenging problem in computer vision. In this paper, we exploit spatial and semantic convolutional features extracted... 相似文献
7.
Multimedia Tools and Applications - Topic models have shown to be one of the most effective tools in Content-Based Multimedia Retrieval (CBMR). However, the high computational learning cost... 相似文献
8.
For many vision-based systems, it is important to detect a moving object automatically. The region-based motion estimation method is popular for automatic moving object detection. The region-based method has several advantages in that it is robust to noise and variations in illumination. However, there is a critical problem in that there exists an occlusion problem which is caused by the movement of the object. The occlusion problem results in an incorrect motion estimation and faulty detection of moving objects. When there are occlusion regions, the motion vector is not correctly estimated. That is, a stationary background in the occluded region can be classified as a moving object.In order to overcome this occlusion problem, a new occlusion detection algorithm is proposed. The proposed occlusion detection algorithm is motivated by the assumption that the distribution of the error histogram of the occlusion region is different from that of the nonocclusion region. The proposed algorithm uses the mean and variance values to decide whether an occlusion has occurred in the region. Therefore, the proposed occlusion detection and motion estimation scheme detects the moving regions and estimates the new motion vector, while avoiding misdetection caused by the occlusion problem. The experimental results for several video sequences demonstrate the robustness of the proposed approach to the occlusion problem.This work was presented in part at the 8th International Symposium on Artificial Life and Robotics, Oita, Japan, January 24–26, 2003 相似文献
9.
Motion anomaly detection through video analysis is important for delivering autonomous situation awareness in public places. Surveillance scene segmentation and representation is the preliminary step to implementation anomaly detection. Surveillance scene can be represented using Region Association Graph (RAG), where nodes represent regions and edges denote connectivity among the regions. Existing RAG-based analysis algorithms assume simple anomalies such as moving objects visit statistically unimportant or abandoned regions. However, complex anomalies such as an object encircles within a particular region (Type-I) or within a set of regions (Type-II). In this paper, we extract statistical features from a given set of object trajectories and train multi-class support vector machines (SVM) to deal with each type of anomaly. In the testing phase, a given test trajectory is categorized as normal or anomalous with respect to the trained models. Performance evaluation of the proposed algorithm has been carried out on public as well as our own datasets. We have recorded sensitivity as high as 86% and fall-out rate as low as 9% in experimental evaluation of the proposed technique. We have carried out comparative analysis with state-of-the-art techniques to benchmark the method. It has been observed that the proposed model is consistent and highly accurate across challenging datasets. 相似文献
11.
A biologically inspired approach to active visual target tracking is presented. The approach makes use of three strategies found in biological systems: space-variant sensing, a spatio-temporal-frequency-based model of motion detection and the alignment of sensory-motor maps. Space-variant imaging is used to create a 1D array of elementary motion detectors (EMDs) that are tuned in such a way as to make it possible to detect motion over a wide range of velocities while still being able to detect motion precisely. The array is incorporated into an active visual tracking system. A method of analysis and design for such a tracking system is proposed. It makes use of a sensory-motor map which consists of a phase-plane plot of the continuous-time dynamics of the tracking system overlaid onto a map of the detection capabilities of the array of EMDs. This sensory-motor map is used to design a simple 1D tracking system and several simulations show how the method can be used to control tracking performance using such metrics as overshoot and settling time. A complete 1D active vision system is implemented and a set of simple target tracking experiments are performed to demonstrate the effectiveness of the approach. 相似文献
12.
During the last decade, the development of the immersive virtual reality (VR) has achieved a great progress in different application areas. For more advanced large-scale immersive VR environments or systems, one of the most challenge is to accurately track the position of the user’s body part such as head when he/she is immersived in the environment to feel the changes among the synthetic stereoscopic image sequences. Unfortunately, accurate tracking is not easy in the virtual reality scenarios due to the variety types of existing intrinsic and extrinsic changes when tracking is on-the-fly. Especially for the single tracker, a long time accurate tracking is usually not possible because of the model adaption problem in different environments. Recent trend of research in tracking is to incorporate multiple trackers into a compositive learning framework and utilize the advantages of different trackers for more effective tracking. Therefore, in this paper, we propose a novel Bayesian tracking fusion framework with online classifier ensemble strategy. The proposed tracking formulates a fusion framework for online learning of multiple trackers by modeling a cumulative loss minimization process. With an optimal pair-wise sampling scheme for the SVM classifier, the proposed fusion framework can achieve more accurate tracking performance when compared with the other state-of-art trackers. In addition, the experiments on the standard benchmark database also verify that the proposed tracking is able to handle the challenges in many immersive VR applications and environments. 相似文献
13.
In the paper, the most state-of-the-art methods of automatic text summarization, which build summaries in the form of generic
extracts, are considered. The original text is represented in the form of a numerical matrix. Matrix columns correspond to
text sentences, and each sentence is represented in the form of a vector in the term space. Further, latent semantic analysis
is applied to the matrix obtained to construct sentences representation in the topic space. The dimensionality of the topic
space is much less than the dimensionality of the initial term space. The choice of the most important sentences is carried
out on the basis of sentences representation in the topic space. The number of important sentences is defined by the length
of the demanded summary. This paper also presents a new generic text summarization method that uses nonnegative matrix factorization
to estimate sentence relevance. Proposed sentence relevance estimation is based on normalization of topic space and further
weighting of each topic using sentences representation in topic space. The proposed method shows better summarization quality
and performance than state-of-the-art methods on the DUC 2001 and DUC 2002 standard data sets. 相似文献
14.
为提高音乐检索效率,使检索结果与搜索目的更接近,提出了基于隐含语义分析的音乐检索方法.将曲谱表示为标准音符和音转的交替串,基于每个交替串使用频率高于包含它的多交替串排列的事实,设计了音乐词汇统计算法.为使各分句能整齐地转化为相同维数的向量,使用最长的分句长度作为标准维数,基于增加频率和的原则进行单词的重新分割.实验结果表明,基于隐含语义分析的检索能获得令人满意的检索结果. 相似文献
15.
Laser devices have received increasing attention in numerous computer-aided applications such as automatic control, 3D modeling and virtual reality. In this paper, aiming at people counting, we propose a novel people detection and tracking method based on the multipoint infrared laser, which can further facilitate intelligent scene modeling and analysis. In our method, a camera with the infrared lens filter is utilized to capture the monitored scene where an array of infrared spots is produced by the multipoint infrared laser. We build a spatial background model based on locations of spots. Pedestrians are detected by clustering of foreground spots. Then, our method tracks and counts the detected pedestrians via inferring the forward–backward motion consistency. Both quantitative and qualitative evaluation and comparison are conducted, and the experimental results demonstrate that the proposed method achieves excellent performance in challenging scenarios. 相似文献
17.
An algorithm is proposed for revealing latent user’s interests from the observable protocol of users behavior, e.g., site visits. The algorithm combines the ideas of customer environment analysis and probabilistic latent semantic analysis. A quality criterion based on the classification of preliminarily labeled sites is introduced to optimize the algorithm parameters and compare algorithms. The experiments show that the quality has an optimum by the essential parameters of the algorithm, however the attempt of too precise optimization can lead to overfitting. 相似文献
18.
In this paper, the problem of automatic document classification by a set of given topics is considered. The method proposed is based on the use of the latent semantic analysis to retrieve semantic dependencies between words. The classification of document is based on these dependencies. The results of experiments performed on the basis of the standard test data set TREC (Text REtrieval Conference) confirm the attractiveness of this approach. The relatively low computational complexity of this method at the classification stage makes it possible to be applied to the classification of document streams. 相似文献
19.
Many national and international governments establish organizations for applied science research funding. For this, several organizations have defined procedures for identifying relevant projects that based on prioritized technologies. Even for applied science research projects, which combine several technologies it is difficult to identify all corresponding technologies of all research-funding organizations. In this paper, we present an approach to support researchers and to support research-funding planners by classifying applied science research projects according to corresponding technologies of research-funding organizations. In contrast to related work, this problem is solved by considering results from literature concerning the application based technological relationships and by creating a new approach that is based on latent semantic indexing (LSI) as semantic text classification algorithm. Technologies that occur together in the process of creating an application are grouped in classes, semantic textual patterns are identified as representative for each class, and projects are assigned to one of these classes. This enables the assignment of each project to all technologies semantically grouped by use of LSI. This approach is evaluated using the example of defense and security based technological research. This is because the growing importance of this application field leads to an increasing number of research projects and to the appearance of many new technologies. 相似文献
20.
针对现有单一底层特征识别扣件状态的算法存在描述能力差、特征维度过高等问题,提出一种基于两种扣件底层特征的潜在语义主题融合的扣件检测模型.通过潜在狄利克雷分布(LDA)模型分别获取扣件图像的局部二值模式(LBP)特征和方向梯度直方图(HOG)特征的扣件语义主题向量.将这两种语义主题向量进行加权融合,权值由该图像LBP特征图和其梯度图的信息熵来确定.以该向量训练分类器,判断待检扣件状态.实验表明:与目前的主流扣件检测方法相比,该方法的漏检率和误检率明显降低,检测能力显著增强. 相似文献
|