首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到18条相似文献,搜索用时 93 毫秒
1.
情感匹配模型是一种常用于评价词曲匹配程度的方法;然而,单纯地依靠情感匹配模型无法对评价词曲匹配度进行准确的评价。为改善此问题,提出了基于序列模型的词曲匹配度智能评估算法,其综合考虑词曲情感和词曲间节奏关系以给出一个更加准确的词曲评估方法。基于公开词曲同步数据集,构建了音乐情感和节奏正反例模型,并基于此模型将音乐切分成片段;进一步,将歌词和旋律片段分别通过歌词编码器和旋律编码器进行编码,并将编码后具有上下语境的歌词特征和旋律特征输入词曲匹配解码器,解析词曲间特征关系,判断词曲片段匹配程度。仿真结果表明:基于序列模型的词曲匹配度智能评估算法,相对于单纯的情感匹配模型,能够更精确地评价词曲匹配程度,验证了本文提出算法的有效性。  相似文献   

2.
为了自动获得入侵证据,提出一种基于攻击特征的自动证据筛选方法.其原理是首先根据被调查攻击的特征重构出攻击行为细节,并从中抽取筛选证据需要的"特征信息".然后,再根据候选数据与这些特征信息的匹配程度筛选出该攻击相关的证据.基于DARPA 2000的实验表明这种方法具有很高的准确率,其完备性更是接近100%.而与现有方法的...  相似文献   

3.
探讨了一种基于机器视觉的PCB自动装配线多焊盘实时定位方法。采用多分辨率图像金字塔匹配策略,利用模板图像与待搜索图像的灰度特性,使用圆投影匹配进行初始候选匹配点的选取,得到一系列的候选匹配子图;使用SIFT算法对候选匹配子图和模板图像进行特征匹配,确定对应匹配点,消除误匹配的候选子图;根据点的模式匹配,确定大致的旋转角度,使用重采样和插值的方法计算精确的旋转角度。实验表明,该方法可以准确、实时地实现目标定位。  相似文献   

4.
为了减少形状匹配阶段的计算量并保持匹配精度,提出一种结合多尺度三角形特征的形状描述子,利用形状整体特征进行粗匹配并使用实时匹配方法完成匹配过程.首先通过高斯演化获得多尺度下的形状轮廓并用不同尺度上的轮廓点组成特征三角形;然后采用三角形面积和特征角来描述轮廓信息;最后用形状面积率对候选轮廓进行筛选,通过计算不同形状间描述子的相似度进行匹配.在不同形状数据集上进行实验的结果表明,该方法有较好的匹配结果和较少的匹配耗时,能够快速准确地描述形状的局部及整体特征,在实际应用中具有优势.  相似文献   

5.
一种利用形状片段的物体检测方法   总被引:2,自引:0,他引:2  
针对物体检测中传统方法计算量大, 对复杂背景敏感, 且大部分物体检测方法只能得到物体所在区域而不能精确定位物体边缘等问题, 本文提出一种基于形状片段特征的物体检测方法. 该方法首先根据训练集得到具有多尺度特性的物体形状片段模型; 对测试图像按照和模型中边缘片段提取一致方法, 根据形状片段间的相似度, 选取出候选形状片段; 然后根据候选形状片段估计出模型中的片段与测试图像中片段之间的旋转角; 最后结合具有全局约束的概率Hough变换, 将物体检测问题转换为Hough空间概率问题; 根据Hough空间求解出的物体中心位置, 对候选形状片段验证, 得到实际物体轮廓片段. 理论分析和实验结果表明, 本文提出的算法具有较好的效果.  相似文献   

6.
在词袋模型基础上,综合考虑姿态之间的时序约束关系,提出一种基于局部匹配窗口的动作识别方法.首先采用人体姿态差别作为动作序列特征描述.其次,在模型学习阶段,使用局部训练法而非传统的整体训练法来提高特征词汇的表征性;在特征量化阶段,使用自适应局部线性重构策略来给特征基更灵活的权值;在对象描述阶段,分别使用时间金字塔、滑动窗口2种方法将整个动作序列划分成多个局部动作片段,进而通过连接各个局部动作片段的特征来描述整个动作序列.最后使用直方图相交操作来完成特征匹配工作.在MSR Action3D数据库上测试了所提算法的性能并与已有的动作识别方法进行对比,结果表明,该方法的识别效果较优.  相似文献   

7.
基于Lab色彩空间和模板匹配的实时交通灯识别算法   总被引:2,自引:0,他引:2  
徐成  谭乃强  刘彦 《计算机应用》2010,30(5):1251-1254
交通灯的识别是无人驾驶汽车研究的一个重要方面,提出一种实时交通灯识别算法。首先对图像进行形态学预处理,然后对三种色彩的交通灯取样,利用交通灯在色彩方面的特征,在对红、黄、绿色彩描述更为清晰的Lab色彩空间,查找交通灯的候选区域。根据交通灯在结构上被一个黑色矩形框包围的特征,设计出三种交通灯模板,使用模板匹配对候选区域进行确认。最后使用统计方法对结果进行验证。实验结果表明,该算法能实时准确地识别出交通灯。  相似文献   

8.
基于SIFT特征的合成孔径雷达景象匹配方法   总被引:3,自引:1,他引:2  
根据合成孔径雷达图像的特点,提出一种基于SIFT特征的合成孔径雷达(SAR)景象匹配的方法。首先利用改进的特征描述符初步提取实时图与参考图的SIFT 关键点;然后利用距离比和RANSAC算法去除错配,匹配出可靠的同名点对;最后计算反映实时图和参考图之间变换关系的转换参数,完成景象匹配。实验结果表明,本方法快速实用,有较强的有效性和鲁棒性。  相似文献   

9.
提出一种新的基于高斯过程动态模型的节奏转移方法。该方法能够准确、有效地将现有运动中的节奏信息转移到新的运动中去,适用于各种不同类型的运动。首先,使用短时(short term)PCA计算源运动的节奏点,组合3种重要的运动特征求解目标运动的特征点;然后,使用动态规划算法来找到两者之间的最佳匹配,最大化减少计算时间及对目标运动的修改;最后,使用高斯过程动态模型对目标运动进行学习,并在隐空间进行节奏化插值,最终合成新的节奏化运动。  相似文献   

10.
基于点特征的旋转图像匹配新方法   总被引:1,自引:0,他引:1  
图像匹配在模式识别、图像分析和计算机视觉中有着广泛的应用.图像匹配是将模板在参考图中逐像素移动,计算它们的灰度相似性,搜索相似性最大的位置.这种逐像素的搜索方法计算复杂度高.如果模板和参考图之间存在旋转,传统的匹配方法很难实时实现.提出了一种基于点特征的旋转图像的匹配方法,首先采用Harris角点检测算子提取图像的特征点,然后利用小面模型对特征点邻域进行拟合,提取特征点的旋转不变特征,最后利用特征点的旋转不变特征进行点集的匹配,获取图像的平移和旋转参数.该方法匹配结果准确,与传统的相关匹配方法相比计算复杂度很小,易于实时实现.  相似文献   

11.
We introduce a novel method for synthesizing dance motions that follow the emotions and contents of a piece of music. Our method employs a learning-based approach to model the music to motion mapping relationship embodied in example dance motions along with those motions' accompanying background music. A key step in our method is to train a music to motion matching quality rating function through learning the music to motion mapping relationship exhibited in synchronized music and dance motion data, which were captured from professional human dance performance. To generate an optimal sequence of dance motion segments to match with a piece of music, we introduce a constraint-based dynamic programming procedure. This procedure considers both music to motion matching quality and visual smoothness of a resultant dance motion sequence. We also introduce a two-way evaluation strategy, coupled with a GPU-based implementation, through which we can execute the dynamic programming process in parallel, resulting in significant speedup. To evaluate the effectiveness of our method, we quantitatively compare the dance motions synthesized by our method with motion synthesis results by several peer methods using the motions captured from professional human dancers' performance as the gold standard. We also conducted several medium-scale user studies to explore how perceptually our dance motion synthesis method can outperform existing methods in synthesizing dance motions to match with a piece of music. These user studies produced very positive results on our music-driven dance motion synthesis experiments for several Asian dance genres, confirming the advantages of our method.  相似文献   

12.
Dancing-to-Music Character Animation   总被引:1,自引:0,他引:1  
  相似文献   

13.
一种新的近似旋律匹配方法及其在哼唱检索系统中的应用   总被引:20,自引:0,他引:20  
提出了一种近似旋律匹配(approximate meltlody matching)的新方法——线性对齐匹配法,并在此基础上实现了一个哼唱检索(queryby humming)系统原型.与已有的基于内容的音乐检索(content-based music retrieval)不同,该算法并非基于近似符号串匹配、统计模型或者特征空间,而是根据相近旋律的音高轮廓在几何上的相似性,将音高和节奏特征一并考虑所设计而成的全新算法.通过实验检验该算法的有效性,在含有3864首乐曲的搜索空间中,检索62段人声哼唱,线性对齐匹配法取得了90.3%的前3位命中率,相比传统的近似符号匹配算法高出11%以上.这一实验结果有力地表明了线性对齐匹配法的有效性,及其应用于大型数字音乐检索引擎的可行性.  相似文献   

14.
The correlation between music and human motion has attracted widespread research attention. Although recent studies have successfully generated motion for singers, dancers, and musicians, few have explored motion generation for orchestral conductors. The generation of music-driven conducting motion should consider not only the basic music beats, but also mid-level music structures, high-level music semantic expressions, and hints for different parts of orchestras (strings, woodwind, etc.). However, most existing conducting motion generation methods rely heavily on human-designed rules, which significantly limits the quality of generated motion. Therefore, we propose a novel Music Motion Synchronized Generative Adversarial Network (M2S-GAN), which generates motions according to the automatically learned music representations. More specifically, M2S-GAN is a cross-modal generative network comprising four components: 1) a music encoder that encodes the music signal; 2) a generator that generates conducting motion from the music codes; 3) a motion encoder that encodes the motion; 4) a discriminator that differentiates the real and generated motions. These four components respectively imitate four key aspects of human conductors: understanding music, interpreting music, precision and elegance. The music and motion encoders are first jointly trained by a self-supervised contrastive loss, and can thus help to facilitate the music motion synchronization during the following adversarial learning process. To verify the effectiveness of our method, we construct a large-scale dataset, named ConductorMotion100, which consists of unprecedented 100 hours of conducting motion data. Extensive experiments on ConductorMotion100 demonstrate the effectiveness of M2S-GAN. Our proposed approach outperforms various comparison methods both quantitatively and qualitatively. Through visualization, we show that our approach can generate plausible, diverse, and music-synchronized conducting motion.  相似文献   

15.
艺术体操动作辅助编排系统   总被引:1,自引:0,他引:1  
针对艺术体操个人项目与集体项目编排的特点,采用运动捕获技术、运动编辑和合成技术以及音乐特征提取等技术,开发了艺术体操动作辅助编排系统.该系统可以对个人和集体动作编排的整个流程进行辅助,从而缩短动作编排时间,拓宽教练员创作空间,提高动作编排质量.  相似文献   

16.
面对海量音乐数据,如何在基于内容检索时对其建立索引、提高检索速度是一个非常重要的研究内容。以句为单位对音乐内容提取特征并建立索引时,库中保存的是音乐旋律的相对特征,但用户在哼唱检索时,有时会哼唱一段包含多句的音乐来检索,这时就需要生成多句特征。针对此本文提出单句特征和多句转换及匹配问题的解决方案并应用于检索系统中,获得了较好的检索效果,相关成果也可应用于具有相似结构的时间序列数据的检索系统中。  相似文献   

17.
Motion estimation is one of the major problems in developing video coding applications. Among all motion estimation approaches, Block-matching (BM) algorithms are the most popular methods due to their effectiveness and simplicity for both software and hardware implementations. A BM approach assumes that the movement of pixels within a defined region of the current frame can be modeled as a translation of pixels contained in the previous frame. In this procedure, the motion vector is obtained by minimizing a certain matching metric that is produced for the current frame over a determined search window from the previous frame. Unfortunately, the evaluation of such matching measurement is computationally expensive and represents the most consuming operation in the BM process. Therefore, BM motion estimation can be viewed as an optimization problem whose goal is to find the best-matching block within a search space. The simplest available BM method is the Full Search Algorithm (FSA) which finds the most accurate motion vector through an exhaustive computation of all the elements of the search space. Recently, several fast BM algorithms have been proposed to reduce the search positions by calculating only a fixed subset of motion vectors despite lowering its accuracy. On the other hand, the Harmony Search (HS) algorithm is a population-based optimization method that is inspired by the music improvisation process in which a musician searches for harmony and continues to polish the pitches to obtain a better harmony. In this paper, a new BM algorithm that combines HS with a fitness approximation model is proposed. The approach uses motion vectors belonging to the search window as potential solutions. A fitness function evaluates the matching quality of each motion vector candidate. In order to save computational time, the approach incorporates a fitness calculation strategy to decide which motion vectors can be only estimated or actually evaluated. Guided by the values of such fitness calculation strategy, the set of motion vectors is evolved through HS operators until the best possible motion vector is identified. The proposed method has been compared to other BM algorithms in terms of velocity and coding quality. Experimental results demonstrate that the proposed algorithm exhibits the best balance between coding efficiency and computational complexity.  相似文献   

18.
视频序列的全景图拼接技术   总被引:10,自引:0,他引:10       下载免费PDF全文
提出了一种对视频序列进行全景图拼接的方法。主要讨论了有大面积的非刚性运动物体出现的序列,不过此方法也同样适用于无运动物体的纯背景序列。为计算各帧间的投影关系,用仿射模型来描述摄像机运动,并用特征点匹配的方法计算出模型中各参数的值。由于用相关法计算的匹配结果准确率比较低,所以用RANSAC(Random Sampling Consensus)对匹配结果进行了筛选,可以准确求出摄像机运动参数。利用运动参数进行投影,然后用多帧相减并求交集,估计出每帧图像中运动物体存在的区域,最后计算得到了全景图。该方法的结果与前人得到的结果进行了比较,证明用此方法能获得质量较高的全景图。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号