A novel full-reference perceptual quality evaluation model for audio-visual synchronization

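Cite this article:	WEI Yao-du, XIE Xiang, KUANG Jing-ming, HAN Xin-lu. Novel full reference perceptual quality metric for audio-visual asynchrony[J]. Journal on Communications (通信学报), 2012, 0(2): 182-190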

Authors:	WEI Yao-du (魏耀都), XIE Xiang (谢湘), KUANG Jing-ming (匡镜明), HAN Xin-lu (韩辛璐)

Affiliation:	School of Information and Electronics, Beijing Institute of Technology

Funding:	Supported by the National Science and Technology Major Project of China (2010ZX03004-003)

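Abstract:	A full-reference perceptual quality model for audio-visual synchronization, built using co-inertia analysis, is proposed. The synchronization error between the audio and video under test is obtained by alignment. Audio-visual content is divided into three categories: clean speech, non-speech, and speech with background (mixed speech). The clean speech category is further divided into two subcategories: a speaker is visible in the video, or no speaker is visible. Multi-dimensional features are selected separately for each category, and co-inertia analysis is applied to these features to obtain the most correlated audio-visual feature mappings and their degree of correlation. A correlation curve is computed from the reference audio and video, and from it the mapping from synchronization error to perceptual quality is derived. The results show that the model's scores correlate well with the results of subjective tests.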
Keywords:	information processing technology; audio-visual quality assessment; co-inertia analysis; synchronization
Novel full reference perceptual quality metric for audio-visual asynchrony

WEI Yao-du, XIE Xiang, KUANG Jing-ming, HAN Xin-lu. Novel full reference perceptual quality metric for audio-visual asynchrony[J]. Journal on Communications, 2012, 0(2): 182-190

Authors:	WEI Yao-du, XIE Xiang, KUANG Jing-ming, HAN Xin-lu

Affiliation:	School of Information and Electronics, Beijing Institute of Technology, Beijing 100081, China

Abstract:	A full-reference model was proposed to evaluate the perceptual quality of audiovisual asynchrony. A standard synchronization process was used to determine the time difference between audio and video. The mapping between this time difference and perceptual quality was derived by co-inertia analysis, which extracted the most correlated components from the audio and video features and then formed a mapping for each audiovisual sequence. Audiovisual content was divided into three categories: clean speech, non-speech, and mixed speech. The clean speech category was further split into two subcategories. Audio and video features were chosen separately for each category. Subjective tests showed that the predictions of the proposed model agree well with subjective results.

Keywords:	signal processing technique; audiovisual quality assessment; co-inertia analysis; synchrony
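
As a rough illustration of the co-inertia step mentioned in the abstract, the sketch below finds the pair of projection directions that maximizes the covariance between audio and video feature matrices (the leading singular vectors of their cross-covariance matrix), then sweeps a frame offset to trace a correlation curve. Everything specific here is an assumption made for illustration and is not taken from the paper: the function names `co_inertia_first_axis` and `correlation_curve`, the synthetic features, the feature dimensions, and the lag range. The paper's actual features and its mapping from synchronization error to perceptual quality are not reproduced.

```python
import numpy as np


def co_inertia_first_axis(X, Y):
    """Leading co-inertia axes of X (n_frames, p) and Y (n_frames, q).

    Returns (a, b, cov, corr): unit-norm directions a and b, the maximized
    covariance of the projections, and their Pearson correlation.
    """
    Xc = X - X.mean(axis=0)               # center each audio feature
    Yc = Y - Y.mean(axis=0)               # center each video feature
    n = Xc.shape[0]
    C = Xc.T @ Yc / (n - 1)               # cross-covariance matrix, shape (p, q)
    U, s, Vt = np.linalg.svd(C, full_matrices=False)
    a, b = U[:, 0], Vt[0, :]              # directions with maximal covariance
    u, v = Xc @ a, Yc @ b                 # projected audio and video scores
    corr = np.corrcoef(u, v)[0, 1]
    return a, b, s[0], corr


def correlation_curve(X, Y, lags):
    """Correlation of the leading co-inertia scores for each frame lag of Y."""
    n = X.shape[0]
    curve = []
    for lag in lags:
        if lag >= 0:
            Xs, Ys = X[: n - lag], Y[lag:]
        else:
            Xs, Ys = X[-lag:], Y[: n + lag]
        curve.append(co_inertia_first_axis(Xs, Ys)[3])
    return np.array(curve)


# Synthetic stand-in data (hypothetical): both streams share one latent signal.
rng = np.random.default_rng(0)
n_frames = 500
z = rng.standard_normal(n_frames)
audio_weights = np.array([1.0, -0.5, 0.8, 0.3])
video_weights = np.array([0.7, 1.2, -0.4])
X = np.outer(z, audio_weights) + 0.5 * rng.standard_normal((n_frames, 4))
Y = np.outer(z, video_weights) + 0.5 * rng.standard_normal((n_frames, 3))

lags = np.arange(-10, 11)                 # offsets in frames, assumed range
curve = correlation_curve(X, Y, lags)
best_lag = int(lags[np.argmax(curve)])    # offset where the streams agree most
print("lag with highest correlation:", best_lag)
```

In this toy setup the audio and video features are generated in sync, so the curve should peak at the zero offset and fall off elsewhere. A curve of this kind, computed on reference material, is the sort of object from which a mapping between synchronization error and quality could be built.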
This article is indexed in CNKI and other databases.