基于强化学习的立体全景视频自适应流 Adaptive Streaming of Stereoscopic Panoramic Video Based on Reinforcement Learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于强化学习的立体全景视频自适应流

引用本文：	兰诚栋,饶迎节,宋彩霞,陈建.基于强化学习的立体全景视频自适应流[J].电子与信息学报,2022,44(4):1461-1468.

作者姓名：	兰诚栋饶迎节宋彩霞陈建

作者单位：	1.福州大学物理与信息工程学院福州 3501082.福建省媒体信息智能处理与无线传输重点实验室福州 350108

基金项目：	国家自然科学基金;福建省自然科学基金项目

摘要：	针对当前立体全景视频传输缺少有效的流自适应方法，且传统全景视频流自适应策略传输双目立体全景视频使得传输数据加倍，所需带宽巨大的问题，该文提出一种基于多智能体强化学习的立体全景视频非对称传输自适应流方法，以实时应对网络带宽波动。首先，根据人眼对视频显著性区域的偏爱，左右视点中每个瓦片(tile)对立体视频的感知质量的贡献度不同，提出一个基于tiles的左右视点观看概率预测方法。其次，设计了一种基于策略-评价(Actor-Critic)的多智能体强化学习框架，对左右视点进行联合码率控制。最后，根据模型结构和双目抑制原理，设计合理的奖励函数。实验结果表明，与传统流自适应传输策略相比，该文所提方法更加适用于基于tiles的立体全景视频传输，实现在有限带宽下提高用户的体验质量(QoE)，为立体全景视频联合码率控制提供了一种全新的方法和思路。
关键词：	立体全景视频传输多智能体强化学习视点预测联合码率控制
收稿时间：	2020-10-23
Adaptive Streaming of Stereoscopic Panoramic Video Based on Reinforcement Learning

LAN Chengdong,RAO Yingjie,SONG Caixia,CHEN Jian.Adaptive Streaming of Stereoscopic Panoramic Video Based on Reinforcement Learning[J].Journal of Electronics & Information Technology,2022,44(4):1461-1468.

Authors:	LAN Chengdong RAO Yingjie SONG Caixia CHEN Jian

Affiliation:	1.College of Physics and Information Engineering, Fuzhou University, Fuzhou 350108, China2.Fujian Provincial Key Laboratory of Media Information Intelligent Processing and Wireless Transmission, Fuzhou 350108, China

Abstract:	Currently, an effective stream adaptation method for stereo panoramic video transmission is missing. However, the traditional panoramic video adaptive streaming strategy for transmitting binocular stereo panoramic video suffers from the problem of doubling the transmission data and requiring huge bandwidth. A multi-agent reinforcement learning based stereo panoramic video asymmetric transmission adaptive streaming method is proposed in this paper to cope with the limited bandwidth and fluctuation of network bandwidth in real time. First, due to the human eye's preference for the saliency regions of video, each tile in the left and right viewpoints of stereoscopic video contributes differently to the perceptual quality, and a tiles-based method for predicting the watching probability of left and right viewpoint is proposed. Second, a multi-agent reinforcement learning framework based on policy-value (Actor-Critic) is designed for joint rate control of left and right viewpoints. Finally, a reasonable reward function is designed based on the model structure and the principle of binocular suppression. The experimental results show that the proposed method is more suitable for tiles-based stereo panoramic video transmission than the traditional self-adaptive stream transmission strategy. A novel approach is proposed for stereo panoramic video joint rate control and user Quality of Experience (QoE) improvement under limited bandwidth.

Keywords:
本文献已被万方数据等数据库收录！
	点击此处可从《电子与信息学报》浏览原始摘要信息
	点击此处可从《电子与信息学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏