首页 | 本学科首页   官方微博 | 高级检索  
     

基于级联网络和残差特征的人脸特征点定位
引用本文:许爱东,黄文琦,明哲,陈伟亮,胡浩基,杨航.基于级联网络和残差特征的人脸特征点定位[J].浙江大学学报(自然科学版 ),2019,53(12):2365-2371.
作者姓名:许爱东  黄文琦  明哲  陈伟亮  胡浩基  杨航
作者单位:1. 南方电网科学研究院,广东 广州,5100802. 南方电网数字电网研究院,广东 广州,5100803. 浙江大学 信息与电子工程学院,浙江 杭州,310027
基金项目:中国南方电网有限责任公司科技资助项目(ZBKJXM20170086)
摘    要:为进一步提高人脸特征点定位精度,探究当前广泛用于人脸关键点定位的全卷积神经网络(FCN)架构的原理和缺陷,讨论FCN核函数在特征点定位中引入的副作用,即训练和测试时评判准则不一致的问题. 理论分析该问题存在的可能性和普遍性,设计实验验证在实际场景下此问题存在的广泛性. 提出结合残差特征的沙漏网络结构并将其应用于人脸特征点检测;提出多级沙漏网络的级联结构,并将其与经典的栈式沙漏网络进行对比分析. 实验结果表明:二级级联结构获得了与四级栈式结构相当的特征点定位精度,大幅降低了模型参数量和时间复杂度. 所提方法在300-W数据库的困难子集上的平均归一化误差为6.84%,优于已有最好方法.

关 键 词:人脸特征点检测  全卷积神经网络(FCN)  残差特征  级联结构  

Facial landmark localization based on cascaded hourglass network with residual features
Ai-dong XU,Wen-qi HUANG,Zhe MING,Wei-liang CHEN,Roland HU,Hang YANG.Facial landmark localization based on cascaded hourglass network with residual features[J].Journal of Zhejiang University(Engineering Science),2019,53(12):2365-2371.
Authors:Ai-dong XU  Wen-qi HUANG  Zhe MING  Wei-liang CHEN  Roland HU  Hang YANG
Abstract:The principles and defects of full convolutional network (FCN), which was widely utilized in facial landmark localization, were studied to improve the facial landmark localization accuracy. Discuss the side effects introduced by the kernel function in the feature of FCN, that the evaluation criteria were inconsistent during training and testing. Firstly, theoretically analyze the possibility and the universality of this problem, and then design experiments to verify the existence of this problem in actual situation. To solve this problem, a hourglass network structure was proposed for facial landmark localization combining residual features; the cascaded hourglass network structure was given. The experimental results show that the two-stage cascade structure can obtain comparable accuracy compared with the four-stage stack structure, which means that the model parameter quantity and time complexity will be reduced greatly. The average normalization error of the proposed method on the difficult subset of the 300-W database was 6.84%, which is better than the previous best result.
Keywords:facial landmark localization  fully convolutional network (FCN)  residual feature  cascaded structure  
本文献已被 CNKI 等数据库收录!
点击此处可从《浙江大学学报(自然科学版 )》浏览原始摘要信息
点击此处可从《浙江大学学报(自然科学版 )》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号