Speech emotion recognition (SER) systems identify emotions from the human voice in the areas of smart healthcare, driving a vehicle, call centers, automatic translation systems, and human-machine interaction. In the classical SER process, discriminative acoustic feature extraction is the most important and challenging step because discriminative features influence the classifier performance and decrease the computational time. Nonetheless, current handcrafted acoustic features suffer from limited capability and accuracy in constructing a SER system for real-time implementation. Therefore, to overcome the limitations of handcrafted features, in recent years, variety of deep learning techniques have been proposed and employed for automatic feature extraction in the field of emotion prediction from speech signals. However, to the best of our knowledge, there is no in-depth review study is available that critically appraises and summarizes the existing deep learning techniques with their strengths and weaknesses for SER. Hence, this study aims to present a comprehensive review of deep learning techniques, uniqueness, benefits and their limitations for SER. Moreover, this review study also presents speech processing techniques, performance measures and publicly available emotional speech databases. Furthermore, this review also discusses the significance of the findings of the primary studies. Finally, it also presents open research issues and challenges that need significant research efforts and enhancements in the field of SER systems.
电力大用户最大需量控制是降低电网峰值负荷、节约用户电费成本的重要技术手段.面向强波动性和冲击性工业电能需量控制,研究了超短期需量负荷的多步预测问题.基于集成经验模态分解(EE-MD)方法,通过二次分解有效分离时间序列中不同频率的信号,采用长短期记忆网络(LSTM)对各信号子序列进行独立预测,最后组合预测结果.实验结果表明,本方法能很好的预测工业需量负荷变化,M A PE/MAE/NRMSE精度指标基本控制在2% 以内,明显优于多种现行主流时序预测模型和最新文献方法,且消除了多步预测的传递误差,预测模型精度和稳定性满足需量控制要求. 相似文献