Similar literature
1.
沈凌洁  王蔚 《声学技术》2018,37(2):167-174
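This paper proposes a method for Mandarin tone recognition in short utterances using a fused feature set that combines prosodic features (fundamental frequency, duration) with Mel-frequency cepstral coefficient (MFCC) features, aiming to exploit the strengths of both feature types to improve the tone recognition rate. The fused feature set comprises seven prosodic features and statistical parameters obtained from different models, plus four log posterior probabilities computed from the MFCCs of each segment; Gaussian mixture models represent the distribution of the cepstral features of the four tones. The experiment has two steps. First, classifiers based on prosodic and cepstral features are combined at the decision stage for tone classification, each classifier is assigned a weight, and the weights of the cepstral and prosodic features in the tone classification task are computed. Second, character-level prosodic features and frame-level cepstral features are combined into a supervector of fused features, which is used for Mandarin tone recognition; five classifiers (Gaussian mixture models in two configurations, a back-propagation neural network, a support vector machine, and a convolutional neural network (CNN)) are compared and evaluated on an imbalanced dataset using three metrics: accuracy, unweighted average recall (UAR), and Cohen's kappa coefficient. The results show that (1) the cepstral-feature method improves the Mandarin tone recognition rate, with a weight of 0.11 in the overall classification task; and (2) the deep learning (CNN) method based on the fused features achieves the highest tone recognition rate, 87.6%, an improvement of 5.87% over the Gaussian mixture model baseline. The study demonstrates that cepstral features provide information complementary to prosodic features, thereby improving short-utterance Mandarin tone recognition; the method can also be applied to related research such as prosody detection and paralinguistic information detection.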
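Two of the evaluation metrics named in the abstract above, UAR and Cohen's kappa, are designed for imbalanced class distributions. The following is a minimal sketch of their textbook definitions, not the authors' evaluation code; the function names and the toy labels are illustrative assumptions.

```python
from collections import Counter


def unweighted_average_recall(y_true, y_pred):
    """Mean of the per-class recalls, so every class counts equally."""
    recalls = []
    for cls in sorted(set(y_true)):
        total = sum(1 for t in y_true if t == cls)
        hits = sum(1 for t, p in zip(y_true, y_pred) if t == cls and p == cls)
        recalls.append(hits / total)
    return sum(recalls) / len(recalls)


def cohens_kappa(y_true, y_pred):
    """Agreement between labels and predictions, corrected for chance.

    Undefined (division by zero) when the chance agreement equals 1.
    """
    n = len(y_true)
    observed = sum(1 for t, p in zip(y_true, y_pred) if t == p) / n
    true_counts = Counter(y_true)
    pred_counts = Counter(y_pred)
    # Chance agreement: product of the marginal frequencies, summed over classes.
    expected = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    return (observed - expected) / (1 - expected)


# Toy imbalanced example (hypothetical labels): four samples of tone 1, two of tone 2.
y_true = [1, 1, 1, 1, 2, 2]
y_pred = [1, 1, 1, 1, 1, 2]
print(unweighted_average_recall(y_true, y_pred))  # 0.75
print(cohens_kappa(y_true, y_pred))
```

In this toy case plain accuracy is 5/6, while UAR drops to 0.75 because the minority class is only half recalled, which is why such metrics are preferred on imbalanced data.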

2.
Electroencephalography (EEG) is a medical imaging technology that measures and records, in chronological order, the electrical activity produced by the brain at the surface of the scalp. The recorded signals are rich in useful information, but inferring that information is a challenging task. This paper processes EEG signals to recognize human emotions, specifically happiness, anger, fear, sadness, and surprise, in response to audiovisual stimuli. The EEG signals are recorded by placing a NeuroSky MindWave headset on the subject's scalp while audiovisual stimuli for these emotions are presented. The raw EEG signals are preprocessed with a bandpass filter with a passband of 1–100 Hz. The preprocessed signals are then analyzed further, and twelve selected features in different domains are extracted. Random forest (RF) and multilayer perceptron (MLP) algorithms then classify the emotions from the extracted features. The proposed audiovisual-stimuli-based EEG emotion classification system achieves average classification accuracies of 80% and 88% with the MLP and RF classifiers, respectively, on hybrid features for experimental signals from different subjects. The authors report that the proposed model is superior in terms of cost and accuracy.

3.
Electroencephalography (EEG) eye-state classification has become an essential tool for identifying the cognitive state of humans. It can be used in several fields, such as motor imagery recognition, drug effect detection, emotion categorization, and seizure detection. With the latest advances in deep learning (DL) models, it is possible to design an accurate and prompt EEG eye-state classifier. To this end, this study presents a novel compact bat algorithm with deep learning model for biomedical EEG eye-state classification (CBADL-BEESC). The CBADL-BEESC technique aims to categorize the EEG eye state. The model performs feature extraction using the AlexNet model, which produces useful feature vectors. In addition, an extreme learning machine autoencoder (ELM-AE) model is applied to classify the EEG signals, and the parameters of the ELM-AE model are tuned using the compact bat algorithm (CBA). The CBADL-BEESC model is evaluated experimentally on benchmark data, and the comparative results show that it outperforms recent methods.

4.
Uncertainty in the human brain leads to the development of epilepsy. This article proposes automatic detection and severity analysis of epilepsy using a hybrid classification algorithm. The proposed method consists of decomposition, feature extraction, and classification stages. The electroencephalogram (EEG) signals are decomposed using the dual-tree complex wavelet transform, and features are extracted from the resulting coefficients. A neural network classifier then labels the EEG signals as either focal or nonfocal. Furthermore, the severity of the focal EEG signals is analyzed using an adaptive neuro-fuzzy inference system classifier. The proposed hybrid method achieved 98.6% sensitivity, 99.1% specificity, and 99.4% accuracy in classifying focal and nonfocal signals. The average detection rate for the focal and nonfocal datasets is about 98.5%.
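Sensitivity, specificity, and accuracy, the three figures reported in the abstract above, all derive from the same four confusion-matrix counts. A minimal sketch follows, assuming "focal" is treated as the positive class; the function name and the example counts are hypothetical and not taken from the paper.

```python
def binary_metrics(tp, fn, tn, fp):
    """Return (sensitivity, specificity, accuracy) from confusion-matrix counts.

    tp/fn: positive (here assumed: focal) signals classified correctly/incorrectly.
    tn/fp: negative (nonfocal) signals classified correctly/incorrectly.
    """
    sensitivity = tp / (tp + fn)  # share of positives that were detected
    specificity = tn / (tn + fp)  # share of negatives that were rejected
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return sensitivity, specificity, accuracy


# Hypothetical counts for illustration only.
sens, spec, acc = binary_metrics(tp=90, fn=10, tn=80, fp=20)
print(sens, spec, acc)  # 0.9 0.8 0.85
```

When all three values come from one confusion matrix, accuracy is a weighted average of sensitivity and specificity, with the weights given by the class proportions.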

5.
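To address the low accuracy of emotion recognition from physiological signals under dimensional emotion models, this paper uses the DEAP dimensional emotion physiological dataset and applies AR-model power spectrum estimation to extract the power spectral density of the EEG θ, α, β, and γ rhythms; wavelet packet decomposition to extract time-frequency features, namely EEG wavelet packet coefficients and energy ratios; and nonlinear analysis to extract EEG sample entropy and wavelet packet entropy features. A stacked autoencoder neural network algorithm is then designed for the combined EEG features in the two emotional dimensions of valence and arousal...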

6.
The convolutional neural network (CNN) is a widely used deep neural network. Compared with shallow neural networks, the CNN offers better performance and faster computation in some image recognition tasks, and it can effectively avoid the problem of network training falling into local extrema. CNNs have been applied in many fields, including fault diagnosis, where they have improved the level and efficiency of diagnosis. This paper proposes a two-stream convolutional neural network (TCNN) model. Taking the short-time Fourier transform (STFT) spectrum and Mel-frequency cepstral coefficient (MFCC) features of acoustic emission (AE) signals as the two input streams, an AE signal processing and classification system is constructed and compared with traditional AE signal recognition methods and traditional CNNs. The experimental results illustrate the effectiveness of the proposed model. Compared with a single-stream CNN and a simple long short-term memory (LSTM) network, the TCNN, which combines spatial and temporal features, performs much better: its accuracy reaches 100% on the current database, 12% higher than that of the single-stream network.

7.
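To address the low recognition rate in speech emotion recognition caused by the inability to model key spatiotemporal dependencies, this paper proposes a speech emotion recognition algorithm based on self-attention spatiotemporal features, which uses a bilinear convolutional neural network, a long short-term memory network, and a multi-head attention mechanism to automatically learn the best spatiotemporal representation of the speech signal. First, the log-Mel (log...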

8.
付荣荣  隋佳新  刘冲  张扬 《计量学报》2022,43(8):1103-1108
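The recognition and classification of motor imagery EEG signals has long been a hot topic in brain-computer interface research. To address this problem, a manifold learning method, as distinct from traditional linear dimensionality reduction, is used: the common spatial pattern algorithm is combined with the uniform manifold projection algorithm, making full use of the nonlinear features in EEG signals to perform feature extraction and dimensionality reduction on motor imagery EEG signals. A KNN classifier is used for classification, and the classification performance is evaluated. Classification results before and after dimensionality reduction are compared, illustrating the advantages and necessity of dimensionality reduction, and the performance of the reduced data in visualization is further discussed. The visualization of dimension-reduced feature data is found to be clearly better than that of unreduced data. A new EEG signal recognition method based on common spatial patterns and uniform manifold projection is thus proposed, which provides a reference for in-depth analysis of EEG signals and mining of their nonlinear features, and offers new ideas for motor imagery EEG recognition from the perspectives of data manifold distribution and data visualization.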

9.
付荣荣  杨阳  于宝  刘冲  张驰 《计量学报》2021,42(12):1679-1685
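Implementing a brain-computer interface system requires effective feature extraction algorithms. To address the shortcoming that feature extraction based on two-dimensional principal component analysis (2DPCA) ignores the frequency-domain features of EEG signals, and the limitation that wavelet parameters are hard to determine when higher-order EEG tensors are constructed by wavelet decomposition, a feature extraction method is proposed that constructs higher-order tensors using ensemble empirical mode decomposition (EEMD) and reduces dimensionality with multilinear principal component analysis (MPCA). Controlled experiments with three feature extraction methods were designed, and classification accuracy was obtained using Fisher linear discriminant analysis. The results show that, compared with the method that builds higher-order tensors by wavelet decomposition and reduces dimensionality with MPCA, and with the 2DPCA feature extraction method, the proposed method raises average recognition accuracy by 4.75% and 2.6%, respectively, and reduces the variance of recognition accuracy by 72.69% and 23.86%, respectively. The method improves single-trial motor imagery EEG recognition accuracy while offering better applicability, laying a foundation for decoding motor imagery EEG signals.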

10.
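Emotion recognition is studied through entropy analysis of EEG signals. Based on the nonlinear and multifractal characteristics of EEG signals, an emotional EEG feature extraction algorithm combining permutation entropy with multifractal indices is proposed. The algorithm combines permutation entropy, the Hurst exponent, the mass exponent, and the singularity spectrum width to extract emotional EEG features, and uses a support vector machine (SVM) for emotion recognition. The results show that for pairwise emotion recognition the highest test-set accuracy reaches 92.8%, and accuracy is above 80% for every pair except excited versus frightening; compared with using permutation entropy or fractal index features alone, the highest accuracy improves by 41.9% and 31.2%, respectively. Recognition of two classes of emotional state, positive and negative, is further analyzed: the average test-set accuracy is 78.3%, which is 26.7% and 1.6% higher than with permutation entropy and multifractal features, respectively. The results indicate that the feature extraction algorithm combining permutation entropy with multifractal indices can fully exploit the nonlinear characteristics and multifractal information of EEG signals and is an effective emotional EEG feature extraction algorithm.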
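Permutation entropy, one of the features combined in the abstract above, measures how evenly the ordinal patterns of short windows are distributed in a signal. The sketch below follows the standard Bandt-Pompe definition; the embedding order, delay, and normalization are assumed defaults, since the abstract does not state the settings used.

```python
import math
from collections import Counter


def permutation_entropy(signal, order=3, delay=1, normalize=True):
    """Shannon entropy (in bits) of the ordinal patterns found in `signal`.

    order: number of samples per window (embedding dimension, assumed 3).
    delay: spacing between the samples of one window (assumed 1).
    """
    n_windows = len(signal) - (order - 1) * delay
    if n_windows <= 0:
        raise ValueError("signal is too short for the chosen order and delay")
    patterns = Counter()
    for start in range(n_windows):
        window = [signal[start + j * delay] for j in range(order)]
        # Ordinal pattern: indices that sort the window (ties keep index order).
        pattern = tuple(sorted(range(order), key=lambda k: window[k]))
        patterns[pattern] += 1
    entropy = -sum((c / n_windows) * math.log2(c / n_windows)
                   for c in patterns.values())
    if normalize:
        entropy /= math.log2(math.factorial(order))  # scale to the range [0, 1]
    return entropy


# A monotonic ramp has a single ordinal pattern, hence zero entropy.
print(permutation_entropy([1, 2, 3, 4, 5, 6]) == 0)  # True
```

Values near 0 indicate a highly regular signal and values near 1 an irregular one, which is what makes the measure usable as a scalar feature for a classifier such as an SVM.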

11.
Skin lesion recognition is an important challenge in the medical field. This paper implements an intelligent classification system based on convolutional neural networks. The system first determines whether the input image is a dermoscopic image, with an accuracy of 99%, and then diagnoses dermoscopic and non-dermoscopic images separately. Owing to limited data, only vitiligo can be recognized from non-dermoscopic images. The proposed vitiligo recognition method averages the output probabilities of three structurally identical CNN models; it is more efficient and robust than the traditional image recognition method based on the RGB color space. The dermoscopic classification model distinguishes seven types of skin lesion, uses weighted optimization to handle the imbalanced dataset, and greatly improves sensitivity through model fusion. Further optimization and expansion of the system depend on enlarging the database.
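Averaging the class probabilities of several models, as described for the three CNNs in the abstract above, can be sketched in a few lines. This is an illustrative assumption about how such averaging typically works, not the authors' code; the function name and the probability values are hypothetical.

```python
def average_probabilities(model_outputs):
    """Average per-class probabilities across models and pick the top class.

    model_outputs: one probability list per model, all with the same class order.
    Returns (averaged probabilities, index of the most probable class).
    """
    n_models = len(model_outputs)
    n_classes = len(model_outputs[0])
    averaged = [sum(probs[c] for probs in model_outputs) / n_models
                for c in range(n_classes)]
    label = max(range(n_classes), key=lambda c: averaged[c])
    return averaged, label


# Hypothetical outputs of three models for the classes [not vitiligo, vitiligo].
outputs = [[0.6, 0.4], [0.3, 0.7], [0.2, 0.8]]
averaged, label = average_probabilities(outputs)
print(label)  # 1
```

Averaging lets two confident models outvote one uncertain model, which is one common reason ensembles of this kind are more robust than any single member.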

12.
吴俊  管鲁阳  鲍明  许耀华  叶炜 《光电工程》2019,46(5):180493-1-180493-8
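To address the problem of identifying vibration events efficiently and accurately in phase-sensitive optical time-domain reflectometry (Φ-OTDR) distributed optical fiber vibration sensing systems, this paper proposes a vibration event recognition method based on a multi-scale one-dimensional convolutional neural network (MS 1-D CNN). The raw vibration signal is preprocessed by pre-emphasis, normalization, and spectral-subtraction denoising, and the resulting one-dimensional signal is fed directly to the MS 1-D CNN for end-to-end feature extraction and recognition. When extracting features from intrusion vibration signals, the MS 1-D CNN takes both the time and frequency scales of the signal into account, and a fully connected (FC) layer and a Softmax layer complete the final recognition; compared with a two-dimensional convolutional neural network (2-D CNN) and a one-dimensional convolutional neural network (1-D CNN), it has fewer parameters to be determined. Recognition results on optical fiber vibration sensing signals for three classes of target vibration events (damage, knocking, and interference) show that the recognition accuracy of the MS 1-D CNN is close to that of the 2-D CNN, exceeding 96%, while processing speed is doubled, which helps improve the real-time performance of vibration event recognition without sacrificing recognition performance.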
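The first two preprocessing steps named in the abstract above, pre-emphasis and normalization, can be sketched as follows. The coefficient 0.97 and the choice of peak normalization are assumptions made for illustration, since the abstract gives neither; the spectral-subtraction denoising step is omitted.

```python
def pre_emphasis(signal, coeff=0.97):
    """First-order high-pass filter: y[n] = x[n] - coeff * x[n-1].

    coeff=0.97 is a commonly used value and an assumption here.
    """
    if not signal:
        return []
    return [signal[0]] + [signal[i] - coeff * signal[i - 1]
                          for i in range(1, len(signal))]


def peak_normalize(signal):
    """Scale the signal so its largest absolute value is 1 (assumed scheme)."""
    peak = max((abs(x) for x in signal), default=0.0)
    if peak == 0:
        return list(signal)  # silent or empty input is returned unchanged
    return [x / peak for x in signal]


# Hypothetical raw samples, for illustration only.
raw = [2.0, -4.0, 1.0]
print(peak_normalize(raw))                  # [0.5, -1.0, 0.25]
print(pre_emphasis([1.0, 1.0, 1.0], 0.5))   # [1.0, 0.5, 0.5]
```

Pre-emphasis attenuates slowly varying components relative to rapid changes, and normalization removes amplitude differences between recordings before the signal reaches the network.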

13.
The electroencephalogram (EEG) is a frequently used signal for detecting epileptic seizures in the brain. For successful epilepsy surgery, it is essential to localize the epileptogenic area of the brain. Signals from the epileptogenic area are focal signals, and signals from other areas of the brain are nonfocal signals. Hence, classifying focal and nonfocal signals is important for locating the epileptogenic area before epilepsy surgery. This article presents a computer-aided automatic detection and classification method for focal and nonfocal EEG signals. The EEG signal is decomposed by the dual-tree complex wavelet transform (DT-CWT), and features are computed from the decomposed coefficients. These features are used to train an adaptive neuro-fuzzy inference system (ANFIS) classifier, which then classifies the signals. The proposed system achieves 98% sensitivity, 100% specificity, and 99% accuracy for EEG signal classification. Experimental results are presented to show the effectiveness of the proposed method in classifying focal and nonfocal EEG signals. © 2016 Wiley Periodicals, Inc. Int J Imaging Syst Technol, 26, 277–283, 2016

14.
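Biological water quality monitoring usually first extracts features of aquatic organisms' stress responses in different environments and then classifies those features to identify water quality. A method using convolutional neural networks (CNNs) is proposed for the water quality monitoring problem. Fish motion trajectories are a comprehensive manifestation of the various water quality classification features used in the current literature and are an important basis for biological water quality classification. The Mask-RCNN image segmentation method is used to obtain the centroid coordinates of the fish body, the trajectory of the fish over a given period is plotted as an image, and two trajectory image datasets are built, one for normal and one for abnormal water quality. The Inception-v3 network is incorporated as the feature preprocessing stage for the dataset, and a new convolutional neural network is built to classify the features extracted by Inception-v3. Multiple groups of parallel experiments are set up to classify normal and abnormal water quality in different water environments. The results show that the CNN model achieves a water quality recognition rate of 99.38%, which fully meets the requirements of water quality identification.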

15.
Detecting alcoholism is of great importance because of its effects on individuals and society. An automatic alcoholism detection system (AADS) based on electroencephalogram (EEG) signals is effective, but designing a robust AADS is challenging. Current AADS designs rely on conventional, hand-engineered methods and have limited performance. Motivated by the success of deep learning (DL) in many recognition tasks, we implement an EEG-based automatic alcoholism detection (AAD) system using DL. A DL model has a huge number of learnable parameters and needs a large EEG dataset for training, which is not easy to obtain for the AAD problem. To solve this problem, we propose a multi-channel pyramidal convolutional neural network (MP-CNN) that requires fewer learnable parameters. Using this deep CNN model, we build an AAD system that detects from EEG signal segments whether the subject is alcoholic or normal. We validate the robustness and effectiveness of the proposed AADS using KDD, a benchmark dataset for the alcoholism detection problem. To find the brain region that contributes most to AAD, we investigated the effects of 19 selected EEG channels (SC-19), the channels from the whole brain (ALL-61), and five brain regions: TEMP, OCCIP, CENT, FRONT, and PERI. The results show that SC-19 plays a significant role in AAD, with an accuracy of 100%. A comparison shows that the proposed AADS outperforms state-of-the-art systems. The proposed AADS will be useful in medical diagnosis research and health care systems.

16.
付荣荣  李朋  刘冲  张扬 《计量学报》2022,43(5):688-695
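The recognition and classification of EEG signals is a hot research topic in brain-computer interface technology. A single classifier cannot make good use of the features or of classifier adaptability, which makes it difficult to improve recognition accuracy further; a decision-level fusion strategy based on linear discriminant analysis can be used to improve the classification accuracy of brain-computer interface systems. First, by separating out the features of false trials for the two classifiers, the decision more likely to be correct is selected from the two methods to improve classification accuracy. Second, to measure the uncertainty of each decision, feature vectors are extracted using the largest and second-largest correlation coefficients of the corresponding classifier. Based on this idea, a new decision selector is proposed that integrates two algorithms based on linear discriminant analysis and selects the decision more likely to be accurate, thereby improving EEG classification accuracy. Experimental results show that, by combining algorithms of similar accuracy, the method achieves good classification accuracy on motor imagery data.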

17.
Radial basis function (RBF) neural networks are used to classify real-life audio radar signals collected by a ground surveillance radar mounted on a tank. Currently, a human operator is required to operate the radar system and discern among signals bouncing off tanks, vehicles, planes, and so on. The objective of this project is to investigate whether a neural network can perform this target recognition task, with the aim of reducing the number of personnel required in a tank. Different signal classification methods from the neural network literature are considered. The first method employs a linear autoregressive (AR) model to extract linear features of the audio data and then classifies on these features, i.e., the AR coefficients. AR coefficient estimates based on least squares and on higher-order statistics are considered. The second approach uses nonlinear predictors to model the audio data and then classifies the signals according to the prediction errors. The real-life audio radar dataset used here was collected by an AN/PPS-15 ground surveillance radar and consists of 13 target classes, including men marching, a man walking, airplanes, a man crawling, and boats. Each classification method has some classes that are difficult to classify. Overall, the AR feature extraction approach is the most effective, with a correct classification rate of 88% on the training data and 67% on data not used for training.

18.
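Underwater acoustic signal classification is an important direction in underwater acoustics research, and an effective feature extraction and classification decision method is crucial to it. In this paper, time-frequency image features are extracted, on the basis of wavelet packet decomposition, from three classes of measured underwater acoustic signals (fish sounds, merchant ship radiated noise, and wind-generated noise), and a seven-layer convolutional neural network is built as the classifier. The results show that, on different test sets, the wavelet packet time-frequency image features of the three signal types combined with the convolutional neural network can reach (98...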

19.
Redundant data in multichannel electroencephalogram (EEG) signals significantly reduces the performance of brain-computer interface (BCI) systems. By removing redundant channels, a channel selection strategy increases the classification accuracy of BCI systems. This work proposes a novel channel selection method (stdWC), based on the standard deviation of wavelet coefficients across channels, for identifying motor imagery (MI) EEG signals. The wavelet coefficients are calculated by decomposing each trial from each EEG channel with a continuous wavelet transform (CWT) filter bank. The standard deviations of the wavelet coefficients are computed across the channels and sorted, and the channels with the highest standard deviation, that is, the largest wavelet coefficient divergence, are selected. MI trials are then spatially filtered with the common spatial pattern (CSP) method, and 2D images are generated from the spatially filtered trials using the CWT filter bank. These images are classified by a nine-layer convolutional neural network (CNN) model that combines two feature maps obtained with different filter sizes. The proposed framework (stdWC-CSP-CNN) is evaluated using kappa score and classification accuracy on two publicly accessible datasets (BCI Competition III dataset IVa and BCI Competition IV dataset 2a). It achieves a mean test classification accuracy of 88.8% on dataset IVa from BCI Competition III and 75.03% on dataset 2a from BCI Competition IV, and the proposed channel selection method outperforms the other channel selection methods examined. By rejecting redundant channels, the whole framework can improve the performance of MI-based BCIs.
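The ranking step of the channel selection described above (score each channel by the standard deviation of its wavelet coefficients, then keep the highest-scoring channels) can be sketched as follows. How the paper aggregates coefficients over trials and scales is not stated in the abstract, so this sketch assumes each channel has already been reduced to a flat list of coefficients; the function name, channel names, and values are hypothetical.

```python
import statistics


def select_channels_by_std(coeffs_by_channel, k):
    """Return the k channel names whose coefficients have the largest spread.

    coeffs_by_channel: dict mapping a channel name to a flat list of wavelet
    coefficients (assumed to be computed beforehand, e.g. by a CWT filter bank).
    """
    scores = {name: statistics.pstdev(values)
              for name, values in coeffs_by_channel.items()}
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:k]


# Hypothetical coefficients for three channels.
coeffs = {
    "C3": [0.0, 10.0, -10.0, 0.0],
    "Cz": [1.0, 1.0, 1.0, 1.0],
    "C4": [0.0, 2.0, -2.0, 0.0],
}
print(select_channels_by_std(coeffs, k=2))  # ['C3', 'C4']
```

The constant channel scores zero and is dropped first, which matches the intuition that channels with little coefficient variation carry little discriminative information.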

20.
张立峰  王智  吴思橙 《计量学报》2022,43(10):1306-1312
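A flow pattern recognition method for gas-liquid two-phase flow in vertical pipes is proposed, based on a convolutional neural network (CNN) and a gated recurrent unit (GRU). The method takes images reconstructed by an electrical resistance tomography (ERT) system, pads them, applies the discrete cosine transform (DCT), computes the difference between the maximum and minimum DCT coefficients, and uses data spanning a certain number of frames as the network input to identify the flow pattern. The effect of input sequence length on the classification accuracy of the CNN-GRU, CNN, and GRU networks is analyzed, and the optimal input vector dimensions are determined to be 60, 65, and 50, respectively. The three networks are trained and tested on experimental data; the results show that the CNN-GRU network has the highest classification accuracy, with an average flow pattern recognition accuracy of up to 99.40%.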
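The per-frame feature described above, the difference between the largest and smallest DCT coefficients of a reconstructed image, can be sketched with a direct 2-D DCT-II. The unnormalized DCT-II variant and the tiny example image are assumptions made for illustration, and the padding step from the abstract is omitted because its details are not given.

```python
import math


def dct2_1d(x):
    """Unnormalized 1-D DCT-II of a sequence (direct O(n^2) evaluation)."""
    n = len(x)
    return [sum(x[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
            for k in range(n)]


def dct2_2d(image):
    """Separable 2-D DCT-II: transform every row, then every column."""
    rows = [dct2_1d(row) for row in image]
    cols = [dct2_1d(list(col)) for col in zip(*rows)]
    return [list(row) for row in zip(*cols)]  # transpose back to row-major


def dct_range_feature(image):
    """Difference between the maximum and minimum 2-D DCT coefficients."""
    coeffs = [c for row in dct2_2d(image) for c in row]
    return max(coeffs) - min(coeffs)


# Hypothetical 2x2 "reconstructed image" with uniform conductivity.
frame = [[1.0, 1.0], [1.0, 1.0]]
print(round(dct_range_feature(frame), 6))  # 4.0
```

Computing this scalar for each frame yields a one-dimensional sequence, and a window of consecutive values (for example 60 frames) would then form the network input described in the abstract.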
