Found 20 similar documents (search time: 15 ms)
1.
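Existing X-ray pneumothorax detection has two main problems. First, because a pneumothorax usually overlaps with tissues such as the ribs and clavicles, many cases are missed in clinical practice, and the detection performance of existing algorithms still needs improvement. Second, existing algorithms based on convolutional neural networks cannot indicate the suspicious pneumothorax region and therefore lack interpretability. To address these problems, a method combining the dense convolutional network (DenseNet) with gradient-weighted class activation mapping is proposed for detecting and localizing pneumothorax in X-ray images. First, a relatively large chest X-ray dataset, PX-ray, is built for training and testing the model. Second, the output node of DenseNet is modified and a sigmoid function is added after the fully connected layer to classify chest radiographs into two classes (pneumothorax / non-pneumothorax). During training, the weights of the cross-entropy loss function are set to alleviate data imbalance and improve model accuracy. Finally, the parameters of the last convolutional layer of the network and the corresponding gradients are extracted, and a coarse localization map for the pneumothorax class is obtained with the gradient-weighted class activation mapping algorithm. Experimental results on the PX-ray test set show that the proposed method reaches a detection accuracy of 95.45%, with area under the curve (AUC), sensitivity, and specificity all above 0.9, outperforming VGG19, GoogLeNet, and ResNet, while also visualizing the pneumothorax region.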
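The class-weighted cross-entropy mentioned in the abstract above can be illustrated with a minimal sketch. The function name `weighted_bce` and the parameters `pos_weight` and `neg_weight` are assumptions made for illustration; the abstract does not state how the authors chose or applied their weights.

```python
import math


def weighted_bce(probs, labels, pos_weight=1.0, neg_weight=1.0, eps=1e-12):
    """Mean class-weighted binary cross-entropy over sigmoid outputs.

    probs  -- predicted probabilities for the positive class (after sigmoid)
    labels -- ground-truth labels, 1 = positive, 0 = negative
    The two weights are hypothetical knobs: raising pos_weight makes errors
    on the rarer positive class cost more.
    """
    total = 0.0
    for p, y in zip(probs, labels):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total -= pos_weight * y * math.log(p) + neg_weight * (1 - y) * math.log(1.0 - p)
    return total / len(labels)


# One positive and one negative sample, both predicted at 0.5;
# the positive sample is weighted three times as heavily.
loss = weighted_bce([0.5, 0.5], [1, 0], pos_weight=3.0)
```

A common heuristic, not confirmed by the abstract, is to set the positive weight near the ratio of negative to positive training samples.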
2.
Multimedia Tools and Applications - Data-driven deep learning techniques set the current state of the art in image classification for hyperspectral remote sensing images. The lack of labeled...
3.
Teng Jianing Zhang Dong Lee Dah-Jye Chou Yao 《Multimedia Tools and Applications》2019,78(9):11155-11172
Multimedia Tools and Applications - Food recognition is the first step for dietary assessment. Computer vision technology is being viewed as an effective tool for automatic food recognition for...
4.
Jain Anamika Singh Satish Kumar Singh Krishna Pratap 《Multimedia Tools and Applications》2020,79(27-28):19993-20018
Multimedia Tools and Applications - Handwritten signatures are an undeniable and unique way to prove the identity of persons. Owing to the simplicity and uniqueness, it finds an essential place in...
5.
Multimedia Tools and Applications - Development of expertise in Face Recognition has led researchers to apply its various techniques for newborn recognition as some of the problems such as...
6.
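A method based on a deep convolutional neural network is proposed for automatically recognizing standard echocardiographic views, and the effectiveness of the deep model is analyzed through visualization. Because fully connected layers hold most of a network's parameters, a spatial pyramid mean pooling layer is introduced to replace them, capturing more spatial structure information while greatly reducing the number of model parameters and the risk of overfitting. Class-saliency regions are used to bring an attention-like mechanism into the model visualization process. Using the recognition of standard echocardiographic views as a case study, the robustness and effectiveness of the deep convolutional neural network model are explained. Visualization experiments on echocardiograms show that the basis on which the improved deep model makes its recognition decisions is consistent with the basis on which physicians distinguish standard echocardiographic views, demonstrating the effectiveness and practicality of the proposed method.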
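As a rough illustration of the spatial pyramid mean pooling idea in the abstract above, the sketch below pools a single 2-D feature map over grids of increasing resolution and concatenates the bin means. The function name, the `levels` parameter, and the bin-boundary rule are assumptions; the paper's actual layer configuration is not given in the abstract.

```python
def spatial_pyramid_mean_pool(feature_map, levels=(1, 2)):
    """Pool a 2-D feature map (list of rows) into a fixed-length vector.

    For each pyramid level n, the map is split into an n x n grid and the
    mean of every cell is appended, so the output length is sum(n * n)
    regardless of the input height and width (both must be >= max(levels)).
    """
    h, w = len(feature_map), len(feature_map[0])
    pooled = []
    for n in levels:
        for i in range(n):
            r0, r1 = i * h // n, (i + 1) * h // n  # row range of this cell
            for j in range(n):
                c0, c1 = j * w // n, (j + 1) * w // n  # column range
                cell = [feature_map[r][c] for r in range(r0, r1) for c in range(c0, c1)]
                pooled.append(sum(cell) / len(cell))
    return pooled


small = spatial_pyramid_mean_pool([[1, 2], [3, 4]])
large = spatial_pyramid_mean_pool([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
```

Both calls return five values (one from the 1 x 1 level, four from the 2 x 2 level). That fixed length is what allows such a layer to stand in for a fully connected layer without adding learned parameters.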
7.
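To address the low sensitivity, high false-positive rate, and difficulty of detecting small nodules in traditional pulmonary nodule detection, a two-stage pulmonary nodule detection framework based on deep convolutional neural networks is proposed. In the first stage, a feature pyramid sub-network extracts multi-level features from lung images, and a multi-scale region proposal sub-network is introduced to detect all candidate nodules at high sensitivity. In the second stage, a cascaded convolutional neural network model is designed to reduce false positives; misclassified samples are retained for retraining the model, and multiple models...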
8.
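Crowd counting is of significant value in intelligent surveillance. To address varying crowd scales, uneven crowd density distribution, and occlusion, a crowd counting method based on a multi-scale multi-task convolutional neural network (MMCNN) is proposed. First, a novel adaptive human-shaped kernel is proposed to generate density maps that describe crowd information and eliminate the effect of occlusion. Second, a multi-scale convolutional neural network is built to handle varying crowd scales, and a multi-task learning mechanism estimates the density map and the crowd density level simultaneously to handle uneven crowd distribution. Finally, a weighted loss function is designed to improve counting accuracy. The method was evaluated on the UCF_CC_50 and World Expo'10 datasets, verifying the effectiveness of the adaptive human-shaped kernel. Experimental results show that, compared with the method of Sindagi et al. (SINDAGI V A, PATEL V M. CNN-based cascaded multi-task learning of high-level prior and density estimation for crowd counting. Proceedings of the 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance. Piscataway, NJ: IEEE, 2017: 1-6), the proposed algorithm reduces the mean absolute error (MAE) and mean squared error (MSE) on UCF_CC_50 by about 1.7 and 45, respectively. Compared with the method of Zhang et al. (ZHANG Y, ZHOU D, CHEN S, et al. Single-image crowd counting via multi-column convolutional neural network. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC: IEEE Computer Society, 2016: 589-597), it reduces the MAE on World Expo'10 by about 1.5. On a real-world bus dataset the counting error is only 0 to 3 people, indicating that the method is practical.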
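The two evaluation metrics named in the abstract above can be sketched as follows, using the textbook definitions over per-image predicted and true counts. The function names are illustrative. Some crowd-counting papers report the square root of the squared-error average under the name MSE; the abstract does not say which convention it uses, so the plain definition is shown.

```python
def mean_absolute_error(predicted, actual):
    """Average absolute difference between predicted and true counts."""
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)


def mean_squared_error(predicted, actual):
    """Average squared difference between predicted and true counts."""
    return sum((p - a) ** 2 for p, a in zip(predicted, actual)) / len(actual)


# Made-up counts for three images, for illustration only.
predicted_counts = [10, 20, 33]
true_counts = [12, 20, 30]
mae = mean_absolute_error(predicted_counts, true_counts)
mse = mean_squared_error(predicted_counts, true_counts)
```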
9.
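The composition of the neighborhood plays a crucial role in spatial-domain graph convolutional network (GCN) models. To address the problem that node neighborhood ordering in such models does not consider structural influence, a new neighborhood selection strategy is proposed, yielding an improved GCN model. First, structurally important neighbors are collected for each node and selected hierarchically to obtain a core neighborhood. Then, the features of the node and its core neighborhood are arranged into an ordered matrix. Finally, the matrix is fed into a deep convolutional neural network (CNN) for semi-supervised learning. Experimental results on node classification show that the model achieves higher node classification accuracy on the Cora, Citeseer, and Pubmed citation network datasets than node classification models based on classical graph embedding and four state-of-the-art GCN models. As a spatial-domain GCN, the model can be applied effectively to learning tasks on large-scale networks.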
10.
Sanket Shashwat Vergin Raja Sarobin M. Jani Anbarasi L. Thakor Jayraj Singh Urmila Narayanan Sathiya 《Multimedia Tools and Applications》2022,81(16):22263-22288
Multimedia Tools and Applications - With over 172 Million people infected with the novel coronavirus (COVID-19) globally and with the numbers increasing exponentially, the dire need of a fast...
11.
Abeje Bekalu Tadele Salau Ayodeji Olalekan Mengistu Abreham Debasu Tamiru Nigus Kefyalew 《Multimedia Tools and Applications》2022,81(20):29027-29043
Multimedia Tools and Applications - In recent years, several technologies have been utilized to bridge the communication gap between persons who have hearing or speaking impairments and those who...
12.
Genre is an abstract feature, yet it is considered one of the important characteristics of music. Genre recognition forms an essential component of a large number of commercial music applications. Most existing music genre recognition algorithms are based on manual feature extraction techniques. The extracted features are used to develop a classifier model to identify the genre. However, in many cases a set of features that gives excellent accuracy fails to explain the underlying typical characteristics of music genres. It has also been observed that some features provide a satisfactory level of performance on a particular dataset but fail to provide similar performance on other datasets. Hence, each dataset mostly requires manual selection of appropriate acoustic features to achieve an adequate level of performance on it. In this paper, we propose a genre recognition algorithm that uses almost no handcrafted features. The convolutional recurrent neural network-based model proposed in this study is trained on mel-spectrograms extracted from 3-s audio clips taken from the GTZAN dataset. The proposed model provides an accuracy of 85.36% on 10-class genre classification. The same model has been trained and tested on 10 genres of the MagnaTagATune dataset, which has 18,476 clips of 29-s duration, yielding an accuracy of 86.06%. The experimental results suggest that the proposed architecture, with the mel-spectrogram as input feature, is capable of providing consistent performance across the different datasets.
13.
This paper presents a method to capture human pose from individual real-world RGB images using a deep learning technique. Current work on estimating human pose by deep learning is designed in a detection or a regression framework, and in a part-based manner. As a new perspective, we introduce a classification scheme for this problem, which reasons about the pose holistically. To the best of our knowledge, this is the first work on the holistic human pose classification task, which owes its feasibility to the great power of convolutional neural networks in feature learning. After a convolutional neural network is trained to classify the input image as one of the KeyPoses, the final pose is computed as a linear combination of several KeyPoses. In this new holistic classification approach, the vast, high-degree-of-freedom human pose space is divided into a finite number of subspaces, and the convolutional neural network shows promising results in learning the features of each subspace. Empirical results (PCP and PCK rates) demonstrate that the proposed scheme is able to understand human pose (i.e., predict a valid, true and coarse pose) in real-world unconstrained images with challenges such as severe occlusion, high articulation, low quality and cluttered background. Furthermore, the proposed method removes the need to define a complex model (such as an appearance model or pairwise joint relations). We have also verified a potential application of our proposed method in semantic image retrieval based on human pose.
14.
Dorj Ulzii-Orshikh Lee Keun-Kwang Choi Jae-Young Lee Malrey 《Multimedia Tools and Applications》2018,77(8):9909-9924
Multimedia Tools and Applications - This paper addresses the demand for an intelligent and rapid classification system of skin cancer using contemporary highly-efficient deep convolutional neural...
15.
Dhiman Ritika Kang Gurkanwal Singh Gupta Varun 《Multimedia Tools and Applications》2021,80(21-23):32041-32069
Multimedia Tools and Applications - Emotion recognition through speech is one of the fundamental approaches for human interaction. Speech modulations stipulate different emotions and context. In...
16.
Yolcu Gozde Oztel Ismail Kazan Serap Oz Cemil Palaniappan Kannappan Lever Teresa E. Bunyak Filiz 《Multimedia Tools and Applications》2019,78(22):31581-31603
Multimedia Tools and Applications - Facial expressions are a significant part of non-verbal communication. Recognizing facial expressions of people with neurological disorders is essential because...
17.
Chest diseases are among the greatest health problems for people living in the developing world. Millions of people around the world are diagnosed with a chest disease every year. Chronic obstructive pulmonary disease, pneumonia, asthma, tuberculosis, and lung cancer are the most important chest diseases, and they are very common illnesses in Turkey. In this paper, a study on chest disease diagnosis was carried out using an artificial immune system. The artificial immune system achieved a classification accuracy of 93.84%. The result was compared with the results of previously reported similar studies on chest disease diagnosis. The chest diseases dataset was prepared from a chest diseases hospital's database using patients' epicrisis reports.
18.
《Expert systems with applications》2007,32(2):329-335
A new neural network model called DIAGNET is proposed in this paper for diagnosing gastrointestinal disorders. DIAGNET is a combination of a backpropagation neural network (BPNN) and a radial basis function neural network (RBFNN). The symptoms and signs are collected from patients through oral interviews. Because the patients' inputs are linguistic in nature, an artificial domain is created and fuzzy membership values are defined. The fuzzy values are fed as inputs to DIAGNET, which is trained to diagnose diseases related to gastrointestinal disorders. The trained model is tested with new patients' symptoms and signs. The performance of DIAGNET is compared with that of the existing backpropagation and radial basis function neural network models. Sensitivity, specificity, and receiver operating characteristics (ROC) are used as indicators for testing the accuracy of the models that predict gastrointestinal disorders. The results suggest that DIAGNET can be a better solution for complex, nonlinear medical decision support systems.
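Sensitivity and specificity, used as indicators in the abstract above, follow from the confusion-matrix counts of a binary classifier. The sketch below uses the standard definitions; the function name and the toy labels are illustrative and do not come from the DIAGNET paper.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels, 1 = diseased.

    sensitivity = TP / (TP + FN): share of diseased cases that are detected
    specificity = TN / (TN + FP): share of healthy cases that are cleared
    A ratio is reported as NaN when its class is absent from y_true.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Toy labels for eight patients, for illustration only.
truth = [1, 1, 1, 0, 0, 0, 0, 1]
preds = [1, 1, 0, 0, 0, 1, 0, 1]
sens, spec = sensitivity_specificity(truth, preds)
```

An ROC curve is obtained by recomputing these two values while sweeping the decision threshold applied to the model's output score.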
19.
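Pulmonary nodule image retrieval algorithm based on a multi-scale dense network (cited by: 1; self-citations: 0; citations by others: 1)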
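Existing content-based medical image retrieval (CBMIR) algorithms suffer from inadequate feature extraction, which leads to incomplete expression of image semantic information and poor retrieval performance. A multi-scale dense network algorithm is therefore proposed to improve retrieval precision. First, 512×512 pulmonary nodule images are reduced to 64×64, and dense modules are added to bridge the gap between the extracted low-level features and high-level semantic features. Second, because different layers of the network extract different information from pulmonary nodule images, a multi-scale approach that combines global image features with local nodule features is used to generate retrieval hash codes, improving retrieval precision and efficiency. Analysis of the experimental results shows that, compared with the adaptive bit retrieval (ABR) algorithm, the proposed algorithm achieves a retrieval precision of 91.17% for pulmonary nodule images at a hash code length of 64 bits, an improvement of 3.5 percentage points; retrieving one lung slice takes 48 μs on average. In expressing rich image semantic features and in retrieval efficiency, the proposed algorithm outperforms the other network structures compared, and it is suitable for supporting physicians' clinical diagnosis and helping patients receive effective treatment.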
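Once images are encoded as binary hash codes, as in the abstract above, retrieval is commonly done by ranking database entries by Hamming distance to the query code. The sketch below assumes that ranking scheme; the abstract does not state which distance the authors use, and the function names and 4-bit toy codes are hypothetical.

```python
def hamming_distance(code_a, code_b):
    """Number of differing bits between two integer-encoded hash codes."""
    return bin(code_a ^ code_b).count("1")


def retrieve(query_code, database, top_k=3):
    """Return the names of the top_k entries closest to the query code.

    database maps an image name to its integer hash code; ties keep the
    dictionary's insertion order because Python's sort is stable.
    """
    ranked = sorted(database.items(), key=lambda item: hamming_distance(query_code, item[1]))
    return [name for name, _ in ranked[:top_k]]


# Hypothetical 4-bit codes; the paper's codes are 64 bits long.
codes = {"slice_a": 0b1111, "slice_b": 0b1100, "slice_c": 0b0000}
nearest = retrieve(0b1111, codes, top_k=2)
```

Because the comparison reduces to an XOR and a bit count, lookups over short codes are cheap, which fits the microsecond-scale retrieval time reported above.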
20.
Multimedia Tools and Applications - Object detection in computer vision has been a significant research area for the past decade. Identifying objects with multiple classes from an image has...