首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 16 毫秒
1.
目的研究无需进行复杂的图像预处理和人工特征提取,就能提高光学遥感图像的船只检测准确率和实现船只类型精细分类。方法对输入的检测图像,采用选择性搜索的方法产生船只候选区域,用已经标记好的训练样本对卷积神经网络进行监督训练,得到网络参数,然后使用经过监督训练的卷积神经网络提取抽象特征,并对候选区域进行分类,根据船只候选区域的分类概率同时确定船只的位置以及类型。结果与现有的2种检测方法进行对比,实验结果表明卷积神经网络能有效提高船只检测准确率,平均检测准确率达到了93.3%。结论该检测方法无需进行复杂的预处理,能同时对船只进行检测和分类,并能有效提高船只检测准确率。  相似文献   

2.
Currently, some photorealistic computer graphics are very similar to photographic images. Photorealistic computer generated graphics can be forged as photographic images, causing serious security problems. The aim of this work is to use a deep neural network to detect photographic images (PI) versus computer generated graphics (CG). In existing approaches, image feature classification is computationally intensive and fails to achieve real-time analysis. This paper presents an effective approach to automatically identify PI and CG based on deep convolutional neural networks (DCNNs). Compared with some existing methods, the proposed method achieves real-time forensic tasks by deepening the network structure. Experimental results show that this approach can effectively identify PI and CG with average detection accuracy of 98%.  相似文献   

3.
张莹  黄影平  郭志阳  张冲 《光电工程》2021,48(12):210340-1-210340-12
道路检测是车辆实现自动驾驶的前提。近年来,基于深度学习的多源数据融合成为当前自动驾驶研究的一个热点。本文采用卷积神经网络对激光雷达点云和图像数据加以融合,实现对交通场景中道路的分割。本文提出了像素级、特征级和决策级多种融合方案,尤其是在特征级融合中设计了四种交叉融合方案,对各种方案进行对比研究,给出最佳融合方案。在网络构架上,采用编码解码结构的语义分割卷积神经网络作为基础网络,将点云法线特征与RGB图像特征在不同的层级进行交叉融合。融合后的数据进入解码器还原,最后使用激活函数得到检测结果。实验使用KITTI数据集进行评估,验证了各种融合方案的性能,实验结果表明,本文提出的融合方案E具有最好的分割性能。与其他道路检测方法的比较实验表明,本文方法可以获得较好的整体性能。  相似文献   

4.
As the use of facial attributes continues to expand, research into facial age estimation is also developing. Because face images are easily affected by factors including illumination and occlusion, the age estimation of faces is a challenging process. This paper proposes a face age estimation algorithm based on lightweight convolutional neural network in view of the complexity of the environment and the limitations of device computing ability. Improving face age estimation based on Soft Stagewise Regression Network (SSR-Net) and facial images, this paper employs the Center Symmetric Local Binary Pattern (CSLBP) method to obtain the feature image and then combines the face image and the feature image as network input data. Adding feature images to the convolutional neural network can improve the accuracy as well as increase the network model robustness. The experimental results on IMDB-WIKI and MORPH 2 datasets show that the lightweight convolutional neural network method proposed in this paper reduces model complexity and increases the accuracy of face age estimations.  相似文献   

5.
为了提高目标检测的准确性,提出了一种基于深度学习利用特征图加权融合实现目标检测的方法。首先,提出将卷积神经网络中的浅层特征图采样后与最深层特征图进行加权融合的思想;其次,根据所提的特征图加权融合思想以及卷积神经网络的具体结构,制定相应的特征图加权融合方案,并由该方案得到新特征图;然后,提出改进的RPN网络,并将新特征图输入到改进的RPN网络得到区域建议;最后,将新特征图和区域建议输入到后续网络层完成目标检测。实验结果表明所提方法取得了更高的目标检测精度以及更好的目标检测效果。  相似文献   

6.
金瑶  张锐  尹东 《光电工程》2019,46(9):190053-1-190053-8
视频图像中的小像素目标难以检测。针对城市道路视频中的小像素目标,本文提出了一种改进YOLOv3的卷积神经网络Road_Net检测方法。首先,基于改进的YOLOv3,设计了一种新的卷积神经网络Road_Net;其次,针对小像素目标检测更依赖于浅层特征,采用了4个尺度检测方法。最后,结合改进的M-Softer-NMS算法来进一步提高图像中目标的检测精度。为了验证所提出算法的有效性,本文收集并标注了用于城市道路小像素目标物体检测的数据集Road-garbage Dataset,实验结果表明,本文算法能有效地检测出诸如纸屑、石块等在视频中相对于路面的较小像素目标。  相似文献   

7.
Vehicle type classification is considered a central part of an intelligent traffic system. In recent years, deep learning had a vital role in object detection in many computer vision tasks. To learn high-level deep features and semantics, deep learning offers powerful tools to address problems in traditional architectures of handcrafted feature-extraction techniques. Unlike other algorithms using handcrated visual features, convolutional neural network is able to automatically learn good features of vehicle type classification. This study develops an optimized automatic surveillance and auditing system to detect and classify vehicles of different categories. Transfer learning is used to quickly learn the features by recording a small number of training images from vehicle frontal view images. The proposed system employs extensive data-augmentation techniques for effective training while avoiding the problem of data shortage. In order to capture rich and discriminative information of vehicles, the convolutional neural network is fine-tuned for the classification of vehicle types using the augmented data. The network extracts the feature maps from the entire dataset and generates a label for each object (vehicle) in an image, which can help in vehicle-type detection and classification. Experimental results on a public dataset and our own dataset demonstrated that the proposed method is quite effective in detection and classification of different types of vehicles. The experimental results show that the proposed model achieves 96.04% accuracy on vehicle type classification.  相似文献   

8.
Compressive strength of concrete is a significant factor to assess building structure health and safety. Therefore, various methods have been developed to evaluate the compressive strength of concrete structures. However, previous methods have several challenges in costly, time-consuming, and unsafety. To address these drawbacks, this paper proposed a digital vision based concrete compressive strength evaluating model using deep convolutional neural network (DCNN). The proposed model presented an alternative approach to evaluating the concrete strength and contributed to improving efficiency and accuracy. The model was developed with 4,000 digital images and 61,996 images extracted from video recordings collected from concrete samples. The experimental results indicated a root mean square error (RMSE) value of 3.56 (MPa), demonstrating a strong feasibility that the proposed model can be utilized to predict the concrete strength with digital images of their surfaces and advantages to overcome the previous limitations. This experiment contributed to provide the basis that could be extended to future research with image analysis technique and artificial neural network in the diagnosis of concrete building structures.  相似文献   

9.
目的 针对锂电池极片涂布缺陷种类多,传统方法分类检测精度不高,以及人工依赖性强等问题,提出一种基于卷积神经网络的锂电池极片涂布缺陷自动分类算法。方法 首先对网络结构以及模型参数进行优化,接着在网络中加入跳跃连接结构,将空洞卷积提取到的多尺度特征与高层特征进行融合以获取更多缺陷特征,并采用LeakyReLU(Leaky Rectified Linear Unit)激活函数保留图像中的负值特征信息,最后通过构建的数据集训练模型,实现锂电池极片涂布缺陷的准确分类。结果 实验结果表明,当前方法识别准确率能够达到99.34%,平均检测时间为51ms。结论 改进后的方法能够准确分类出锂电池极片18种涂布缺陷,满足工业生产中实时分类检测的要求。  相似文献   

10.
目的 交通标志识别作为智能驾驶、交通系统研究中的一项重要内容,具有较大的理论价值和应用前景.尤其是文本型交通标志,其含有丰富的高层语义信息,能够提供极其丰富的道路信息.因此通过设计并实现一套新的端到端交通标志文本识别系统,达到有效缓解交通拥堵、提高道路安全的目的.方法 系统主要包括文本区域检测和文字识别两个视觉任务,并基于卷积神经网络的深度学习技术实现.首先以ResNet-50为骨干网络提取特征,并采用类FPN结构进行多层特征融合,将融合后的特征作为文本检测和识别的共享特征.文本检测定位文本区域并输出候选文本框的坐标,文字识别输出词条对应的文本字符串.结果 通过实验验证,系统在Traffic Guide Panel Dataset上取得了令人满意的结果,行识别准确率为71.08%.结论 端到端交通标志文本识别非常具有现实意义.通过卷积神经网络的深度学习技术,提出了一套端到端交通标志文本识别系统,并在开源的Traffic Guide Panel Dataset上证明了该系统的优越性.  相似文献   

11.
Distributed Denial-of-Service (DDoS) has caused great damage to the network in the big data environment. Existing methods are characterized by low computational efficiency, high false alarm rate and high false alarm rate. In this paper, we propose a DDoS attack detection method based on network flow grayscale matrix feature via multiscale convolutional neural network (CNN). According to the different characteristics of the attack flow and the normal flow in the IP protocol, the seven-tuple is defined to describe the network flow characteristics and converted into a grayscale feature by binary. Based on the network flow grayscale matrix feature (GMF), the convolution kernel of different spatial scales is used to improve the accuracy of feature segmentation, global features and local features of the network flow are extracted. A DDoS attack classifier based on multi-scale convolution neural network is constructed. Experiments show that compared with correlation methods, this method can improve the robustness of the classifier, reduce the false alarm rate and the missing alarm rate.  相似文献   

12.
沈明玉  俞鹏飞  汪荣贵  杨娟  薛丽霞 《光电工程》2019,46(11):180489-1-180489-9
卷积神经网络在单帧图像超分辨率重建任务中取得了巨大成功,但是其重建模型多是基于单链结构,层间联系较弱且不能充分利用网络提取的分层特征。针对这些问题,本文设计了一种多路径递归的网络结构(MRCN)。通过使用多路径结构来加强层之间的联系,实现特征的有效利用并且提取丰富的高频成分,同时使用递归结构降低训练难度。此外,通过引入特征融合的操作使得在重建的过程中可以充分利用各层提取的特征,并且自适应的选择有效特征。在常用的基准测试集上进行了大量实验表明,MRCN比现有的方法在重建效果上具有明显提升。  相似文献   

13.
Breast cancer is caused by the abnormal and rapid growth of breast cells. An early diagnosis can ensure an easier and effective treatment. A mass in the breast is a significant early sign of breast cancer, even though differentiating the cancerous mass's tissue from normal tissue for diagnosis is a difficult task for radiologists. The development of computer-aided detection systems in recent years has led to nondestructive and efficient cancer diagnostic techniques. This paper proposes a comprehensive method to locate the cancerous region in the mammogram image. This method employs image noise reduction, optimal image segmentation based on the convolutional neural network, a grasshopper optimization algorithm, and optimized feature extraction and feature selection based on the grasshopper optimization algorithm, thereby improving precision and decreasing the computational cost. This method was applied to the Mammographic Image Analysis Society Digital Mammogram Database and Digital Database for Screening Mammography breast cancer databases and the simulation results were compared with 10 different state-of-the-art methods to analyze the proposed system's efficiency. Final results showed that the proposed method had 96% Sensitivity, 93% Specificity, 85% PPV, 97% NPV, 92% accuracy, and better efficiency than other traditional methods in terms of Sensitivity, Specificity, PPV, NPV, and Accuracy.  相似文献   

14.
Underwater target recognition is a key technology for underwater acoustic countermeasure. How to classify and recognize underwater targets according to the noise information of underwater targets has been a hot topic in the field of underwater acoustic signals. In this paper, the deep learning model is applied to underwater target recognition. Improved anti-noise Power-Normalized Cepstral Coefficients (ia-PNCC) is proposed, based on PNCC applied to underwater noises. Multitaper and normalized Gammatone filter banks are applied to improve the anti-noise capacity. The method is combined with a convolutional neural network in order to recognize the underwater target. Experiment results show that the acoustic feature presented by ia-PNCC has lower noise and are well-suited to underwater target recognition using a convolutional neural network. Compared with the combination of convolutional neural network with single acoustic feature, such as MFCC (Mel-scale Frequency Cepstral Coefficients) or LPCC (Linear Prediction Cepstral Coefficients), the combination of the ia-PNCC with a convolutional neural network offers better accuracy for underwater target recognition.  相似文献   

15.
In the paper, a convolutional neural network based on quaternion transformation is proposed to detect median filtering for color images. Compared with conventional convolutional neural network, color images can be processed in a holistic manner in the proposed scheme, which makes full use of the correlation between RGB channels. And due to the use of convolutional neural network, it can effectively avoid the one-sidedness of artificial features. Experimental results have shown the scheme’s improvement over the state-of-the-art scheme on the accuracy of color image median filtering detection.  相似文献   

16.
为了提高图像中阴影检测的准确性,提出一种利用深度神经网络实现阴影检测的方法.首先,构造了一种密集特征图融合结构,将不同卷积层产生的特征图进行融合;其次,针对图像中阴影的多种尺度特征,设计了一种串并联结合的扩张卷积结构提取图像中阴影多尺度特征;最后,将串并联结合的扩张卷积结构和密集特征图融合结构进行结合,设计出一种端到端...  相似文献   

17.
Deep neural network has proven to be very effective in computer vision fields. Deep convolutional network can learn the most suitable features of certain images without specific measure functions and outperform lots of traditional image processing methods. Generative adversarial network (GAN) is becoming one of the highlights among these deep neural networks. GAN is capable of generating realistic images which are imperceptible to the human vision system so that the generated images can be directly used as intermediate medium for many tasks. One promising application of using GAN generated images would be image concealing which requires the embedded image looks like not being tampered to human vision system and also undetectable to most analyzers. Texture synthesizing has drawn lots of attention in computer vision field and is used for image concealing in steganography and watermark. The traditional methods which use synthesized textures for information hiding mainly select features and mathematic functions by human metrics and usually have a low embedding rate. This paper takes advantage of the generative network and proposes an approach for synthesizing complex texture-like image of arbitrary size using a modified deep convolutional generative adversarial network (DCGAN), and then demonstrates the feasibility of embedding another image inside the generated texture while the difference between the two images is nearly invisible to the human eyes.  相似文献   

18.
Fingerprint identification systems have been widely deployed in many occasions of our daily life. However, together with many advantages, they are still vulnerable to the presentation attack (PA) by some counterfeit fingerprints. To address challenges from PA, fingerprint liveness detection (FLD) technology has been proposed and gradually attracted people's attention. The vast majority of the FLD methods directly employ convolutional neural network (CNN), and rarely pay attention to the problem of over-parameterization and over-fitting of models, resulting in large calculation force of model deployment and poor model generalization. Aiming at filling this gap, this paper designs a lightweight multi-scale convolutional neural network method, and further proposes a novel hybrid spatial pyramid pooling block to extract abundant features, so that the number of model parameters is greatly reduced, and support multi-scale true/fake fingerprint detection. Next, the representation self-challenge (RSC) method is used to train the model, and the attention mechanism is also adopted for optimization during execution, which alleviates the problem of model over-fitting and enhances generalization of detection model. Finally, experimental results on two publicly benchmarks: LivDet2011 and LivDet2013 sets, show that our method achieves outstanding detection results for blind materials and cross-sensor. The size of the model parameters is only 548 KB, and the average detection error of cross-sensors and cross-materials are 15.22 and 1 respectively, reaching the highest level currently available.  相似文献   

19.
目的研究数字图像中的去模糊问题,从受损的模糊图像中恢复出清晰图像。方法针对现有图像去模糊算法无法保留图像高频信息及容易产生振铃效应等问题,提出一种基于Y通道反卷积和卷积神经网络的两阶段自适应去模糊算法(SDYCNN)。在第1阶段,将数字图像转换至YUV颜色空间,根据图像无参考质量评价分数与模糊核尺寸之间的对应关系,在Y通道内自适应确定模糊核尺寸并进行反卷积增强;第2阶段将第1阶段中的反卷积增强作为预处理方式,通过4层卷积神经网络建立反卷积增强后的图像与清晰图像之间的映射关系,实现图像去模糊。结果轻微模糊图像在第1阶段便能够得到较好的去模糊效果,严重模糊图像经过第1阶段的反卷积增强,也有助于神经网络中特征的快速提取。结论实验结果表明,该算法不仅对于模糊图像具有良好的恢复效果,运算效率也有显著提升。  相似文献   

20.
Classification of skin lesions is a complex identification challenge. Due to the wide variety of skin lesions, doctors need to spend a lot of time and effort to judge the lesion image which zoomed through the dermatoscopy. The diagnosis which the algorithm of identifying pathological images assists doctors gets more and more attention. With the development of deep learning, the field of image recognition has made longterm progress. The effect of recognizing images through convolutional neural network models is better than traditional image recognition technology. In this work, we try to classify seven kinds of lesion images by various models and methods of deep learning, common models of convolutional neural network in the field of image classification include ResNet, DenseNet and SENet, etc. We use a fine-tuning model with a multi-layer perceptron, by training the skin lesion model, in the validation set and test set we use data expansion based on multiple cropping, and use five models’ ensemble as the final results. The experimental results show that the program has good results in improving the sensitivity of skin lesion diagnosis.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号