共查询到20条相似文献,搜索用时 15 毫秒
1.
This survey paper aims to show methods to analyze and classify field satellite images using deep learning and machine learning algorithms. Users of deep learning-based Convolutional Neural Network (CNN) technology to harvest fields from satellite images or generate zones of interest were among the planned application scenarios (ROI). Using machine learning, the satellite image is placed on the input image, segmented, and then tagged. In contemporary categorization, field size ratio, Local Binary Pattern (LBP) histograms, and color data are taken into account. Field satellite image localization has several practical applications, including pest management, scene analysis, and field tracking. The relationship between satellite images in a specific area, or contextual information, is essential to comprehending the field in its whole. 相似文献
2.
Earlier recognition of breast cancer is crucial to decrease the severity and optimize the survival rate. One of the commonly utilized imaging modalities for breast cancer is histopathological images. Since manual inspection of histopathological images is a challenging task, automated tools using deep learning (DL) and artificial intelligence (AI) approaches need to be designed. The latest advances of DL models help in accomplishing maximum image classification performance in several application areas. In this view, this study develops a Deep Transfer Learning with Rider Optimization Algorithm for Histopathological Classification of Breast Cancer (DTLRO-HCBC) technique. The proposed DTLRO-HCBC technique aims to categorize the existence of breast cancer using histopathological images. To accomplish this, the DTLRO-HCBC technique undergoes pre-processing and data augmentation to increase quantitative analysis. Then, optimal SqueezeNet model is employed for feature extractor and the hyperparameter tuning process is carried out using the Adadelta optimizer. Finally, rider optimization with deep feed forward neural network (RO-DFFNN) technique was utilized employed for breast cancer classification. The RO algorithm is applied for optimally adjusting the weight and bias values of the DFFNN technique. For demonstrating the greater performance of the DTLRO-HCBC approach, a sequence of simulations were carried out and the outcomes reported its promising performance over the current state of art approaches. 相似文献
3.
目的 解决现有工业线束导线排序检测方法中存在的效率低、混色导线检测效果差等问题。方法 基于机器视觉技术设计一种线束导线排序检测装置,并结合图像处理技术和深度学习原理提出一种混色导线排序检测方法。首先根据线束图像中选择的感兴趣区域,分割出线束连接器图像和导线图像,并采用模板匹配和颜色定位方法完成连接器正反面的识别和单色导线的识别定位;然后采集并制作PE混色导线数据集,研究Faster R−CNN、SSD、YOLOv3和YOLOv5m等4种不同目标检测算法对PE混色导线的检测效果。结果 实验结果表明,YOLOv5m检测模型的检测速度和准确率兼顾性最好;改进系统后,检测时间减少了18.55%,平均识别准确率为98.83%。结论 改进后检测系统具有良好的检测效率和可靠性,适用于种类丰富的工业线束导线排序检测。 相似文献
4.
针对人脸关键点检测(人脸对齐)在应用场景下的速度和精度需求,首先在SSD基础之上融合更多分布均匀的特征层,对人脸框坐标进行级联预测,形成对于多尺度人脸信息均具有更加鲁棒响应的深度学习检测器MR-SSD。其次在局部二值特征LBF的级联形状回归方法基础上,提出了基于面部像素差值的多角度初始化算法。采用端正人脸正负90°倾斜范围内的五组特征点形状进行初始化,求取每组回归后形状的眼部特征点像素均方差值并以最大者对应方案作为最终回归形状,从而实现对多角度倾斜人脸优异的拟合效果。本文所提出的最优架构可以实时获得极具鲁棒性的人脸框坐标并且可实现对于多角度倾斜人脸的关键点检测。 相似文献
5.
Automated segmentation of blood vessels in retinal fundus images is essential for medical image analysis. The segmentation of retinal vessels is assumed to be essential to the progress of the decision support system for initial analysis and treatment of retinal disease. This article develops a new Grasshopper Optimization with Fuzzy Edge Detection based Retinal Blood Vessel Segmentation and Classification (GOFED-RBVSC) model. The proposed GOFED-RBVSC model initially employs contrast enhancement process. Besides, GOAFED approach is employed to detect the edges in the retinal fundus images in which the use of GOA adjusts the membership functions. The ORB (Oriented FAST and Rotated BRIEF) feature extractor is exploited to generate feature vectors. Finally, Improved Conditional Variational Auto Encoder (ICAVE) is utilized for retinal image classification, shows the novelty of the work. The performance validation of the GOFED-RBVSC model is tested using benchmark dataset, and the comparative study highlighted the betterment of the GOFED-RBVSC model over the recent approaches. 相似文献
6.
Tarun Agrawal;Prakash Choudhary;Achyut Shankar;Prabhishek Singh;Manoj Diwakar; 《International journal of imaging systems and technology》2024,34(1):e22956
One of the most fatal and prevalent diseases of the central nervous system is a brain tumour. Different subgrades exist for each type of brain tumour because of the broad variety of brain tumours and tumour pathologies. Manual diagnosis may be error-prone and time-consuming, both of which are becoming more challenging as the medical community's workload grows. There is a need for automatic diagnosis. In this study, we have proposed a deep learning model (MultiFeNet) based on a convolutional neural network for the classification of brain tumours. MultiFeNet uses multi-scale feature scaling for feature extraction in magnetic resonance imaging (MRI) images. Multi-scaling helps to learn the better feature representation of the MRI image for enhanced model performance. To evaluate the proposed model, 3064 MRI scans of three distinct categories of brain tumours (meningiomas, gliomas and pituitary tumours) were used. The MultiFeNet obtained 96.4% sensitivity, 96.4% F1-score, 96.4% precision and 96.4% accuracy on the benchmark Figshare dataset. In addition, an ablation study is conducted with the objective of evaluating the role of multi-scaling in model performance. 相似文献
7.
机器视觉作为代替人工检测轮毂表面缺陷的重要手段,是目前该领域的主要研究方向,因此针对汽车轮毂表面缺陷检测技术的研究现状进行了综述与分析。首先,从轮毂表面缺陷的类别和人工检测流程入手,阐述了基于机器视觉的轮毂表面缺陷检测技术的要求和难点。其次,分析了基于机器视觉的智能检测算法的发展历程,包括传统的机器视觉方法在缺陷图像预处理、缺陷定位和特征提取、缺陷分类识别中的应用;卷积神经网络(CNN)等深度学习方法在缺陷检测、分割以及其他方面的应用。最后,介绍了现有轮毂型号识别装置、轮毂缺陷X射线图像采集装置、轮毂表面缺陷图像采集装置,并在分析当前基于机器视觉的智能检测装置在实际应用中的局限性及需要解决的若干关键技术问题的基础上,提出了3种智能检测实验装置设计方案,为全自动快速检测装置的研制与性能提升提供理论与技术支撑。 相似文献
8.
Souad Larabi-Marie-Sainte Eatedal Alabdulkreem Mohammad Alamgeer Mohamed K Nour Anwer Mustafa Hilal Mesfer Al Duhayyim Abdelwahed Motwakel Ishfaq Yaseen 《计算机、材料和连续体(英文)》2022,72(3):4589-4601
Electroencephalography (EEG) eye state classification becomes an essential tool to identify the cognitive state of humans. It can be used in several fields such as motor imagery recognition, drug effect detection, emotion categorization, seizure detection, etc. With the latest advances in deep learning (DL) models, it is possible to design an accurate and prompt EEG EyeState classification problem. In this view, this study presents a novel compact bat algorithm with deep learning model for biomedical EEG EyeState classification (CBADL-BEESC) model. The major intention of the CBADL-BEESC technique aims to categorize the presence of EEG EyeState. The CBADL-BEESC model performs feature extraction using the ALexNet model which helps to produce useful feature vectors. In addition, extreme learning machine autoencoder (ELM-AE) model is applied to classify the EEG signals and the parameter tuning of the ELM-AE model is performed using CBA. The experimental result analysis of the CBADL-BEESC model is carried out on benchmark results and the comparative outcome reported the supremacy of the CBADL-BEESC model over the recent methods. 相似文献
9.
Yongwon Cho Sang Min Lee Young‐Hoon Cho June‐Goo Lee Beomhee Park Gaeun Lee Namkug Kim Joon Beom Seo 《International journal of imaging systems and technology》2021,31(1):72-81
We investigated whether a convolutional neural network (CNN) can enhance the usability of computer‐aided detection (CAD) of chest radiographs for various pulmonary abnormal lesions. The numbers of normal and abnormal patients were 6055 and 3463, respectively. Two radiologists delineated regions of interest for lesions and labeled the disease types as ground truths. The datasets were split into training, tuning, and testing as 7:1: 2. Total test sets were randomly selected in 1214 normal and 690 abnormal. A 5‐fold, cross‐validation was performed on our datasets. For the classification of normal and abnormal, we developed a CNN based on DenseNet169; for abnormal detection, The You Only Look Once (YOLO) v2 with DenseNet was used. Detection and classification of normal and five classes of diseases (nodule[s], consolidation, interstitial opacity, pleural effusion, and pneumothorax) on chest radiographs were analyzed. Our CNN model classified chest radiographs as normal or abnormal with an accuracy of 97.8%. For the results of the abnormal, F1 score, was 75.2 ± 2.28% for nodules, 55.0 ± 4.3% for consolidation, 78.2 ± 7.85% for interstitial opacity, 81.6 ± 2.07% for pleural effusion, and 70.0 ± 7.97% for pneumothorax, respectively. In addition, we conducted the experiments between our method and RetinaNet with only nodules. The results of our method and RetinaNet at cutoff‐0.5 in the free response operating characteristic curve were 83.45% and 80.55%, respectively. Our algorithm demonstrated viable detection and disease classification capacity and could be used for CAD of lung diseases on chest radiographs. 相似文献
10.
R. Deiva Nayagam;D. Selvathi; 《International journal of imaging systems and technology》2024,34(2):e23064
Prostate Cancer (PCa) is a prevalent global threat to male health, contributing significantly to male cancer-related mortality. Timely detection and management are pivotal for improved outcomes, as successful cure rates are highest in the early stages. Deep learning (DL) methodologies offer a promising avenue to enhance the precision of detection, potentially reducing mortality rates. Magnetic resonance imaging (MRI) stands out as an effective tool for PCa diagnosis, providing comprehensive visuals of the prostate and adjacent tissues. This technology aids in identifying cancerous developments early on, crucial for treatment planning. Utilizing MRI for PCa detection has demonstrated increased accuracy, minimizing unnecessary biopsies, and facilitating personalized treatment decisions. Recent studies showcase the potential of DL methods in identifying and segmenting the prostate in MRI scans. These techniques not only improve diagnostic precision but also assist in treatment planning and monitoring. Incorporating DL in MRI-based PCa diagnosis holds immense potential for enhancing efficiency and accuracy, promising better treatment outcomes. However, further research is imperative to explore these methods comprehensively, especially in larger and more diverse patient populations. This review evaluates the progress in employing DL techniques, rooted in artificial intelligence, for categorizing and outlining the prostate and lesions in MR images, underscoring the need for continued investigation and validation in varied clinical settings. 相似文献
11.
Yongtao Shi;Wei Du;Chao Gao;Xinzhi Li; 《International journal of imaging systems and technology》2024,34(5):e23178
Accurately and rapidly segmenting the prostate in transrectal ultrasound (TRUS) images remains challenging due to the complex semantic information in ultrasound images. The paper discusses a cross-layer connection with SegFormer attention U-Net for efficient TRUS image segmentation. The SegFormer framework is enhanced by reducing model parameters and complexity without sacrificing accuracy. We introduce layer-skipping connections for precise positioning and combine local context with global dependency for superior feature recognition. The decoder is improved with Multi-layer Perceptual Convolutional Block Attention Module (MCBAM) for better upsampling and reduced information loss, leading to increased accuracy. The experimental results show that compared with classic or popular deep learning methods, this method has better segmentation performance, with the dice similarity coefficient (DSC) of 97.55% and the intersection over union (IoU) of 95.23%. This approach balances encoder efficiency, multi-layer information flow, and parameter reduction. 相似文献
12.
受限波尔兹曼机 总被引:8,自引:0,他引:8
受限波尔兹曼机(restricted Boltzmann machines,RBM)是一类具有两层结构、对称连接且无自反馈的随机神经网络模型,层间全连接,层内无连接.近年来,随着RBM的快速学习算法一对比散度的出现,机器学习界掀起了研究RBM理论及应用的热潮.实践表明,RBM是一种有效的特征提取方法,用于初始化前馈神经网络可明显提高泛化能力,堆叠多个RBM组成的深度信念网络能提取更抽象的特征.鉴于RBM的优点及其在深度学习中的广泛应用,本文对RBM的基本模型、学习算法、参数设置、评估方法、变形算法等进行了详细介绍,最后探讨了RBM在未来值得研究的方向. 相似文献
13.
Hamza Safwan Zeshan Iqbal Rashid Amin Muhammad Attique Khan Majed Alhaisoni Abdullah Alqahtani Ye Jin Kim Byoungchol Chang 《计算机、材料和连续体(英文)》2023,75(2):2365-2381
Software-defined networking (SDN) represents a paradigm shift in network traffic management. It distinguishes between the data and control planes. APIs are then used to communicate between these planes. The controller is central to the management of an SDN network and is subject to security concerns. This research shows how a deep learning algorithm can detect intrusions in SDN-based IoT networks. Overfitting, low accuracy, and efficient feature selection is all discussed. We propose a hybrid machine learning-based approach based on Random Forest and Long Short-Term Memory (LSTM). In this study, a new dataset based specifically on Software Defined Networks is used in SDN. To obtain the best and most relevant features, a feature selection technique is used. Several experiments have revealed that the proposed solution is a superior method for detecting flow-based anomalies. The performance of our proposed model is also measured in terms of accuracy, recall, and precision. F1 rating and detection time Furthermore, a lightweight model for training is proposed, which selects fewer features while maintaining the model’s performance. Experiments show that the adopted methodology outperforms existing models. 相似文献
14.
15.
V. Karthikeyan;M. Navin Kishore;S. Sajin; 《International journal of imaging systems and technology》2024,34(1):e23022
Kidney disease is a major health problem that affects millions of people around the world. Human kidney problems can be diagnosed with the help of computed tomography (CT), which creates cross-sectional slices of the organ. A deep end-to-end convolutional neural network (CNN) model is proposed to help radiologists detect and characterize kidney problems in CT scans of patients. This has the potential to improve diagnostic accuracy and efficiency, which in turn benefits patient care. Our strategy involves teaching a suggested deep end-to-end CNN to distinguish between healthy and diseased kidneys. The recommended CNN is trained using a standard CT image library that has been annotated to show kidney stones, cysts, and tumors. The model can then be used to detect kidney abnormalities in fresh CT scans, which may enhance the effectiveness and speed with which diagnoses are made. A total of 1812 pictures were used, each one a unique cross-sectional CT scan of the patient. Our model has a detection rate of 99.17% in CT scan validation tests. We employed a different dataset with a total of 5077 normal samples, 3709 cyst samples, 1377 stone samples, and 2283 tumor samples. In tests, our model proved to be 99.68% accurate. The suggested framework has been validated by applying it to the clinical dataset, resulting in 99% accuracy in predictions. As low-cost and portable CT scanners become more commonplace, the described concept may soon be employed outside of a hospital environment, at the point of treatment, or even in the patient's own home. 相似文献
16.
深度卷积神经网络在目标检测与识别等方面表现出了优异性能,但将其用于SAR目标识别时,较少的训练样本和深度模型的优化设计是必须解决的两个问题。本文设计了一种结合二维随机卷积特征和集成超限学习机的SAR目标识别算法。首先,随机生成具有不同宽度的二维卷积核,对输入图像进行卷积与池化操作,提取随机卷积特征向量。其次,为提高分类器的泛化能力,并降低训练时间,基于集成学习思想对提取的卷积特征进行随机采样,然后采用超限学习机训练基分类器。最后,通过投票表决法对基分类器的分类结果进行集成。采用MSTAR数据集进行了SAR目标识别实验,实验结果表明,由于采用的超限学习机具有快速训练能力,训练时间降低了数十倍,在无需进行数据增强的情况下,分类精度与采用数据增强和多层卷积神经网络的深度学习算法相当。提出的算法具有实现简单、需要调整参数少等优点,采用集成学习思想提高了分类器的泛化能力。
相似文献17.
Fast recognition of elevator buttons is a key step for service robots to ride elevators automatically. Although there are some studies in this field, none of them can achieve real-time application due to problems such as recognition speed and algorithm complexity. Elevator button recognition is a comprehensive problem. Not only does it need to detect the position of multiple buttons at the same time, but also needs to accurately identify the characters on each button. The latest version 5 of you only look once algorithm (YOLOv5) has the fastest reasoning speed and can be used for detecting multiple objects in real-time. The advantages of YOLOv5 make it an ideal choice for detecting the position of multiple buttons in an elevator, but it’s not good at specific word recognition. Optical character recognition (OCR) is a well-known technique for character recognition. This paper innovatively improved the YOLOv5 network, integrated OCR technology, and applied them to the elevator button recognition process. First, we changed the detection scale in the YOLOv5 network and only maintained the detection scales of 40 * 40 and 80 * 80, thus improving the overall object detection speed. Then, we put a modified OCR branch after the YOLOv5 network to identify the numbers on the buttons. Finally, we verified this method on different datasets and compared it with other typical methods. The results show that the average recall and precision of this method are 81.2% and 92.4%. Compared with others, the accuracy of this method has reached a very high level, but the recognition speed has reached 0.056 s, which is far higher than other methods. 相似文献
18.
Elif Nur Haner Kırğıl;Çağatay Berke Erdaş; 《International journal of imaging systems and technology》2024,34(4):e23148
Skin cancer occurs when abnormal cells in the top layer of the skin, known as the epidermis, undergo uncontrolled growth due to unrepaired DNA damage, leading to the development of mutations. These mutations lead to rapid cell growth and development of cancerous tumors. The type of cancerous tumor depends on the cells of origin. Overexposure to ultraviolet rays from the sun, tanning beds, or sunlamps is a primary factor in the occurrence of skin cancer. Since skin cancer is one of the most common types of cancer and has a high mortality, early diagnosis is extremely important. The dermatology literature has many studies of computer-aided diagnosis for early and highly accurate skin cancer detection. In this study, the classification of skin cancer was provided by Regnet x006, EfficientNetv2 B0, and InceptionResnetv2 deep learning methods. To increase the classification performance, hairs and black pixels in the corners due to the nature of dermoscopic images, which could create noise for deep learning, were eliminated in the preprocessing step. Preprocessing was done by hair removal, cropping, segmentation, and applying a median filter to dermoscopic images. To measure the performance of the proposed preprocessing technique, the results were obtained with both raw images and preprocessed images. The model developed to provide a solution to the classification problem is based on deep learning architectures. In the four experiments carried out within the scope of the study, classification was made for the eight classes in the dataset, squamous cell carcinoma and basal cell carcinoma classification, benign keratosis and actinic keratosis classification, and finally benign and malignant disease classification. According to the results obtained, the best accuracy values of the experiments were obtained as 0.858, 0.929, 0.917, and 0.906, respectively. The study underscores the significance of early and accurate diagnosis in addressing skin cancer, a prevalent and potentially fatal condition. The primary aim of the preprocessing procedures was to attain enhanced performance results by concentrating solely on the area spanning the lesion instead of analyzing the complete image. Combining the suggested preprocessing strategy with deep learning techniques shows potential for enhancing skin cancer diagnosis, particularly in terms of sensitivity and specificity. 相似文献
19.
目的 针对人工分拣组成的零件包装盒常常会出现缺少部分零件的问题,开发一套集训练、识别、分选于一体的智能分拣系统.方法 在设计过程中,提出一种基于深度学习的改进Yolov3算法,针对工业现场光照、业零件形状和质地等实际因素,对Yolo算法的训练和检测进行改进,通过对包装盒产品的一次拍摄,检测出画面中出现的预设物体,并与标准设置相比对,从而判断出该盒内产品是否有缺料、多料的情况,以此分选出合格与否的包装盒.结果 在物体摆放相互重叠不超过20%的情况下,物体检测的准确率为98.2%,召回率为99.5%.结论 通过文中提出的改进算法,设计的检测系统能够在复杂的工业现场环境下正常工作,并能对包装的完整性进行准确的检测. 相似文献
20.
Kalyanakumar Jayapriya Israel Jeena Jacob 《International journal of imaging systems and technology》2020,30(2):348-357
Fully convolutional networks (FCNs) take the input of arbitrary size and produce correspondingly sized output with efficient inference and learning. The automatic diagnosis of melanoma is very essential for reducing the mortality rate by identifying the disease in earlier stages. A two-stage framework is used for implementing the melanoma detection, segmentation of skin lesion, and identification of melanoma lesions. Two FCNs based on VGG-16 and GoogLeNet are incorporated for improving the segmentation accuracy. A hybrid framework is used for incorporating these two FCNs. The classification is done by extracting the feature from segmented lesion by using deep residual network and a hand-crafted feature. Classification is done by support vector machine. The performance analysis of our framework gives a promising accuracy, that is, 0.8892 for classification in ISBI 2016 dataset and 0.853 for ISIC 2017 dataset. 相似文献