首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
李海山  唐海艳  梁栋  韩军 《包装工程》2021,42(23):170-177
目的 提取样本图像颜色直方图特征对卷积神经网络进行训练,达到快速、高准确率检测图像颜色缺陷的目的.方法 将标准图像从RGB颜色空间转换至HSV颜色空间,通过改变图像H,S,V三分量值获取训练样本和测试样本;在HSV颜色空间中非均匀量化图像的颜色直方图,得到所有训练样本和测试样本的颜色直方图特征;利用样本图像颜色直方图特征训练卷积神经网络,然后对测试样本进行检测,研究检测的速度、准确率,并将该检测方法与逐像素、超像素、BP神经网络和支持向量机方法进行对比.结果 对于图片尺寸为512×512的彩色图像,卷积神经网络检测单幅图片的平均检测时间约为57.66 ms,训练样本图像为50000张时,卷积神经网络方法对10000张测试样本进行检测的准确率为99.77%.结论 卷积神经网络方法在保证高准确率的前提下大幅提高检测精度,对于印刷品色差缺陷在线检测具有良好的应用价值.  相似文献   

2.
An analysis of features extracted from handwriting samples according to writer demographics and writing task characteristics is presented. The individual demographics studied here include age, gender and handedness, while the handwriting tasks considered include writing the individual signature, form-filling, cheque-completion and constructing free-form written text. By analysing different features of handwriting, the authors establish a link between a writer's individual characteristics including demographic properties, the handwriting task being attempted and quantifiable features of handwriting such as pen velocity, acceleration and slant. Additionally, imitated or 'forged' handwriting is also analysed on exactly the same basis. The analysis is performed on a newly collected database of handwriting samples collected from a population of 150 writers, and which can be utilised in both forensic document inspection and automatic handwriting analysis research. All handwriting samples, including forgery attempts, were recorded both temporally as a series of pen positional coordinates and scanned at a resolution of 600 dpi to enable both dynamic and static processing.  相似文献   

3.
4.
A system for classifying four basic table tennis strokes using wearable devices and deep learning networks is proposed in this study. The wearable device consisted of a six-axis sensor, Raspberry Pi 3, and a power bank. Multiple kernel sizes were used in convolutional neural network (CNN) to evaluate their performance for extracting features. Moreover, a multiscale CNN with two kernel sizes was used to perform feature fusion at different scales in a concatenated manner. The CNN achieved recognition of the four table tennis strokes. Experimental data were obtained from 20 research participants who wore sensors on the back of their hands while performing the four table tennis strokes in a laboratory environment. The data were collected to verify the performance of the proposed models for wearable devices. Finally, the sensor and multi-scale CNN designed in this study achieved accuracy and F1 scores of 99.58% and 99.16%, respectively, for the four strokes. The accuracy for five-fold cross validation was 99.87%. This result also shows that the multi-scale convolutional neural network has better robustness after five-fold cross validation.  相似文献   

5.
6.
7.
Handwriting is an obtained apparatus utilized for correspondence of one’s recognition or sentiments. Components that judge a person’s handwriting is not merely subject to the individual’s handwriting depends on the background, additionally considers like nervousness, inspiration and the reason for the handwriting. In spite of the high variation, in a man’s handwriting, recent outcomes from various writers have demonstrated that it has adequate individual quality to be utilized as an identification strategy. In this paper, the authors are the pact with a novel approach to text dependent writer identification in view of pre-segmented Gurmukhi characters. The text dependent writer identification framework proposed in this paper includes distinctive stages like preprocessing, feature extraction, classification or identification. The feature extraction stage incorporates four schemes, zoning, diagonal, transitions and peak extent based features. To analyze the proposed framework execution, experiments are performed with two classifiers, namely, k-NN and SVM. SVM is also considered with linear-kernel in the present work. For experimental results, we have collected 31,500 samples from 90 different writers for 35 class problem. Maximum writer identification accuracy of 89.85% has been achieved by using a combination of zoning, transition and peak extent based features with Linear-SVM classifier when we have taken 70% data as the training set and remaining 30% data as the testing set. Using 10-fold cross validation, we have achieved an accuracy of 94.76% with a combination of zoning, transition and peak extent based features and Linear-SVM classifier.  相似文献   

8.
Artificial intelligence, which has recently emerged with the rapid development of information technology, is drawing attention as a tool for solving various problems demanded by society and industry. In particular, convolutional neural networks (CNNs), a type of deep learning technology, are highlighted in computer vision fields, such as image classification and recognition and object tracking. Training these CNN models requires a large amount of data, and a lack of data can lead to performance degradation problems due to overfitting. As CNN architecture development and optimization studies become active, ensemble techniques have emerged to perform image classification by combining features extracted from multiple CNN models. In this study, data augmentation and contour image extraction were performed to overcome the data shortage problem. In addition, we propose a hierarchical ensemble technique to achieve high image classification accuracy, even if trained from a small amount of data. First, we trained the UC-Merced land use dataset and the contour images for each image on pretrained VGGNet, GoogLeNet, ResNet, DenseNet, and EfficientNet. We then apply a hierarchical ensemble technique to the number of cases in which each model can be deployed. These experiments were performed in cases where the proportion of training datasets was 30%, 50%, and 70%, resulting in a performance improvement of up to 4.68% compared to the average accuracy of the entire model.  相似文献   

9.
Background—Human Gait Recognition (HGR) is an approach based on biometric and is being widely used for surveillance. HGR is adopted by researchers for the past several decades. Several factors are there that affect the system performance such as the walking variation due to clothes, a person carrying some luggage, variations in the view angle. Proposed—In this work, a new method is introduced to overcome different problems of HGR. A hybrid method is proposed or efficient HGR using deep learning and selection of best features. Four major steps are involved in this work-preprocessing of the video frames, manipulation of the pre-trained CNN model VGG-16 for the computation of the features, removing redundant features extracted from the CNN model, and classification. In the reduction of irrelevant features Principal Score and Kurtosis based approach is proposed named PSbK. After that, the features of PSbK are fused in one materix. Finally, this fused vector is fed to the One against All Multi Support Vector Machine (OAMSVM) classifier for the final results. Results—The system is evaluated by utilizing the CASIA B database and six angles 00°, 18°, 36°, 54°, 72°, and 90° are used and attained the accuracy of 95.80%, 96.0%, 95.90%, 96.20%, 95.60%, and 95.50%, respectively. Conclusion—The comparison with recent methods show the proposed method work better.  相似文献   

10.
刘丽  孙刘杰  王文举 《包装工程》2020,41(19):223-229
目的 为了实现高通量dPCR基因芯片荧光图像的亮点分类与计数,提出一种基于支持向量机(SVM)的荧光图像分类与计数方法。方法 首先对荧光图像进行去噪、对比度增强等图像预处理,对预处理后荧光图像进行亮点区域提取标注,去除背景与暗点的冗余信息,利用方向梯度直方图(Histogram of Oriented Gradient, HOG)提取鉴别特征,计算合并所有样本的亮点特征得到HOG特征向量,根据已得到的HOG特征向量创建一个线性SVM分类器,利用训练好的SVM分类器对荧光图像亮点进行分类与计数。结果 对比传统算法,文中算法具有较高的分类识别精度,平均准确率高达98%以上,可以很好地实现荧光图像亮点分类与计数。结论 在有限的小样本标注数据下,文中算法具有良好的分类性能,能够有效识别荧光图像中的亮点,对其他荧光图像分类研究也具有一定参考价值。  相似文献   

11.
The COVID-19 pandemic poses an additional serious public health threat due to little or no pre-existing human immunity, and developing a system to identify COVID-19 in its early stages will save millions of lives. This study applied support vector machine (SVM), k-nearest neighbor (K-NN) and deep learning convolutional neural network (CNN) algorithms to classify and detect COVID-19 using chest X-ray radiographs. To test the proposed system, chest X-ray radiographs and CT images were collected from different standard databases, which contained 95 normal images, 140 COVID-19 images and 10 SARS images. Two scenarios were considered to develop a system for predicting COVID-19. In the first scenario, the Gaussian filter was applied to remove noise from the chest X-ray radiograph images, and then the adaptive region growing technique was used to segment the region of interest from the chest X-ray radiographs. After segmentation, a hybrid feature extraction composed of 2D-DWT and gray level co-occurrence matrix was utilized to extract the features significant for detecting COVID-19. These features were processed using SVM and K-NN. In the second scenario, a CNN transfer model (ResNet 50) was used to detect COVID-19. The system was examined and evaluated through multiclass statistical analysis, and the empirical results of the analysis found significant values of 97.14%, 99.34%, 99.26%, 99.26% and 99.40% for accuracy, specificity, sensitivity, recall and AUC, respectively. Thus, the CNN model showed significant success; it achieved optimal accuracy, effectiveness and robustness for detecting COVID-19.  相似文献   

12.
针对人-机器人语音交互中经过语音识别的文本指令,提出了一种利用汉语拼音中声韵母作为特征的深度学习文本分类模型。首先,以无人驾驶车语音导航控制为人机交互的应用背景,分析其文本指令结构并分别构建单一意图与复杂意图语料库;其次,在以字符作为文本分类特征的基础上,结合汉语拼音与英文单词的区别,提出了一种利用拼音声韵母字符作为中文文本分类的特征表示方法;然后,用门控递归单元(GRU)代替传统递归神经网络单元以解决其难以捕获长时间维度特征的不足,为提取信息的高阶特征、缩短特征序列长度并加快模型收敛速度,建立了一种结合卷积神经网络及GRU递归神经网络的深度学习文本分类模型。最后,为验证模型在处理长、短序列任务上的表现,在上述两个语料库上对提出的模型分别进行十折交叉测试,并与其他分类方法进行比较与分析,结果表明该模型显著地提高了分类准确率。  相似文献   

13.
Citrus fruit crops are among the world’s most important agricultural products, but pests and diseases impact their cultivation, resulting in yield and quality losses. Computer vision and machine learning have been widely used to detect and classify plant diseases over the last decade, allowing for early disease detection and improving agricultural production. This paper presented an automatic system for the early detection and classification of citrus plant diseases based on a deep learning (DL) model, which improved accuracy while decreasing computational complexity. The most recent transfer learning-based models were applied to the Citrus Plant Dataset to improve classification accuracy. Using transfer learning, this study successfully proposed a Convolutional Neural Network (CNN)-based pre-trained model (EfficientNetB3, ResNet50, MobiNetV2, and InceptionV3) for the identification and categorization of citrus plant diseases. To evaluate the architecture’s performance, this study discovered that transferring an EfficientNetb3 model resulted in the highest training, validating, and testing accuracies, which were 99.43%, 99.48%, and 99.58%, respectively. In identifying and categorizing citrus plant diseases, the proposed CNN model outperforms other cutting-edge CNN model architectures developed previously in the literature.  相似文献   

14.
This paper presents a language-based efficient post-processing algorithm for the recognition of online unconstrained handwritten Gurmukhi characters. A total of 93 stroke classes have been identified to recognize the Gurmukhi character set in this work. Support Vector Machine (SVM) classifier has been employed for stroke classification. The main objective of this paper is to improve the character level recognition accuracy using an efficient Finite State Automata (FSA)-based formation of Gurmukhi characters algorithm. A database of 21,945 online handwritten Gurmukhi words is primarily collected in this experiment. After analysing the collected database, we have observed that a character can be written using one or more strokes. Therefore, a total of 65,946 strokes have been annotated using the 93 identified stroke classes. Among these strokes, 15,069 stroke samples are considered for training the classifier. The proposed system achieved promising recognition accuracy of 97.3% for Gurmukhi characters, when tested with a new database of 8,200 characters, written by 20 different writers.  相似文献   

15.
Brain tumor refers to the formation of abnormal cells in the brain. It can be divided into benign and malignant. The main diagnostic methods for brain tumors are plain X-ray film, Magnetic resonance imaging (MRI), and so on. However, these artificial diagnosis methods are easily affected by external factors. Scholars have made such impressive progress in brain tumors classification by using convolutional neural network (CNN). However, there are still some problems: (i) There are many parameters in CNN, which require much calculation. (ii) The brain tumor data sets are relatively small, which may lead to the overfitting problem in CNN. In this paper, our team proposes a novel model (RBEBT) for the automatic classification of brain tumors. We use fine-tuned ResNet18 to extract the features of brain tumor images. The RBEBT is different from the traditional CNN models in that the randomized neural network (RNN) is selected as the classifier. Meanwhile, our team selects the bat algorithm (BA) to optimize the parameters of RNN. We use five-fold cross-validation to verify the superiority of the RBEBT. The accuracy (ACC), specificity (SPE), precision (PRE), sensitivity (SEN), and F1-score (F1) are 99.00%, 95.00%, 99.00%, 100.00%, and 100.00%. The classification performance of the RBEBT is greater than 95%, which can prove that the RBEBT is an effective model to classify brain tumors.  相似文献   

16.
针对传统的滚动轴承故障诊断方法依赖人工特征提取和专家经验,难以自适应提取强噪声信号微弱故障特征的问题,提出一种直方图均衡化和卷积神经网络(CNN)相结合的智能诊断方法。首先,将传感器采集到的一维振动信号通过横向插值法转换为便于模型识别的二维振动图像,利用直方图均衡化技术拉伸像素之间灰度值差别的动态范围,突出纹理细节和对比度,以增强周期性故障特征;然后构建深层CNN模型,采用优化技术降低模型参数量,逐层学习监测数据与故障状态之间的复杂映射关系。实验结果表明该方法具有高达99%以上的准确率,对不同负载下的故障信号仍具有较高的识别精度和泛化能力。  相似文献   

17.
Magnetic resonance imaging (MRI) of brain needs an impeccable analysis to investigate all its structure and pattern. This analysis may be a sharp visual analysis by an experienced medical professional or by a computer aided diagnosis system that can help to predict, what may be the recent condition. Similarly, on the basis of various information and technique, a system can be designed to detect whether a patient is prone to Alzheimer's disease or not. And this task of detection of abnormalities at an initial stage from brain MRI is a major challenge in the field of neurosciences. The main idea behind our research is to utilize the deep layers feature extraction benefited from deep neural network architecture, without extensive hardware resource training, and classifying the image on a basis of simple machine-learning algorithm with selected best features in order to reduce work load, classification error and hardware utilization time. We have utilized convolution neural network (CNN) layer using similar architecture like that of Alexnet with some parametric change, for the automatic extraction of features of images obtained from slice extraction of whole brain MRI whereas 13 manual features based on gray level co-occurrence matrix were also extracted to test the impact of this features on ranking. If we had only classified using CNN network, the misclassification rate was much higher. So, feature selection is achieved with feature ranking algorithms like Mutinffs, ReliefF, Laplacian and UDFS and so on and also tested with different machine-learning techniques like Support Vector Machine, K-Nearest Neighbor and Subspace Ensemble under different testing condition. The performance of the result is satisfactory with classification accuracy around 98% to 99% with 7:3 ratio of random holdout partition of training to testing image sets and also with fivefolds of cross-validation on the same set using a standardized template.  相似文献   

18.
19.
提出一种新的文本段落聚类策略,该策略采用多特征融合思想尽可能多地挖掘段落内的特征,并采用累积Logistic 回归分析方法来拟合这些特征与段落相似度之间的内在关联,使得段落相似度计算的结果更为理想.最后采用层次聚合聚类算法中的complete-link方法对段落集合进行聚类处理.通过网络真实文本进行了段落相似度度量实验和段落聚类实验,实验结果显示了方法的可行性.  相似文献   

20.
Biopsy is one of the most commonly used modality to identify breast cancer in women, where tissue is removed and studied by the pathologist under the microscope to look for abnormalities in tissue. This technique can be time-consuming, error-prone, and provides variable results depending on the expertise level of the pathologist. An automated and efficient approach not only aids in the diagnosis of breast cancer but also reduces human effort. In this paper, we develop an automated approach for the diagnosis of breast cancer tumors using histopathological images. In the proposed approach, we design a residual learning-based 152-layered convolutional neural network, named as ResHist for breast cancer histopathological image classification. ResHist model learns rich and discriminative features from the histopathological images and classifies histopathological images into benign and malignant classes. In addition, to enhance the performance of the developed model, we design a data augmentation technique, which is based on stain normalization, image patches generation, and affine transformation. The performance of the proposed approach is evaluated on publicly available BreaKHis dataset. The proposed ResHist model achieves an accuracy of 84.34% and an F1-score of 90.49% for the classification of histopathological images. Also, this approach achieves an accuracy of 92.52% and F1-score of 93.45% when data augmentation is employed. The proposed approach outperforms the existing methodologies in the classification of benign and malignant histopathological images. Furthermore, our experimental results demonstrate the superiority of our approach over the pre-trained networks, namely AlexNet, VGG16, VGG19, GoogleNet, Inception-v3, ResNet50, and ResNet152 for the classification of histopathological images.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号