Similar literature
1.
Epilepsy is a brain disorder that causes recurrent seizures. It is the second most common neurological disease after Alzheimer’s. Its effects in children are serious, since it causes a slower growth rate and a failure to develop certain skills. In the medical field, specialists record brain activity using an electroencephalogram (EEG) to observe epileptic seizures. Detection of these seizures is performed by specialists, but the results might not be accurate due to human error; therefore, automated detection of pediatric epileptic seizures might be the optimal solution. This paper investigates the detection of epileptic seizures by applying supervised machine learning techniques. The techniques were applied to data from patients aged seven years and below in the Children’s Hospital Boston-Massachusetts Institute of Technology (CHB-MIT) scalp EEG database of pediatric epileptic signals. Naïve Bayes (NB), support vector machine (SVM), logistic regression (LR), k-nearest neighbor (KNN), linear discriminant (LD), decision tree (DT), and ensemble learning methods were applied to the classification process. The ensemble learning model achieved 100% on all parameters, outperforming state-of-the-art studies in the literature. The SVM model achieved 98.3% sensitivity, 97.7% specificity, and 98% accuracy. The LD and LR models performed worst, with sensitivity of 66.9%–68.9%, specificity of 73.5%–77.1%, and accuracy of 70.2%–73%.
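
The three figures reported in this abstract (sensitivity, specificity, accuracy) all derive from a binary confusion matrix. The following is a minimal sketch of those standard definitions; the function name `binary_metrics` and the toy labels are illustrative assumptions, not code or data from the paper.

```python
def binary_metrics(y_true, y_pred, positive=1):
    """Return (sensitivity, specificity, accuracy) for binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    # Sensitivity: share of true seizure segments that were detected.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    # Specificity: share of non-seizure segments correctly rejected.
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    return sensitivity, specificity, accuracy


# Toy labels (1 = seizure segment, 0 = non-seizure); invented for illustration.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 1, 0]
sens, spec, acc = binary_metrics(y_true, y_pred)
```

On an imbalanced EEG recording, accuracy alone can look high while sensitivity stays low, which is presumably why the abstract reports all three.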

2.
Artificial intelligence, which has recently emerged with the rapid development of information technology, is drawing attention as a tool for solving various problems demanded by society and industry. In particular, convolutional neural networks (CNNs), a type of deep learning technology, are prominent in computer vision fields such as image classification and recognition and object tracking. Training these CNN models requires a large amount of data, and a lack of data can degrade performance through overfitting. As CNN architecture development and optimization studies have become active, ensemble techniques have emerged that perform image classification by combining features extracted from multiple CNN models. In this study, data augmentation and contour image extraction were performed to overcome the data shortage problem. In addition, we propose a hierarchical ensemble technique to achieve high image classification accuracy even when training on a small amount of data. First, we trained pretrained VGGNet, GoogLeNet, ResNet, DenseNet, and EfficientNet on the UC-Merced land use dataset and on the contour images extracted from each image. We then applied a hierarchical ensemble technique to each case in which the models can be deployed. These experiments were performed with training set proportions of 30%, 50%, and 70%, resulting in a performance improvement of up to 4.68% compared to the average accuracy of all models.

3.
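Objective: Point cloud data obtained from 3D scanners often contain many anomalies, such as noise points, missing points, and outliers. To improve the classification accuracy of 3D point cloud data in the presence of these anomalies, a highly robust 3D point cloud classification method based on ensemble learning is proposed. Methods: An ensemble learning approach based on max voting is proposed that combines the classification results of two deep neural networks, improving the generalization and accuracy of the network; global feature enhancement and a center loss function are used to optimize the network structure, improving classification accuracy and robustness. Results: The proposed method shortens model training to 30 iterations, and classification accuracy is effectively improved in the presence of noise points, missing points, and outliers. Conclusion: With noise points, missing points, and outliers present, the proposed EL-3D algorithm is more robust than current point cloud classification methods.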

4.
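Sentiment classification is a text classification task that extracts sentiment orientation from text. By combining several classifiers, ensemble learning can achieve better results on sentiment classification than individual classifiers. However, because individual classifiers perform differently on a given dataset, their weights in the ensemble are difficult to determine. To address the problem of optimizing individual classifier weights in ensemble learning, an ensemble classification method that uses differential evolution to optimize those weights is proposed and applied to Chinese sentiment classification. With classification accuracy as the fitness value, the differential evolution algorithm optimizes the weight combination of five individual classifiers, and experiments are conducted on review corpora from three domains. The experimental results show that, compared with conventional ensemble methods, the proposed method achieves better classification performance on Chinese sentiment classification.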
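
The idea in this abstract, searching for classifier weights with differential evolution while using ensemble accuracy as the fitness value, can be sketched as follows. This is a hedged illustration only: the DE/rand/1/bin variant, the parameters `f` and `cr`, the weight range [0, 1], the function names, and the three-classifier toy predictions are all assumptions (the paper itself uses five classifiers and does not specify these details in the abstract).

```python
import random


def weighted_vote(predictions, weights, n_classes):
    """Combine one sample's per-classifier labels by weighted voting."""
    scores = [0.0] * n_classes
    for label, w in zip(predictions, weights):
        scores[label] += w
    return max(range(n_classes), key=lambda c: scores[c])


def ensemble_accuracy(weights, all_preds, y_true, n_classes):
    """Fitness value: accuracy of the weighted-vote ensemble."""
    hits = 0
    for i, truth in enumerate(y_true):
        sample_preds = [clf_preds[i] for clf_preds in all_preds]
        if weighted_vote(sample_preds, weights, n_classes) == truth:
            hits += 1
    return hits / len(y_true)


def optimize_weights(all_preds, y_true, n_classes, pop_size=20,
                     generations=50, f=0.5, cr=0.9, seed=0):
    """Assumed DE/rand/1/bin search over classifier weights in [0, 1]."""
    rng = random.Random(seed)
    dim = len(all_preds)
    pop = [[rng.random() for _ in range(dim)] for _ in range(pop_size)]
    fit = [ensemble_accuracy(w, all_preds, y_true, n_classes) for w in pop]
    for _ in range(generations):
        for i in range(pop_size):
            # Mutation uses three distinct individuals other than i.
            a, b, c = rng.sample([j for j in range(pop_size) if j != i], 3)
            j_rand = rng.randrange(dim)  # forces at least one mutated gene
            trial = []
            for d in range(dim):
                if rng.random() < cr or d == j_rand:
                    v = pop[a][d] + f * (pop[b][d] - pop[c][d])
                    trial.append(min(1.0, max(0.0, v)))
                else:
                    trial.append(pop[i][d])
            trial_fit = ensemble_accuracy(trial, all_preds, y_true, n_classes)
            # Greedy selection: keep the trial if it is at least as fit.
            if trial_fit >= fit[i]:
                pop[i], fit[i] = trial, trial_fit
    best = max(range(pop_size), key=lambda i: fit[i])
    return pop[best], fit[best]


# Toy predictions from three hypothetical classifiers on eight reviews.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
all_preds = [
    [1, 0, 1, 1, 0, 0, 1, 1],
    [0, 1, 1, 0, 0, 1, 1, 0],
    [1, 1, 0, 1, 1, 0, 0, 0],
]
uniform_acc = ensemble_accuracy([1.0, 1.0, 1.0], all_preds, y_true, 2)
best_w, best_acc = optimize_weights(all_preds, y_true, n_classes=2)
```

In practice the fitness would be measured on held-out validation data rather than the data used to report final accuracy, so that the learned weights do not overfit.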

5.
6.
As the COVID-19 pandemic swept the globe, social media platforms became an essential source of information and communication for many. International students, in particular, turned to Twitter to express their struggles and hardships during this difficult time. To better understand the sentiments and experiences of these international students, we developed the Situational Aspect-Based Annotation and Classification (SABAC) text mining framework. This framework uses a three-layer approach, combining baseline Deep Learning (DL) models with Machine Learning (ML) models as meta-classifiers to accurately predict the sentiments and aspects expressed in tweets from our collected Student-COVID-19 dataset. Using the proposed aspect2class annotation algorithm, we labeled bulk unlabeled tweets according to the aspect terms they contain. However, we also recognized the challenge of reducing the data’s high dimensionality and sparsity to improve performance and annotation on unlabeled datasets. To address this issue, we proposed the Volatile Stopwords Filtering (VSF) technique to reduce sparsity and enhance classifier performance. On the resulting Student-COVID Twitter dataset, the framework achieved an accuracy of 93.21% when using random forest as the meta-classifier. Through testing on three benchmark datasets, we found that the SABAC ensemble framework performed exceptionally well. Our findings showed that international students during the pandemic faced various issues, including stress, uncertainty, health concerns, financial stress, and difficulties with online classes and returning to school. By analyzing and summarizing these annotated tweets, decision-makers can better understand and address the real-time problems international students face during the ongoing pandemic.

7.
With the development of satellite technology, satellite imagery of the earth’s surface makes it possible to survey surface resources and track the earth’s dynamic changes with high efficiency and low cost. As an important tool for satellite remote sensing image processing, remote sensing image classification has become a hot topic. Based on the natural texture characteristics of remote sensing images, this paper combines different texture features with the Extreme Learning Machine and proposes a new remote sensing image classification algorithm. Experiments are carried out on the standard test datasets SAT-4 and SAT-6. Our results show that the proposed method is a simpler and more efficient remote sensing image classification algorithm. It achieves 99.434% recognition accuracy on SAT-4, about 1.5 percentage points higher than the 97.95% achieved by DeepSat. On SAT-6, the recognition accuracy reaches 99.5728%, about 5.6 percentage points higher than DeepSat’s 93.9%.

8.
In the current era of the internet, people use online media for conversation, discussion, chatting, and other similar purposes. Analyzing such material, in which more than one person is involved, poses distinct challenges compared to other text analysis tasks. There are several approaches for identifying users’ emotions from conversational text in English; however, regional and low-resource languages have been neglected. Urdu is one of them: despite being used by millions of users across the globe, to the best of our knowledge there exists no work on dialogue analysis in the Urdu language. Therefore, in this paper, we propose a model that uses deep learning and machine learning approaches to classify users’ emotions from text. To accomplish this task, we first created a dataset for the Urdu language with the help of existing English-language datasets for dialogue analysis. We then preprocessed the data and selected dialogues with common emotions. Once the dataset was prepared, we applied different deep learning and machine learning techniques for emotion classification, tuning the algorithms to the Urdu datasets. The experimental evaluation shows encouraging results, with 67% accuracy on the Urdu dialogue datasets; more than 10,000 dialogues are classified into five emotions, i.e., joy, fear, anger, sadness, and neutral. We believe that this is the first effort at emotion detection from conversational text in the Urdu language domain.

9.
In the machine learning (ML) paradigm, data augmentation serves as a regularization approach for creating ML models. Increasing the diversity of training samples improves generalization, which enhances the prediction performance of classifiers when tested on unseen examples. Deep learning (DL) models have a large number of parameters, and they frequently overfit. Data therefore plays a major role in avoiding overfitting and in sustaining the latest improvements in DL. Nevertheless, reliable data collection is a major limiting factor. This problem is frequently addressed by combining data augmentation, transfer learning, dropout, and batch normalization methods. In this paper, we introduce the application of data augmentation in the field of image classification using Random Multi-model Deep Learning (RMDL), which uses the association approaches of multi-DL to yield random models for classification. We present a methodology for using Generative Adversarial Networks (GANs) to generate images for data augmentation. Through experiments, we find that samples generated by GANs, when fed into RMDL, improve both accuracy and model efficiency. Experiments on both the MNIST and CIFAR-10 datasets show that the error rate decreases with the proposed approach across different random models.

10.
Prediction of machine failure is challenging as the dataset is often imbalanced with a low failure rate. The common approach to handling classification with imbalanced data is to balance the data using a sampling approach such as random undersampling, random oversampling, or Synthetic Minority Oversampling Technique (SMOTE) algorithms. This paper compared the classification performance of three popular classifiers (Logistic Regression, Gaussian Naïve Bayes, and Support Vector Machine) in predicting machine failure in the oil and gas industry. The original machine failure dataset consists of 20,473 hourly records and is imbalanced, with 19,945 (97%) ‘non-failure’ and 528 (3%) ‘failure’ records. The three independent variables used to predict machine failure were the pressure indicator, flow indicator, and level indicator. On the original dataset, the accuracy of the classifiers is very high and close to 100%, but the sensitivity of all classifiers was close to zero. The performance of the three classifiers was then evaluated on data with different imbalance rates (10% to 50%) generated from the original data using SMOTE, SMOTE-Support Vector Machine (SMOTE-SVM), and SMOTE-Edited Nearest Neighbour (SMOTE-ENN). The classifiers were evaluated based on improvement in sensitivity and F-measure. Results showed that the sensitivity of all classifiers increases as the imbalance rate increases. SVM with a radial basis function (RBF) kernel has the highest sensitivity when the data is balanced (50:50) using SMOTE (Sensitivity_test = 0.5686, F_test = 0.6927), compared to Naïve Bayes (Sensitivity_test = 0.4033, F_test = 0.6218) and Logistic Regression (Sensitivity_test = 0.4194, F_test = 0.621). Overall, the Gaussian Naïve Bayes model consistently improves sensitivity and F-measure as the imbalance ratio increases, but its sensitivity is below 50%. The classifiers performed better when the data was balanced using SMOTE-SVM compared to SMOTE and SMOTE-ENN.
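
The core step of SMOTE, creating a synthetic minority sample on the line segment between a minority point and one of its nearest minority neighbors, can be sketched in a few lines. This is a simplified illustration under stated assumptions: the function name `smote`, the choice `k=3`, and the toy three-feature records are invented here, and the SMOTE-SVM and SMOTE-ENN variants compared in the paper are not covered.

```python
import random


def smote(minority, n_synthetic, k=3, seed=0):
    """Create synthetic minority samples by interpolating toward neighbors."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_synthetic):
        x = rng.choice(minority)
        # Rank the other minority points by squared Euclidean distance to x.
        others = [p for p in minority if p is not x]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(x, p)))
        neighbor = rng.choice(others[:k])
        gap = rng.random()  # position along the segment from x to neighbor
        synthetic.append([a + gap * (b - a) for a, b in zip(x, neighbor)])
    return synthetic


# Invented 'failure' records: (pressure, flow, level) indicator readings.
failures = [[1.0, 2.0, 0.5], [1.2, 1.9, 0.7], [0.9, 2.2, 0.6], [1.1, 2.1, 0.4]]
new_points = smote(failures, n_synthetic=6)
```

Oversampling of this kind should be applied to the training split only; evaluating on synthetic points would inflate sensitivity and F-measure.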

11.
Flexible piezoelectric acoustic sensors have been developed to generate multiple sound signals with high sensitivity, shifting the paradigm of future voice technologies. Speech recognition based on advanced acoustic sensors and optimized machine learning software will serve as an innovative interface for artificial intelligence (AI) services. Collaboration and novel approaches spanning both smart sensors and speech algorithms should be pursued to realize a hyperconnected society, which can offer personalized services such as biometric authentication, AI secretaries, and home appliances. Here, representative developments in speech recognition are reviewed in terms of flexible piezoelectric materials, self-powered sensors, machine learning algorithms, and speaker recognition.

12.
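An ordered binary decision diagram (BDD) learning method based on total reconvergence points is proposed to improve the performance of combinational circuit test pattern generation systems that use Boolean satisfiability. It effectively combines the respective strengths of Boolean satisfiability, BDD, and structural methods, and can effectively resolve the correlations among local signal assignments. Specifically, the method first analyzes the circuit topology, collects correlation information about local signals, and constructs a BDD for the local circuit structure; it then learns from this BDD the conjunctive normal form clauses that cause conflicting assignments and adds the learned clauses to the clause database to restrict and prune the test generation search space, thereby accelerating test pattern generation. Experimental results demonstrate the effectiveness of the method.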

13.
Clinical image processing plays a significant role in healthcare systems and is currently a widely used methodology. In carcinogenic diseases, time is crucial; thus, accurate analysis of an image can help treat disease at an early stage. Ductal carcinoma in situ (DCIS) and lobular carcinoma in situ (LCIS) are common types of malignancies that affect both women and men. The number of cases of DCIS and LCIS has increased every year since 2002, while it still takes a considerable amount of time to recommend a controlling technique. Image processing is a powerful technique for analyzing preprocessed images to retrieve useful information through a series of processing operations. In this paper, we used a dataset from the Mammographic Image Analysis Society and MATLAB 2019b software from MathWorks to simulate and extract our results. In the proposed study, mammograms are primarily used to diagnose, more precisely, the tumor component of the breast. DCIS and LCIS are detected on breast mammograms by first preprocessing the images using contrast-limited adaptive histogram equalization. The tumor portions of the resulting images are then isolated by a segmentation process, such as threshold detection. Furthermore, morphological operations, such as erosion and dilation, are applied to the images; then gray-level co-occurrence matrix texture features, Haralick texture features, and shape features are extracted from the regions of interest. For classification, a support vector machine (SVM) classifier is used to categorize normal and abnormal patterns. Finally, the adaptive neuro-fuzzy inference system (ANFIS) is deployed to remove the fuzziness caused by overlapping features of patterns within the images, and the exact categorization of prior patterns is obtained through the SVM. Early detection of DCIS and LCIS can save lives and help physicians and surgeons to diagnose and treat these diseases. Substantial results are obtained through the cubic support vector machine (CSVM), with accuracies of 98.95% and 98.01% for normal and abnormal mammograms, respectively. Through ANFIS, promising mean square error (MSE) values of 0.01866, 0.18397, and 0.19640 are obtained for DCIS and LCIS differentiation during the training, testing, and checking phases.
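
One of the texture steps named in this abstract, building a gray-level co-occurrence matrix (GLCM) and deriving a Haralick feature from it, can be sketched as below. The paper's experiments were run in MATLAB; this Python version, the function names, the single horizontal offset, the non-symmetric counting, and the 4-level toy patch are all illustrative assumptions.

```python
def glcm(image, levels, dx=1, dy=0):
    """Count how often gray level i is followed by level j at offset (dx, dy)."""
    matrix = [[0] * levels for _ in range(levels)]
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                matrix[image[r][c]][image[r2][c2]] += 1
    return matrix


def contrast(matrix):
    """Haralick contrast: sum of p(i, j) * (i - j)^2 over the normalized GLCM."""
    total = sum(sum(row) for row in matrix)
    return sum(v / total * (i - j) ** 2
               for i, row in enumerate(matrix)
               for j, v in enumerate(row))


# Toy 4x4 patch quantized to four gray levels (not mammogram data).
patch = [[0, 0, 1, 1],
         [0, 0, 1, 1],
         [0, 2, 2, 2],
         [2, 2, 3, 3]]
m = glcm(patch, levels=4)
patch_contrast = contrast(m)
```

A feature vector for the SVM would typically concatenate several such statistics (contrast, energy, homogeneity, and so on) over multiple offsets; only contrast is shown to keep the sketch short.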

14.
Emotion detection from text is a challenging problem in text analytics. Opinion mining experts are focusing on the development of emotion detection applications, which have received considerable attention from the online community, including users and business organizations, for collecting and interpreting public emotions. However, most existing work on emotion detection has used less efficient machine learning classifiers with limited datasets, resulting in performance degradation. To overcome this issue, this work evaluates the performance of different machine learning classifiers on a benchmark emotion dataset. The experimental results show the performance of the classifiers in terms of evaluation metrics such as precision, recall, and F-measure. Finally, the classifier with the best performance is recommended for emotion classification.

15.
Social networking services (SNSs) provide massive data that can be a very influential source of information during pandemic outbreaks. This study shows that social media analysis can be used as a crisis detector (e.g., understanding the sentiment of social media users regarding various pandemic outbreaks). The novel Coronavirus Disease-19 (COVID-19), commonly known as coronavirus, affected everyone worldwide in 2020. Streaming Twitter data have revealed the status of the COVID-19 outbreak in the most affected regions. This study focuses on identifying COVID-19 patients from Twitter messages (tweets) without requiring medical records. For this purpose, we propose an intelligent model using traditional machine learning-based approaches, such as support vector machine (SVM), logistic regression (LR), naïve Bayes (NB), random forest (RF), and decision tree (DT), with the help of term frequency-inverse document frequency (TF-IDF) to detect the COVID-19 pandemic in Twitter messages. The proposed model classifies Twitter messages into four categories, namely, confirmed, deaths, recovered, and suspected. For the experimental analysis, tweet data on the COVID-19 pandemic are analyzed to evaluate the results of traditional machine learning approaches. A benchmark dataset of COVID-19 Twitter messages is developed and can be used for future research studies. The experiments show that the proposed approach is promising in detecting the COVID-19 pandemic in Twitter messages, with overall accuracy, precision, recall, and F1 score between 70% and 80%; confusion matrices are reported for the machine learning approaches (i.e., SVM, NB, LR, RF, and DT) with the TF-IDF feature extraction technique.
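
The TF-IDF feature extraction mentioned in this abstract can be illustrated with the textbook formula, tf(t, d) × log(N / df(t)). This is a sketch under assumptions: the abstract does not state which TF-IDF variant (smoothing, normalization) the authors used, and the function name `tf_idf` and the example tweets are invented.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Return one {term: weight} dict per tokenized document."""
    n_docs = len(documents)
    # Document frequency: number of documents that contain each term.
    doc_freq = Counter(term for doc in documents for term in set(doc))
    weighted = []
    for doc in documents:
        counts = Counter(doc)
        weighted.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weighted


# Invented example tweets, already lowercased and split on whitespace.
tweets = [
    "tested positive for covid today".split(),
    "my neighbor recovered from covid".split(),
    "feeling feverish might be covid".split(),
]
vectors = tf_idf(tweets)
```

Because "covid" occurs in every tweet, its weight is zero under this unsmoothed formula, while terms such as "positive" or "recovered" that separate the categories receive positive weight. The resulting vectors would then be passed to a classifier such as SVM or LR.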
