共查询到20条相似文献,搜索用时 15 毫秒
2.
本文主要研究机器学习方法在新闻文本的情感分类中的应用,判断其是正面还是负面。我们利用朴素贝叶斯和最大熵方法进行新闻及评论语料的情感分类研究。实验表明,机器学习方法在基于情感的文本分类中也能取得不错的分类性能,最高准确率能达到90%。同时我们也发现,对于基于情感的文本分类,选择具有语义倾向的词汇作为特征项、对否定词正确处理和采用二值作为特征项权重能提高分类的准确率。总之,基于情感的文本分类是一个更具挑战性的工作。 相似文献
3.
Many language identification (LID) systems are based on language models using techniques that consider the fluctuation of speech over time. Considering these fluctuations necessitates longer recording intervals to obtain reasonable accuracy. Our research extracts features from short recording intervals to enable successful classification of spoken language. The feature extraction process is based on frames of 20 ms, whereas most previous LIDs presented results based on much longer frames (3?s or longer). We defined and implemented 200 features divided into four feature sets: cepstrum features, RASTA features, spectrum features, and waveform features. We applied eight machine learning (ML) methods on the features that were extracted from a corpus containing speech files in 10 languages from the Oregon Graduate Institute (OGI) telephone speech database and compared their performances using extensive experimental evaluation. The best optimized classification results were achieved by random forest (RF): from 76.29% on 10 languages to 89.18% on 2 languages. These results are better or comparable to the state-of-the-art results for the OGI database. Another set of experiments that was performed was gender classification from 2 to 10 languages. The accuracy and the F measure values for the RF method for all the language experiments were greater than or equal to 90.05%. 相似文献
4.
Aiming at the characteristics of varied and complex geomorphic types,crisscross network of ravines and broken terrain in high altitude complicated terrain regions,it is very important to study and find the rapid and effective land use/land cover classification method for obtaining and timely updating of land use information.Taking the Huangshui river basin located in the transitional zone between the Loess Plateau and the Qinghai-Tibet Plateau as acasestudy area,the objective of this study is to explore a kind of effective information extraction method from comparison of four kinds machine learning methods for complicated terrain regions.based on Landsat 8 OLI satellite data,DEM and combined with various thematic features,on the basis of geographical division of the study area,artificial neural network,decision tree,support vector machine and random forest four machine learning methods for land use information extraction were used to obtain land use data,and confusion matrix was constructed to evaluate classification accuracy.The results showed that the classification accuracies of random forest and decision tree are obviously higher than those of support vector machine and artificial neural network.The random forest method has the highest classification accuracy,the overall classification accuracy is 85.65%,the Kappa coefficient is 0.84.based on the above classification,Random forest classification method was chose to further classify Landsat 8 fusion datafrom panchromatic 15 meter and multispectral 30 meter image,the overall classification accuracy is 86.49% and the Kappa coefficient is 0.85.This indicated that the random forest classification method can obtain higher classification efficiency while ensuring the classification accuracy.It is very effective for the extraction of land use information in complicated terrain regions.Data fusion can improve the classification accuracy to a certain extent. 相似文献
5.
Cataracts are the leading cause of visual impairment and blindness globally. Over the years, researchers have achieved significant progress in developing state-of-the-art machine learning techniques for automatic cataract classification and grading, aiming to prevent cataracts early and improve clinicians’ diagnosis efficiency. This survey provides a comprehensive survey of recent advances in machine learning techniques for cataract classification/grading based on ophthalmic images. We summarize existing literature from two research directions: conventional machine learning methods and deep learning methods. This survey also provides insights into existing works of both merits and limitations. In addition, we discuss several challenges of automatic cataract classification/grading based on machine learning techniques and present possible solutions to these challenges for future research. 相似文献
8.
Scheduling with learning effects has gained increasing attention in recent years. A well‐known learning model is called “sum‐of‐processing‐times‐based learning” in which the actual processing time of a job is a nonincreasing function of the jobs already processed. However, the actual processing time of a given job drops to zero precipitously when the normal job processing times are large. Moreover, the concept of learning process is relatively unexplored in a flowshop environment. Motivated by these observations, this article addresses a two‐machine flowshop problem with a truncated learning effect. The objective is to find an optimal schedule to minimize the total completion time. First, a branch‐and‐bound algorithm incorporating with a dominance property and four lower bounds is developed to derive the optimal solution. Then three simulated annealing algorithms are also proposed for near‐optimal solution. The experimental results indicated that the branch‐and‐bound algorithm can solve instances up to 18 jobs, and the proposed simulated annealing algorithm performs well in item of CPU time and error percentage. © 2011 Wiley Periodicals, Inc. 相似文献
9.
Classification learning with association rules has been an active research area during recent years. Thus, it is important to establish some numerical importance measure for association rules. In this paper, we propose a new rule importance measure, called a HD measure, using information theory. A num ber of properties of the new measure are analyzed, and its classification performances are compared with that of other rule measures. 相似文献
10.
Creating dynamic virtual environments consisting of humans interacting with objects is a fundamental problem in computer graphics. While it is well‐accepted that agent interactions play an essential role in synthesizing such scenes, most extant techniques exclusively focus on static scenes, leaving the dynamic component out. In this paper, we present a generative model to synthesize plausible multi‐step dynamic human‐object interactions. Generating multi‐step interactions is challenging since the space of such interactions is exponential in the number of objects, activities, and time steps. We propose to handle this combinatorial complexity by learning a lower dimensional space of plausible human‐object interactions. We use action plots to represent interactions as a sequence of discrete actions along with the participating objects and their states. To build action plots, we present an automatic method that uses state‐of‐the‐art computer vision techniques on RGB videos in order to detect individual objects and their states, extract the involved hands, and recognize the actions performed. The action plots are built from observing videos of everyday activities and are used to train a generative model based on a Recurrent Neural Network (RNN). The network learns the causal dependencies and constraints between individual actions and can be used to generate novel and diverse multi‐step human‐object interactions. Our representation and generative model allows new capabilities in a variety of applications such as interaction prediction, animation synthesis, and motion planning for a real robotic agent. 相似文献
12.
Activity recognition (AR) is a key enabler for a context‐aware smart home since knowing what the residents’ current activities helps a smart home provide more desirable services. This is why AR is often used in assistive technologies for cognitively impaired people to evaluate their abilities to undertake activities of daily living. In a real‐life scenario, multiple‐resident AR has been considered as a very challenging problem, primarily due to the complexity of data association. In addition, most prior research has not considered the potential interpersonal interactions among residents to simplify complexity, especially in an environment monitored by ambient sensors. In this study, we propose two types of multiuser activity models, both of which are derived from an interaction‐feature enhanced multiuser model learning framework. These two models consider interpersonal interactions and data association for multiuser AR using ambient sensors. We then compare their performance with the other two baseline models with or without consideration of data association and interpersonal interactions. The experimental results show that the derived models outperform other baseline classifiers. Therefore, the proposed approach can increase the opportunities for providing context‐aware services for a multiresident smart home. 相似文献
14.
How can the extremely uncertain front end of innovation – managing the fuzzy front end – be taught to graduate students? This paper describes and analyses experiments with experiential, problem‐based learning focused on the front end of innovation. The focus is on the learning and cross‐organizational integration of student teams; factors that have been identified as central to the success of teams involved in the front end of innovation. An experiential course, ‘From an idea to a business plan in product development’, was developed in conjunction with an actual company, and piloted with four student groups in 2007 and 2008. Data on this novel course were collected through participant observation, team self‐assessment and questionnaires. This paper reports favourable results for the effectiveness of the course design; it discusses the impact of team size and cross‐organizational team composition on team performance; and identifies the implications for teaching the front end of innovation. 相似文献
16.
This article contributes to the debate on organizational learning from crisis by shedding light on the phenomenon of crises as learning triggers. To unveil theoretical patterns of how organizational crisis‐induced learning may appear and develop, I suggest a conceptual framework based on concept categories and answers to four fundamental questions: what lessons are learned (single‐ or double‐loop)?; what is the focus of the lessons (prevention or response)?; when are lessons learned (intra‐ or intercrisis)?; is learning blocked from implementation or carried out (distilled or implemented)? The framework's applicability is explored in a study of how a Swedish utility and the city of Stockholm responded to two large‐scale blackouts in Stockholm. The final sections suggest four propositions for further research. 相似文献
19.
In this article, we propose to revisit the China of April 2008 and ponder on one major event that happened that year: the boycott of the Carrefour stores, which resulted in an unprecedented display of demonstrations and protests. After having proposed an original theoretical framework (the information cascade), we move on to carry out a detailed reading of this case in order to learn from it and have a better understanding of current crises. 相似文献
|