首页 | 本学科首页   官方微博 | 高级检索  
     


Environmental Sound Classification Using Deep Learning
Authors:SHANTHAKUMAR S  SHAKILA S  SUNETH Pathiran  JAYALATH Ekanayake
Affiliation:Dept. of Computer Science and Informatics, Faculty of Applied Sciences, Uva Wellassa University, Passara Rd. Badulla, 90000, Sri Lanka
Abstract:Perhaps hearing impairment individuals cannot identify the environmental sounds due to noise around them. However, very little research has been conducted in this domain. Hence, the aim of this study is to categorize sounds generated in the environment so that the impairment individuals can distinguish the sound categories. To that end first we define nine sound classes--air conditioner, car horn, children playing, dog bark, drilling, engine idling, jackhammer, siren, and street music-- typically exist in the environment. Then we record 100 sound samples from each category and extract features of each sound category using Mel-Frequency Cepstral Coefficients (MFCC). The training dataset is developed using this set of features together with the class variable; sound category. Sound classification is a complex task and hence, we use two Deep Learning techniques; Multi Layer Perceptron (MLP) and Convolution Neural Network (CNN) to train classification models. The models are tested using a separate test set and the performances of the models are evaluated using precision, recall and F1-score. The results show that the CNN model outperforms the MLP. However, the MLP also provided a decent accuracy in classifying unknown environmental sounds.
Keywords:Mel-Frequency Cepstral Coefficients  MFCC  Multi-Layer Perceptron  MLP  Convolutional Neural Network  CNN
点击此处可从《国外电子测量技术》浏览原始摘要信息
点击此处可从《国外电子测量技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号