Hybrid Bayesian network classifiers: Application to species distribution models |
| |
Authors: | P.A. Aguilera A. Fernández F. Reche R. Rumí |
| |
Affiliation: | 1. U.S. Geological Survey, Fort Collins Science Center, 2150 Center Ave Bldg. C, Fort Collins, CO 80526, USA;2. Natural Resource Ecology Laboratory, Colorado State University, Fort Collins, CO 80523-1499, USA;3. Department of Interior, North Central Climate Science Center, Colorado State University, Fort Collins, CO 80523, USA;1. University of Trento, Center Agriculture Food Environment (C3A), Via E. Mach 1, S. Michele all’ Adige (TN) 38010, Italy;2. University of Trento, Department of Cellular Computational and Integrative Biology (CIBIO), Via Sommarive, 14, Povo 38123 (TN), Italy;3. Fondazione Edmund Mach, Research and Innovation Centre, Department of Biodiversity and Molecular Ecology, Via E. Mach 1, S. Michele all’ Adige 38010 (TN), Italy;4. Department of Applied Geoinformatics and Spatial Planning, Faculty of Environmental Sciences, Czech University of Life Sciences Prague, Kamýcka 129, Praha – Suchdol, 16500, Czech Republic;5. Department of Pathology, Microbiology, and Immunology, School of Veterinary Medicine, University of California, Davis, USA;6. Ecological Modelling Laboratory, Department of Physical & Environmental Sciences, University of Toronto, Toronto, ON M1C 1A4, Canada;7. BIGEA, Department of Biological, Geological and Environmental Sciences, Alma Mater Studiorum, University of Bologna, Via Irnerio 42, Bologna, 40126, Italy;8. Department of Biological Sciences, Murray State University, Murray, KY 42071, USA |
| |
Abstract: | Bayesian networks are one of the most powerful tools in the design of expert systems located in an uncertainty framework. However, normally their application is determined by the discretization of the continuous variables. In this paper the naïve Bayes (NB) and tree augmented naïve Bayes (TAN) models are developed. They are based on Mixtures of Truncated Exponentials (MTE) designed to deal with discrete and continuous variables in the same network simultaneously without any restriction. The aim is to characterize the habitat of the spur-thighed tortoise (Testudo graeca graeca), using several continuous environmental variables, and one discrete (binary) variable representing the presence or absence of the tortoise. These models are compared with the full discrete models and the results show a better classification rate for the continuous one. Therefore, the application of continuous models instead of discrete ones avoids loss of statistical information due to the discretization. Moreover, the results of the TAN continuous model show a more spatially accurate distribution of the tortoise. The species is located in the Doñana Natural Park, and in semiarid habitats. The proposed continuous models based on MTEs are valid for the study of species predictive distribution modelling. |
| |
Keywords: | |
本文献已被 ScienceDirect 等数据库收录! |
|