Improved algorithms for learning bayesian network classifiers

This thesis applies objective Bayesian probability theory techniques to improve Bayesian network classifiers. The main contributions are: {u2500} A parallelizable distance based discretization method that allows to extend discrete classifiers to non-discrete domains. {u2500} The concepts of qualitat...

Descripción completa

Detalles Bibliográficos
Autor principal: Cerquides Bueno, Jesús (-)
Formato: Tesis
Idioma:Inglés
Publicado: Bellaterra : Consejo Superior de Investigaciones Científicas 2005.
Colección:Monografies de l'Institut d'Investigació en Intel-ligencia Artificial.
Materias:
Acceso en línea:Conectar con la versión electrónica
Ver en Universidad de Navarra:https://innopac.unav.es/record=b33072747*spi
Descripción
Sumario:This thesis applies objective Bayesian probability theory techniques to improve Bayesian network classifiers. The main contributions are: {u2500} A parallelizable distance based discretization method that allows to extend discrete classifiers to non-discrete domains. {u2500} The concepts of qualitative influences and synergies, which allow to improve understandability of Bayesian network classifiers. {u2500} The first order and second order qualitative Bayesian classifiers, which are classifiers based on qualitative influences and synergies, with easily understandable results and with a reasonable accuracy. {u2500} INDIFFERENTNB, an improved version of the naive Bayes classifier based on the naive Bayes model, naive distributions, Bayesian model averaging and the principle of indifference that improves the quality of the probabilities of the maximum likelihood naive Bayes classifier. {u2500} STAN+BMA, a classification algorithm based on applying empirical local Bayesian model averaging to the stan (softened TAN) classifier and which improves its accuracy. {u2500} TBMATAN, a classification algorithm based on the computability of the averaging of TAN models under decomposable distributions over TANs and the principle of indifference that improves stan accuracy. {u2500} SSTBMATAN, an efficient approximation to TBMATAN which provides also improved accuracy. {u2500} MAPTAN, a classification algorihtm that computes the maximum a posteriori TAN model and improves STAN accuracy. {u2500} MAPTAN+BMA, a classification algorithm that computes the k maximum a posteriori TAN models and their relative weights efficiently and improves STAN accuracy and STAN+BMA accuracy and learning time. These results show that the joint application of Bayesian model averaging and a careful selection of the prior probability distribution over the set of models, following objective Bayesian techniques whenever it is possible, can provide significant improvements to classification algorithms.
Descripción Física:XVII, 170 p. : gráf
ISBN:9788400083175