Department of Computer Science at UH

University of Houston

Department of Computer Science

In Partial Fulfillment of the Requirements for the Degree of
Master of Science

Mark Paterson

Will defend his thesis

Multivariate Mixture Models for
High-Energy Physics Classification

Abstract

Problems in High-Energy Particle Physics commonly involve sifting large amounts of data in search of a specific pattern of behavior. Due to size and accuracy requirements, this process is intractable if handled manually, but it is an excellent candidate for binary classification techniques using machine learning. We consider the nature and challenge of applying such techniques and implement three specific algorithms within the Root framework developed at the European Organization for Nuclear Research. Our primary approach is Bayesian, using mixtures of Cauchy and Gaussian distributions to model the complex feature space, and Expectation Maximization to derive the model parameters. Since performance is a concern with large datasets, we implement an enhanced version of Gaussian mixture models with significantly better convergence speed, and improve this approach further to preserve accuracy.

Date: Tuesday, November 17, 2009
Time: 3:00 PM
Place: 218-PGH
Faculty, students, and the general public are invited.
Advisor: Prof. Ricardo Vilalta