In Partial Fulfillment of the Requirements for the Degree of Master of Science
will defend her thesis
Aerosol Data Modeling & Similarity Assessment - A Probabilistic Approach
Bio-threat detection is a problem of paramount importance to modern society. One of the major requirements of a system that could detect such threats, is that it should have low false alarm rate. An important reason behind false alarms is the inability of the system to take into consideration the ambient aerosol background of the location under consideration. This is because aerosol backgrounds typically differ across locations—one agent may be naturally present in one location but might be unusual for some other location. Although work has been done in the past to characterize aerosol backgrounds of certain locations, there is no general algorithm available that creates aerosol background at a particular location. This work centers on the design and implementation of a Sensor Modeling Toolbox. The toolbox assumes that the sensor data comes from a Gaussian Mixture distribution and, therefore, uses Gaussian Mixture Models (GMM) to model the sensor data. Expectation Maximization algorithm is used to estimate the parameters of the GMM and Bayesian Information Criterion (BIC) is used to select the best model by considering both the log-likelihood and model complexity (Number of parameters to be estimated) of the GMM. The toolbox provides various functionalities, the major ones bein g to create a model for a set of sensor observations, to generate a model for the aerosol background at a particular location and to assess similarity between two sets of sensor readings by introducing two novel distance functions. The functionalities are evaluated on two real-world data sets that were obtained from aerosol sensors deployed on the campus of the University of Houston. Due to the unavailability of labeled data, this work assesses the quality of the obtained results by comparing them with the characteristics of the raw input data and shows that they are in good agreement. It is also observed that it is usually better to remove outliers from datasets before creating a GMM. The experimental results demonstrate that the toolbox is useful to model and analyze aerosol data and that it provides important capabilities for building successful bio-threat detection systems in the future.
Date: Tuesday, November 24, 2015
Time: 1:30 PM
Place: PGH 550
Advisor: Prof. Christoph F. Eick
Faculty, students, and the general public are invited.