Supervised Clustering—Algorithms and Applications
Thursday, March 3, 2022
11:00 am - 12:00 pm
About the Event
This talk introduces a novel data mining technique we term “supervised clustering.” Unlike traditional clustering, supervised clustering assumes that the examples to be clustered are classified, and its goal is to identify class-uniform clusters that have high probability densities. The talk will discuss the differences between supervised and traditional clustering, and will introduce and evaluate two supervised clustering algorithms: CLEVER, a prototype-based algorithm, and STAXAC, an agglomerative, hierarchical algorithm. Finally, it will discuss applications of supervised clustering, which include distance metric learning, generation of taxonomies in bioinformatics, data set editing, and the discovery of subclasses for a given set of classes.
About the Speaker
Christoph F. Eick received his Ph.D. from the University of Karlsruhe in Germany. He is currently an Associate Professor in the Department of Computer Science at UH and the Director of the UH Data Analysis and Intelligent Systems Lab. His research interests include data mining, machine learning, artificial intelligence, and geographical information systems, and his current research centers on spatial data mining, clustering, and association analysis. He has published close to 180 papers in these and related areas. He serves on the program committees of top data mining and AI conferences, such as the IEEE International Conference on Data Mining (ICDM).