WORLDCOMP'10 Tutorial: Prof. Vladimir Cherkassky
![]() |
Advanced Methodologies for Learning with Sparse Data
Prof. Vladimir Cherkassky Fellow of IEEE; ECE Department, University of Minnesota, Minneapolis, MN, USA; Former Director, NATO Advanced Study Institute (ASI); www.ece.umn.edu/users/cherkass/predictive_learning Date: July 12, 2010 Time: 6:00-9:30 PM Location: Ballroom 5 |
DESCRIPTION
The field of Predictive Learning is concerned with estimating ‘good’ predictive models from available data. Such problems can be usually stated in the framework of inductive learning, where the goal is to come up with a good predictive model from known observations (or training data samples). In recent years, there has been a growing interest in applying learning methods to sparse high-dimensional data (i.e., in genomics, medical imaging, object recognition, etc.). In such applications, many successful approaches represent minor modifications of existing inductive learning methods (such as neural networks, support vector machines, discriminant analysis etc.) combined with clever preprocessing and feature extraction. At the same time, in the statistical learning community, there is a trend towards development and better understanding of new non-standard and non-inductive learning settings. Examples include (a) several powerful learning formulations developed in VC-theory: transduction, learning through contradictions, SVM+ (Vapnik, 1998, 2006); and (b) non-standard settings proposed in machine learning community, such as Multi-Task Learning (Ben-David et al, 2002), Semi-Supervised Learning (Chapelle et al, 2006) etc.. These new learning formulations are motivated by practical needs (to improve generalization for learning with sparse high-dimensional data). This tutorial will present an overview of recent novel learning formulations, investigate possible connections between these formulations, and discuss application examples illustrating advantages of using these approaches for sparse high-dimensional data. The presentation will be based, to a large extent, on the conceptual framework developed by Vapnik [1998, 2006].
OBJECTIVES
-
This tutorial will cover three major parts. The first part will present VC-theoretical framework for predictive learning and discuss standard inductive learning setting, in order to motivate alternative approaches. Second part presents several non-standard learning formulations such as transduction, learning through contradictions, learning with hidden information and multi-task learning. In the third part, we discuss practical issues and difficulties arising in application of these advanced learning techniques. Throughout this tutorial, many important points will be illustrated by empirical comparisons and related to practical applications (mainly, biomedical applications).
INTENDED AUDIENCE
Researchers and practitioners interested in understanding advanced learning methodologies, and their applications. This tutorial is also helpful for developing improved understanding of the methodological issues for learning with high-dimensional data.
Vladimir Cherkassky is Professor of Electrical and Computer Engineering at the University of Minnesota. He received Ph.D. in Electrical Engineering from University of Texas at Austin in 1985. His current research is on methods for predictive learning from data, and he has co-authored a monograph Learning From Data published by Wiley in 1998. Prof. Cherkassky has served on the Governing Board of INNS. He has served on editorial boards of IEEE Transactions on Neural Networks, the Neural Networks Journal, the Natural Computing Journal and the Neural Processing Letters. He served on the program committee of major international conferences on Artificial Neural Networks. He was Director of NATO Advanced Study Institute (ASI) From Statistics to Neural Networks: Theory and Pattern Recognition Applications held in France, in 1993. He presented numerous tutorials on neural network and statistical methods for learning from data. In 2007, he became Fellow of IEEE, for ‘contributions and leadership in statistical learning and neural network research’.
Professor Vladimir Cherkassky
Dept. ECE, University of Minnesota
Minneapolis, MN 55455
tel: 612 625-9597 fax: 612 625-4583
email: cherkass@ece.umn.edu
http://www.ece.umn.edu/users/cherkass/predictive_learning/





