Personal tools
You are here: Home Keynotes Keynote - Prof. Ophir Frieder
Current Events
WORLDCOMP'10
Click Here

Other Events
WORLDCOMP'09
Click Here

WORLDCOMP'07
Click Here

WORLDCOMP'06
Click Here

Join Our Mailing List
Sign up to receive email announcements and updates about conferences and future events




 
Document Actions

Keynote - Prof. Ophir Frieder

Last modified 2008-06-21 09:25

Searching in the "Real World"
Professor Ophir Frieder
Georgetown University, Washington, DC, USA
The Royden B. Davis, SJ, Chair in Interdisciplinary Studies at Georgetown University
IITRI Chair Professor of Computer Science at Illinois Institute of Technology
Director, Information Retrieval lab (IIT)
Fellow of IEEE, ACM, and AAAS

Date: July 14, 2008
Time: 9:40 AM
Location: Lance Burton Theater


Abstract

    For many, "searching" is considered a mostly solved problem. In fact, for text processing, this belief is factually based. The problem is that most "real world" search applications involve "complex documents", and such applications are far from solved. Complex documents, or less formally, "real world documents", comprise of a mixture of images, text, signatures, tables, etc, and are often available only in scanned hardcopy formats. Search systems for such document collections are currently unavailable.

    We describe our efforts at building a complex document information processing prototype. This prototype integrates "point solution" (mature) technologies, such as OCR capability, signature matching and handwritten word spotting techniques, search and mining approaches, among others, to yield a system capable of searching "real world documents". The described prototype demonstrates the adage that "the whole is greater than the sum of its parts". Our complex document benchmark development efforts are likewise presented.

    Having described the global approach, we describe some potential future point solutions which we have developed over the years. These include an Arabic stemmer and a natural language source integration fabric called the Intranet Mediator. In terms of stemming, we developed and commercially licensed an Arabic stemmer and search system. Our approach was evaluated using the benchmark Arabic collections and favorably compared against the state of the art.

    We also focused on source integration and ease of user interaction. By integrating structured and unstructured sources, we developed and commercially licensed our mediator technology that provides a single, natural language interface to querying distributed sources. Rather than providing a set of links as possible answers, the described approach actually answers the posed question. Both the Arabic stemmer and the mediator efforts are likewise discussed.

Biography

    Ophir Frieder is the Royden B. Davis, S.J., Chair in Interdisciplinary Studies at Georgetown University. Since 1998, he has been the IITRI Chair Professor of Computer Science and the Director of the Information Retrieval Laboratory at the Illinois Institute of Technology, from which he is currently on leave. He frequently consults for industry and government and for key intellectual property litigation. His research interests focus on scalable information retrieval systems spanning search and retrieval and communications issues. His systems are deployed in commercial and governmental production environments worldwide. Recently, Springer Science and Business Media designated his co-authored book entitled Information Retrieval: Algorithms and Heuristics with the “Top Selling Title” award. He is the recipient of the 2007 ASIS&T Research in Information Science Award and is a Fellow of the AAAS, ACM, and IEEE.

Academic Co-Sponsors

Computational Biology and Functional Genomics Laboratory, Harvard University, Cambridge, Massachusetts, USA


International Society of Intelligent Biological Medicine

Horvath Laboratory, University of California, Los Angeles (UCLA), USA
Minnesota Supercomputing Institute, University of Minnesota, USA
Functional Genomics Laboratory, University of Illinois at Urbana-Champaign, USA
BioMedical Informatics & Bio-Imaging Laboratory, Georgia Institute of Technology and Emory University, Atlanta, Georgia, USA
Intelligent Data Exploration and Analysis Laboratory, University of Texas at Austin, Austin, Texas, USA
Biomedical Cybernetics Laboratory, HST of Harvard University and MIT, USA
Center for the Bioinformatics and Computational Genomics, Georgia Institute of Technology, Atlanta, Georgia, USA
Harvard Statistical Genomics and Computational Laboratory, Harvard University, Cambridge, Massachusetts, USA
Bioinformatics & Computational Biology Program, George Mason University, Virginia, USA
Hawkeye Radiology Informatics, Department of Radiology, College of Medicine, University of Iowa, Iowa, USA
Medical Image HPC & Informatics Lab (MiHi Lab), University of Iowa, Iowa, USA
The University of North Dakota, Grand Forks, North Dakota, USA
PSU - Prince Sultan University, Saudi Arabia
Institute for Informatics Problems of the Russian Academy of Sciences, Moscow, Russia.
NEMO/European Union at Institute of Discrete Mathematics and Geometry, TU Vienna

Corporate Sponsors






Other Co-Sponsors

High Performance Computing for Nanotechnology (HPCNano)

International Technology Institute (ITI)


GRIDtoday


HPCwire

Hodges' Health



 


Administered by UCMSS
Universal Conference Management Systems & Support
San Diego, California, USA
Contact: Kaveh Arbtan

If you can read this text, it means you are not experiencing the Plone design at its best. Plone makes heavy use of CSS, which means it is accessible to any internet browser, but the design needs a standards-compliant browser to look like we intended it. Just so you know ;)