| 
 Summary 
  
The information age 
 has made it easy to store large amounts of data. The 
 proliferation of documents available on the Web, on 
 corporate intranets, on news wires, and elsewhere is 
 overwhelming. However, while the amount of data available 
 to us is constantly increasing, our ability to absorb and 
 process this information remains constant. Search engines 
 only exacerbate the problem by making more and more 
 documents available in a matter of a few key strokes. 
 Link Analysis is a new and exciting research area that 
 tries to solve the information overload problem by using 
 techniques from data mining, machine learning, 
 Information Extraction, Text Categorization, 
 Visualization and Knowledge Management. Link Analysis is 
 the process of building up networks of interconnected 
 objects through various relationships in order to 
 discover patterns and trends. The main tasks of link 
 analysis are to extract, discover, and link together 
 sparse evidence from vast amounts of data sources, to 
 represent and evaluate the significance of the related 
 evidence, and to learn patterns to guide the extraction, 
 discovery, and linkage of entities. The relationships 
 could be transactional, geographical, social, or 
 temporal. Link Analysis involves the preprocessing of 
 document collections (text categorization, term 
 extraction, and information extraction), integration with 
 structured information sources, the storage of the 
 intermediate representations, the techniques to analyze 
 these intermediate representations (distribution 
 analysis, clustering, trend analysis, association rules, 
 etc.) and visualization of the results. In this tutorial 
 we will present the general theory of Link Analysis and 
 will demonstrate several systems that use these 
 principles to enable interactive exploration of a 
 combination of structured and unstructured collections. 
 We will present a general architecture of link analysis 
 systems and will outline the algorithms and data 
 structures behind the systems. The Tutorial will cover 
 the state of the art in this rapidly growing area of 
 research. Several real world applications of link 
 analysis will be presented.
  
 | 
 
 Target Audience 
The tutorial should be of interest to practitioners from Data Mining,
Bio Information, NLP, IR, Knowledge Management and the general AI
audience interested in this fast-growing research area.   | 
 
 Instructor's Short Biography 
 
Ronen Feldman is a 
 senior lecturer at the Mathematics and Computer Science 
 Department of Bar-Ilan University in Israel, and the 
 Director of the Data Mining Laboratory. He received his 
 B.Sc. in Math, Physics and Computer Science from the 
 Hebrew University, M.Sc. in Computer Science from Bar-Ilan 
 University, and his Ph.D. in Computer Science from 
 Cornell University in NY. He is the founder and president 
 of ClearForest Corporation, a NY based company 
 specializing in development of text mining tools and 
 applications. He is also an Adjunct Professor at NYU 
 Stern Business School. 
 |