Scalable Association-based Text Classification, Dimitris Meretakis (Hong Kong University of Science and Technology), Dimitris Fragoudis (University of Patras, Greece), Hongjun Lu (Hong Kong University of Science and Technology), Spiros Likothanassis (University of Patras, Greece)

Fast Supervised Dimensionality Reduction Algorithm with Applications to Document Categorization & Retrieval, Eui-Hong Han, George Karypis (University of Minnesota) 

Clustering Through Decision Tree Construction, Bing Liu, Yiyuan Xia (National University of Singapore), Philip S. Yu (IBM T. J. Watson Research Center) 

A Semi-Supervised Document Clustering Technique for Information Organization, Han-joon Kim, Sang-goo Lee (Seoul National University) 


Dynamic Generation of Data Broadcasting Programs for a Broadcast Disk Array, Ming-Syan Chen, Wen-Chih Peng (National Taiwan University) 

SAIU: An Efficient Cache Replacement Policy for Wireless On-demand Broadcasts, Jianliang Xu (Hong Kong University of Science and Technology), Qinglong Hu (Aleph Computer Systems, Inc.), Dik-Lun Lee (Hong Kong University of Science and Technology), Wang-Chien Lee (GTE Labs) 

A Framework for Designing Update Objects to Improve Server Scalability in Intermittently Synchronized Databases, Wai Gen Yee (Georgia Institute of Technology), Michael J. Donahoo (Baylor University), Shamkant B. Navathe (Georgia Institute of Technology) 

A Framework for Modeling Buffer Replacement Strategies, Stephane Bressan, Chong Leng Goh, Beng Chin Ooi, Kian-Lee Tan (National University of Singapore)

1:30 - 2:45  MACHINE LEARNING 

Boosting for Document Routing, Raj D. Iyer, David D. Lewis, Robert E. Schapire (AT&T Labs), Yoram Singer (Hebrew University), Amit Singhal (AT&T Labs) 

An Improved Boosting Algorithm and its Application to Text Categorization, Fabrizio Sebastiani (Consiglio Nazionale delle Ricerche), Alessandro Sperduti, Nicola Valdambrini (Università di Pisa) 

Analyzing the Effectiveness and Applicability of Co-training, Kamal Nigam, Rayid Ghani (Carnegie Mellon University) 


Estimating Nested Selectivity in Object-Oriented Databases, Wan-Sup Cho (Chungbuk National University), Wook-Shin Han, Kyu-Young Whang (KAIST), Ki-Hyung Hong (Sungshin Women's University) 

Maintaining Views in Object-relational Databases, Jixue Liu, Millist Vincent (University of South Australia), Mukesh Mohania (Western Michigan University)

Indexing Inheritance and Aggregation, Unmi Tina Kang, Karen C. Davis, Shobha Ravishankar (University of Cincinnati) 

Relevance and Reinforcement in Interactive Browsing, Anton Leuski (University of Massachusetts) 

Models for Reader Interaction with Texts, Daniel Berleant (Iowa State University) 

Elicitation Queries to the Excite Web Search Engine, Amanda Spink, Stephanie Milchak, Michelle Sollenberger (The Pennsylvania State University)


Object and Query Transformation: Supporting Multi-Dimensional Queries through Code Reuse, Ratko Orlandic (Illinois Institute of Technology), Byunggu Yu (University of Wyoming) 

Structural Join Index Driven Complex Object Retrieval: Mechanisms and Selection, Qing Li (City University of Hong Kong), Chi-wai Fung, Kamalakar Karlapalem (Hong Kong University of Science and Technology) 

Sampling from Databases Using B+-Trees, Dimuthu Prasanna Makawita (Ngee Ann Polytechnic), Kian-Lee Tan (National University of Singapore), Huan Liu (Arizona State University) 



Creating and Evaluating Multi-Document Sentence Extract Summaries, Jade Goldstein, Vibhu Mittal, Jaime Carbonell, Jamie Callan (Carnegie Mellon University) 

Automatically Summarising Web Sites - Is There A Way Around It?, Einat Amitay (Macquarie & CSIRO), Cécile Paris (CSIRO) 

Retrieving Descriptive Phrases from Large Amounts of Free Text, Hideo Joho, Mark Sanderson (University of Sheffield) 

Learning a Monolingual Language Model from a Multilingual Text Database, Rayid Ghani, Rosie Jones (Carnegie Mellon University) 


Space Efficient Bitmap Indexing, Nick Koudas 

Vector Approximation based Indexing for Non-uniform High Dimensional Data Sets, Hakan Ferhatosmanoglu, Ertem Tuncel, Divyakant Agrawal, Amr El Abbadi (University of California, Santa Barbara) 

The Subspace Coding Method: A New Indexing Scheme for High-Dimensional Data, Yasushi Sakurai (NTT Cyber Solutions Laboratories), Masatoshi Yoshikawa, Shunsuke Uemura (Nara Institute of Science and Technology), Haruhiko Kojima (NTT Cyber Solutions Laboratories) 

Dimensionality Reduction and Similarity Computation by Inner Product Approximations, Omer Egecioglu, Hakan Ferhatosmanoglu (University of California Santa Barbara) 

1:30 - 2:45  WEB 

Personal Ontologies for Web Navigation, Jason Chaffee, Susan Gauch 

Persistence of information on the web: Analyzing citations contained in research articles, Steve Lawrence, Frans Coetzee, Gary Flake, David Pennock, Bob Krovetz, Finn Nielsen, Andries Kruger, Lee Giles (NEC Research Institute) 

Semantic Search on Internet Tabular Information Extraction for Answering Queries, Huei-Long Wang (National Tsing Hua University), Shih-Hung Wu, I. C. Wang (Academia Sinica), Cheng-Lung Sung (National Tsing Hua University), W. L. Hsu (Academia Sinica), W. K. Shih (National Tsing Hua University) 


Learning to Extract Hierarchical Information from Semi-structured Documents, Wai Lam, Wai-Yip Lin (The Chinese University of Hong Kong) 

A Visual Tool for Structuring and Modeling Organizational Memories, Tang-Ho Le (University of Moncton), Luc Lamontagne (Defence Research Establishment Valcartier), Tho-Hau Nguyen (University of Quebec in Montreal)

On Equivalence of Queries in Uncertain Databases, Fereidoon Sadri, Michael F. Bianco (University of North Carolina at Greensboro) 

3:00 - 4:15  WEB / DISTRIBUTED IR 

DEADLINER: Building a New Niche Search Engine, Frans Coetzee, Andries Kruger, C. Lee Giles, Eric Glover, Gary Flake, Steve Lawrence (NEC Research Institute) 

Collection Selection and Results Merging with Topically Organized U.S. Patents and TREC Data, Leah Larkey, Margaret Connell (University of Massachusetts), Jamie Callan (Carnegie Mellon University) 

Discovery of Similarity Computations of Search Engines, King-Lup Liu (DePaul University), Weiyi Meng (SUNY at Binghamton), Clement Yu (University of Illinois at Chicago), Naphtali Rishe (Florida International University) 


High Performance Clustering Based on the Similarity Join, Markus Breunig, Christian Boehm, Bernhard Braunmueller, Hans-Peter Kriegel (University of Munich) 

Using Star Clusters for Filtering, Javed Aslam, Ekaterina Pelekhov, Daniela Rus (Dartmouth) 

Supporting Subseries Nearest Neighbor Search via Approximation, Changzhou Wang, Xiaoyang Sean Wang (George Mason University) 

n23Tool: a Tool for Exploring Large Relational Data Sets Through 3D Dynamic Projections, Li Yang (Western Michigan University) 

An Access Control Model for Video Database Systems, Elisa Bertino (Universita degli Studi di Milano), Moustafa A. Hammad, Walid G. Aref, Ahmed K. Elmagarmid (Purdue University) 

Visual Query and Analysis Tool of the Object-Relational GIS Framework, Zoran Stojanovic (Delft University of Technology), Slobodanka Djordjevic-Kajan, Dragan Stojanovic (Faculty of Electronic Engineering, Yugoslavia) 


A Meta Model and an Infrastructure for the Non-Transparent Replication of Object Databases, Werner Dreyer (Microsoft Corp.), Klaus R. Dittrich (University of Zurich, Switzerland) 

Polar: An Architecture for a Parallel ODMG Compliant Object Database, Jim Smith (University of Newcastle upon Tyne), Sandra de F. Mendes Sampaio (University of Manchester), Paul Watson (University of Newcastle upon Tyne), Norman Paton (University of Manchester) 

Data Replication for External Searching in Static Tree Structures, Susanne Hambrusch, Chuan-Ming Liu (Purdue University) 


Digital Libraries: Extending and Applying Library and Information Science and Technology
10:15 - 11:55  IR MODELS 

Query Optimization Using An Improved Genetic Algorithm, Mohand Boughanem, Linda Tamine (IRIT-SIG) 

First Story Detection In TDT Is Hard, James Allan, Victor Lavrenko (University of Massachusetts), Hubert Jin ( 

A Distributed Multi-Agent System for Collaborative Information Management and Sharing, James Chen, Shawn R. Wolfe, Stephen D. Wragg (NASA Ames Research Center)

Language Models for Financial News Recommendation, Victor Lavrenko, Matt Schmill, Dawn Lawrie, Paul Ogilvie, David Jensen, James Allan (University of Massachusetts)


On Efficient Storage Space Distribution Among Materialized Views and Indices in Data Warehousing Environments, Ladjel Bellatreche, Kamalakar Karlapalem (Hong Kong University of Science and Technology), Michel Schneider (Universite Blaise Pascal)

Extending OLAP Querying to External Object Databases, Torben Bach Pedersen (Aalborg University), Arie Shoshani, Junmin Gu (Lawrence Berkeley National Laboratory), Christian S. Jensen (Aalborg University) 

Using Wavelet Decomposition to Support Progressive and Approximate Range-Sum Queries over Data Cubes, Yi-Leh Wu, Divyakant Agrawal, Amr El Abbadi (University of California, Santa Barbara) 

Sequence Mining in Categorical Domains: Incorporating Constraints, Mohammed Zaki (RPI) 

Retrieval from Captioned Image Databases Using Natural Language Processing, David Elworthy (Microsoft Research Ltd) 

The Webspace Method: On the Integration of Database Technology with Multimedia Retrieval, Roelof van Zwol, P.M.G. Apers (University of Twente) 

Extensible Perfect Hashing, Takao Miura (Hosei University), Isamu Shioya (Sanno College), Wataru Matsumoto (Hosei University), Yukio Wada (Hitachi Software Engineering Co., Ltd.)


A Query based Approach for Integrating Heterogeneous Data Sources, Ruxandra Domenig, Klaus R. Dittrich (University of Zurich) 

Theoretical Foundations of Schema Restructuring in Heterogeneous Multidatabase Systems, Joseph Albert (Portland State University) 

A Market-based Resource Management and QoS Support Framework for Distributed Multimedia Systems, Wonjun Lee (University of Missouri - Kansas City), Jaideep Srivastava (University of Minnesota) 

Index Interpolation: An Approach to Subsequence Matching Supporting Normalization Transform in Time-Series Databases, Woong-Kee Loh (KAIST), Sang-Wook Kim (Kangwon National University), Kyu-Young Whang (KAIST) 

A Comparison of DFT and DWT based Similarity Search in Time-Series Databases, Yi-Leh Wu, Divyakant Agrawal, Amr El Abbadi (University of California, Santa Barbara) 

A Comparative Study of Log-Only and In-Place Update Based Temporal Object Database Systems, Kjetil Nørvåg (Norwegian University of Science and Technology) 


Rule-Assisted Prefetching in Web-Server Caching, Bin Lan, Stephane Bressan, Beng Chin Ooi, Kian-Lee Tan (National University of Singapore) 

WebCQ: Detecting and Delivering Information Changes on the Web, Ling Liu, Calton Pu, Wei Tang (Georgia Tech) 

Collaborative Proxy System for Distributed Web Content Transcoding, Philip S. Yu (IBM T.J. Watson Research Center), Valeria Cardellini (University of Rome), Yun-Wu Huang (IBM T.J. Watson Research Center)