Tuesday, November 5, 2002
|
8:30-9:00am |
Opening Remarks
|
9:00-10:00am |
Keynote I: On Scalable Information Retrieval Systems
|
Dr. Ophir Frieder, Illinois Institute of Technology
|
Room: Auditorium
|
|
10:00-10:15am |
BREAK |
10:15-noon |
Research Session 1: Pattern Discovery and Forecasting |
Chair: Arbee L. P. Chen (National Dong Hwa University, Taiwan) |
Room: Einstein Room |
|
F4: Large-Scale Automated Forecasting using Fractals
Deepayan Chakrabarti and Christos Faloutsos (Carnegie Mellon University)
|
An Iterative Strategy for Pattern Discovery in Multi-dimensional Data Sets
Chun Tang and Aidong Zhang (University at Buffalo, The State University of New York)
|
Mining Sequential Pattterns with Constraints in Large Databases
Jian Pei (Simon Fraser University, Canada), Jiawei Han (Univeristy of Illinois
at Urbana-Champaign), and Wei Wang (Fudan University, China)
|
Research Session 2: Web Search I |
Chair: Abdur Chowdhury (Illinois Institute of Technology)
|
Room: Drew Room
|
|
Structuring Keywords for Querying
Web Databases
Rodrigo Vieira, Pavel Calado,
Aligran Silva, Alberto Laender, and Berthier Ribeiro-Neto
(Federal University of Minas Gerais, Brazil)
|
Topic-Oriented Collaborative
Crawling
Chiasen Chung, Charles Clarke
(University of Waterloo, Canada)
|
Meta-recommendation Systems:
User-controlled Integration of Diverse Recommendations
J. Ben Schafer
(University of Northern Iowa), Joseph A. Konstan, and John
Riedl (University of Minnesota)
|
A Likelihood-Based Approach
to Data Selection for Collaborative Filtering
Kai Yu
(Siemens AG and University of Munich, Germany), Xiaowei
Xu (Siemens AG, Germany), Anton Schwaighofer
(Siemens AG and TU Graz, Germany), and Hans-Peter
Kriegel (University of Munich, Germany)
|
Research Session 3: Data Warehousing and OLAP
|
Chair: Il-Yeol Song (Drexel University)
|
Room: Curie Room
|
|
Analysis of Pre-computed Partition
Top Method for Range Top-k Queries in OLAP Data Cubes
Zheng Xuan Loh, Tok Wang Ling,
ChuanHeng Ang, and Sin Yueng Lee(National
University of Singapore)
|
Batch Data Warehouse Maintenance
in Dynamic Environment
Bin Liu, Songting Chen, and Elke
A. Rundensteiner(Worcester Polytechnic Institute)
|
A Fast Filtering Scheme for
Large Database Cleansing
Sam Y. Sung, Zhao Li, and Peng
Sun (National University of Singapore)
|
Semantic-based Delivery of
OLAP Summary Tables in Wireless Environments
Mohamed Sharaf and Panos Chrysanthis
(University of Pittsburgh)
|
|
noon-1:15pm
|
Lunch
|
1:15-2:15pm
|
Keynote II: Future directions in Data Mining: Streams, Networks, self-similarity and power laws
|
Dr. Christos Faloutsos, Carnegie Mellon University
|
Room: Auditorium
|
|
2:15-2:30pm
|
BREAK
|
2:30-3:45pm
|
Research Session 4: Image Similarity Search and
Systems
|
Chair: Anupam Joshi (University of Maryland Baltimore County)
|
Room: Einstein Room
|
|
Symbolic Photograph Content-Based
Retrieval
Philippe Mulhem
(IPAL - CNRS, Singapore) and Joo Hwee Lim
(LIT, Singapore)
|
A Compact and Efficient Image
Retrieval Approach Based on Border/Interior Pixel Classification
Renato Stehling
(University of Campinas, Brazil), Mario Nascimento
(University of Alberta, Canada), and Alexandre
Falcao(University of Campinas, Brazil)
|
Vulnerabilities in Similarity
Search Based Systems
Ali Saman Tosun
and Hakan Ferhatosmanoglu (Ohio
State University)
|
Research Session 5: XML Query Processing
|
Chair: Sajda Quereshi (University of Nebraska at Omaha)
|
Room: Drew Room
|
|
Efficient Evaluation of Multiple
Queries on Streaming XML Data
Mong-Li Lee, Boon Chin Chua,
WynneHsu,and Kian-Lee Tan (National
UniversityofSingapore)
|
Query Processing of Streamed
XML Data
Leonidas Fegaras, David Levine,
Sujoe Bose, and Vamsi Chaluvadi (University
of Texasat Arlington)
|
Multi-level Operator Combination
in XML Query Processing
Shurug Al-Khalifa and H. V. Jagadish
(University of Michigan, Ann Arbor)
|
Industry Session 1: Knowledge Management and Semantics
|
Chair: Len Seligman (MITRE Corporation)
|
Room: Curie Room
|
|
Thematic Mapping - from unstructured
documents to taxonomies
C. Chung, R. Lieu, J. Liu, A. Luk, J. Mao, and P. Raghavan (Verity)
|
Voquette Enterprise Content
Management
A. Sheth (University of Georgia & Voquette), and Y. Warke (Voquette)
|
Rule-based Data Quality
D. Loshin (Knowledge Integrity Inc.)
|
|
3:45-4:00pm
|
BREAK
|
4:00-5:15pm
|
Research Session 6: XML Transactions and Applications
|
Chair: Sajda Quereshi (University of Nebraska at Omaha)
|
Room: Einstein Room
|
|
XMLTM: Efficient Transaction
Management for XML Documents
Torsten Grabs
(ETH Zurich, Switzerland), Klemens Bohm
(Otto-von-Guericke-Universitat Magdeburg, Germany), and
Hans-Jorg Schek (ETH Zurich, Switzerland)
|
Efficient Synchronization for
Mobile XML Data
Franky Lam, Nicole Lam, and Raymond
Wong (University of New South Wales,
Australia)
|
An Object-Oriented Extension
of XML for Autonomous Web Applications
Hasan M. Jamil and Giovanni A.
Modica (Mississippi State University)
|
Research Session 7: Caching
|
Chair: R. Scott Cost (University of Maryland Baltimore County)
|
Room: Drew Room
|
|
Efficient Prediction of Web
Accesseson a Proxy Server
Wenwu Lou and Hongjun Lu
(Hong Kong University of Science and Technology,
Hong Kong)
|
A Self-managing Data Cache
for Edge-of-Network Web Applications
Khalil Amiri, Sanghyun Park,
Renu Tewari, and Sriram Padmanabhan (IBM T.
J. Watson ResearchCenter)
|
Cooperative Caching by Mobile
Clients in Push-based Information Systems
Takahiro Hara
(Osaka University, Japan)
|
Research Session 8: Information Extraction and
Text Segmentation
|
Chair: Charles Clarke (University of Waterloo, Canada)
|
Room: Curie Room
|
|
Augeas (Authoritativness Grading,
Estimationand Sorting)
Ayman Farahat, Geoff Nunberg,
and Francine Chen(Palo Alto Rsearch Center)
|
Structural Extraction from
Visual Layout of Documents
Ronen Feldman, Benjamin Rosenfeld,
and Yonatan Aumann (ClearForest Corporation)
|
Topic-Based Document Segmentation
with Probabilistic Latent Semantic Analysis
Thorsten Brants, Francine Chen
(Palo Alto Research Center), and Ioannis
Tsochantaridis (Brown University)
|
|
6:30-8:30pm
|
Reception and Poster Session
|
Chair: Tim Oates (University of Maryland
Baltimore County)
|
Room: Einstein Room
|
|
A New Cache Replacement Algorithm for the
Integration of Web Caching and Prefetching
Cheng-Yue Chang and Ming-Syan Chen (National Taiwan University)
|
A Syntactic Approach for Searching Similarities
inside Sentences
Federica Mandreoli, Riccardo Martoglia, Paolo Tiberio (Universita di
Modena e Reggio Emilia, Italy)
|
A System for Knowledge Management in
Bioinformatics
Sudeshna Adak, Vishal Batra, Deo Bhardwaj, Pasumatri Kamesam, Pankaj
Kankar, Manish Kurhekar, and Biplav Srivastava (IBM India Research Lab)
|
An Agent-Based Approach to Knowledge
Management
Bin Yu and Munindar P. Singh (North Carolina State University, Raleigh)
|
Characterizing Relevant and Non-Relevant
Returns for Task-Oriented Questions
Vanessa Murdock, W. Bruce Croft (University of Massachusetts, Amherst),
and Diane Kelly (Rutgers University, New Brunswick)
|
Data Fusion Models with Estimated Weights
Shengli Wu and Fabio Crestani (The University of Strathclyde, U.K.)
|
Discovering the Representative of a Search
Engine
King-Lup Liu (DePaul University), Clement Yu (University of Illinois
at Chicago), and Weiyi Meng (Binghamton University)
|
Ginga: A Self-Adaptive Query Processing System
Henrique Paques, Ling Liu, and Calton Pu (Georgia Institute of Technology)
|
High Performing and Scalable Feature
Selection for Text Classification
Monica Rogati and Yiming Yang (Carnegie Mellon University)
|
Index Compression vs. Retrieval Time
of Inverted Files for XML Documents
Norbert Fuhr and Norbert Govert (University of Dortmund)
|
Interactive Methods for Taxonomy Editing and
Validation
Scott Spangler and Jeffrey Kreulen (IBM Almaden Research Center) |
Knowledge Discovery from Texts: A Concept
Frame Graph Approach
Rajaraman Kanagasabai and Ah-Hwee Tan(Laboratories for Information
Technology,Singapore)
|
Knowledge Discovery in Patent Databases
Konstantinos Markellos, Penelope Markellou, George Mayritsakis, Katerina Perdikuri,
Spiros Sirmakessis, and Athanasios Tsakalidis (Computer Technology Institute
and University of Patras, Greece)
|
Making Digital Libraries from the Web
Pavel Calado (Universidade Federal de Minas Gerais, Brazil), Marcos
Goncalves (Virginia Polytechnic Institute and State University), Berthier
Ribeiro-Neto (Universidade Federal de Minas Gerais, Brazil), and Edward
Fox (Virginia Polytechnic Institute and State University)
|
Mining Coverage Statistics for Websource
Selection in a Mediator
Zaiqing Nie, Ullas Nambiar, Sreelakshmi Vaddi, Subbarao Kambhampati
(Arizona State University)
|
Mining Soft-Matching Association Rules
Un Nahm and Raymod Mooney (University of Texas at Austin)
|
Parallelizing the Buckshot Algorithm
for Efficient Document Clustering
Steven Beitzel, Eric Jensen, and Angelo Pilotto (Illinois Institute
of Technology)
|
|
Wednesday,
November 6, 2002
|
9:00-10:15am
|
Research Session 9: Sequence Similarity Search
andAccess Methods
|
Chair: Hakan Ferhatosmanoglu (Ohio State University)
|
Room: Einstein Room
|
|
How to Improve the Pruning
Ability of Dynamic Metric Access Methods
Caetano Traina Jr., Agma Traina,
Roberto Santos Filho (University of Sao Paulo
at Sao Carlos, Brazil), and Christos Faloutsos
(Carnegie Mellon University)
|
On the Efficient Evaluation
of Relaxed Queries in Biological Databases
Yangjun Chen (University of Winnipeg, Canada), Duren Che,
and Karl Aberer (IPSI Institute, GMD GmbH, Germany)
|
Similarity based retrieval
from Sequence Datausing Automata as Queries
A. Prasad Sistla, Tao Hu, and
Vikas Chowdhary (University of Illinois
at Chicago)
|
Research Session 10: Information Retrieval Models
|
Chair: Charles Clarke (University of Waterloo, Canada)
|
Room: Drew Room
|
|
Detecting Similar Documents
Using Salient Terms
James W. Cooper, Anni R. Coden,
and Eric W. Brown (IBM T J Watson Research
Center)
|
The Role of Variance in Term
Weighting for Probabilistic Information Retrieval
Warren R. Greiff, William T. Morgan, and Jay M. Ponte
(The MITRE Corporation)
|
Inferring Query Models by
Computing InformationFlow
Peter D. Bruza and Dawei Song
(University of Queensland, Australia)
|
Research Session 11: XML Schemas: Integration
andTranslation
|
Chair: Donald H. Kraft (Louisiana State University)
|
Room: Curie Room
|
|
Logical and Physical Support
for Heterogeneous Data
Sihen Amer-Yahia, Mary Fernandez,
Rick Greer, and Divesh Srivastava(AT&T
Labs -Research)
|
NeT & CoT: Translating
Relational Schemas to XML Schemas Using Semantic Constraints
Dongwon Lee
(Penn State University), Murali Mani, Frank Chiu, and Wesley
W.Chu (University of California Los
Angeles)
|
XClust: Clustering XML Schemas
for Effective Integration
Mong Li Lee, Wynne Hsu, LiangHuai
Yang,and Xia Yang (National University
of Singapore)
|
|
10:15-10:30am
|
BREAK
|
10:30-11:45am
|
Research Session 12: Peer-to-Peer and Database
Systems
|
Chair: Sajda Quereshi (University of Nebraska at Omaha)
|
Room: Einstein Room
|
|
A Local Search Mechanism for
Peer-to-Peer Mechanism
Vana Kalogeraki
(Hewlett-Packard Labs), Dimitrios Gunopulos, and Demetrios
Zeinalipour-Yiatzi (University of California, Riverside)
|
Efficient Knowledge Discovery
in Peer-to-Peer File Sharing
Yugyung Lee, Changgyu Oh, and
E.K. Park (University of Missouri,
Kansas City)
|
Partial Rollback in Object-Oriented/Object-Relational
Database Management Systems
Kim Won-Young, Whang Kyu-Young
(Korea Advanced Institute of Science and
Technology), Lee Byung Suk (University
of Vermont), Lee Young-Koo, and Chang Ji-Woong
(Korea Advanced Institute of Science and Technology)
|
Research Session 13: Information Retrieval I
|
Chair: Clement Yu (University of Illinois at Chicago)
|
Room: Drew Room
|
|
Query Association for Effective
Retrieval
Falk Scholer and Hugh E. Williams
(RMIT University, Australia)
|
Pruning Long Documents for
Distributed Information Retrieval
Jie Lu and Jamie Callan
(Carnegie Mellon University)
|
On Arabic Search: Improving
the Retrieval Effectiveness via a Light Stemming Approach
Mohammed Aljlayl and Ophir Frieder
(Illinois Institute of Technology)
|
Research Session 14: Classification
|
Chair: Eric Brown (IBM)
|
Room: Curie Room
|
|
Boosting to Correct the Inductive
Bias for Text Classification
Yan Liu, Yiming Yang, and Jaime
Carbonell(Carnegie Mellon University)
|
Using Conjunction of Attribute
Values for Classification
Mukund Deshpande
and George Karypis (University
of Minnesota)
|
Categorizing Information Objects
from User Access Patterns
Mao Chen, Andrea LaPaugh, and
Jaswinder Pal Singh(Princeton University)
|
|
11:45-1:00pm
|
Lunch
|
1:00-2:00pm
|
Keynote III:
|
Knowledge and Information Management: Is it possible to do interesting and important research, get funded, be
useful and appreciated?
|
|
Dr. Maria Zemankova, National Science Foundation
|
Room: Auditorium
|
|
2:00-2:15pm
|
BREAK
|
2:15-3:30pm
|
Research Session 15: Language Models for Information
Retrieval
|
Chair: Warren R. Greiff (MITRE Corporation)
|
Room: Einstein Room
|
|
Passage Retrieval Based on
Language Models
Xiaoyong Liu and W. Bruce Croft
(University of Massachusetts, Amherst)
|
Capturing Term Dependencies
Using a Sentence Tree Language Model
Ramesh Nallapati
and James Allan(University
of Massachusetts, Amherst)
|
Language Modeling Framework
for Resource Selection and Results Merging
Luo Si, Rong Jin, Jamie Callan,
and Paul Ogilvie (Carnegie Mellon University)
|
Research Session 16: Spatial Search and Moving
Objects
|
Chair: Prasad Sistla (University of Illinois at Chicago)
|
Room: Drew Room
|
|
An Efficient and Effective
Algorithm for Density Biased Sampling
Alexandros Nanopoulos
(Aristotle University, Greece), Yannis Theodoridis
(Computer Technology Institute, Greece), and Yannis
Manolopoulos (Aristotle University, Greece)
|
"GeoPlot": Spatial Data Mining
on Video Libraries
Jia-Yu Pan and Christos Faloutsos
(Carnegie Mellon University)
|
Trajectory Queries and Octagons
in Moving Object Databases
Hongjun Zhu, Jianwen Su, and
Oscar H. Ibarra(University of California,
Santa Barbara)
|
Industry Session 2: Data Mining and Federated
Systems
|
Chair: Victor Perez-Nunez (MITRE Corporation)
|
Room: Curie Room
|
|
Comparison of Interestingness
Functions for Learning Web Usage Patterns
Xiangji Huang (University of Waterloo), Aijun An (York University), Nick Cercone
(University of Waterloo), and Gary Promhouse (Open Text Corp.)
|
The Verity Federated Infrastructure
Kiam Choo, Rajat Mukherjee, Rami Smair, and Wei Zhang (Verity)
|
Automatically Classifying Database
Workloads
S. Elnaffar and P. Martin (Queen's University) and and R. Horman (IBM
Toronto)
|
|
3:30-3:45pm
|
BREAK
|
3:45-5:00pm
|
Research Session 17: Music Information Retrieval
|
Chair: Maria Zemankova (NSF)
|
Room: Einstein Room
|
|
The Effectiveness Study of
Various Music Information Retrieval Approaches
Jia-Lien Hsu
(Industrial Technology Research Institute, Taiwan), Arbee
L.P. Chen, and Hung-Chen Chen (National
Tsing Hua University, Taiwan)
|
Harmonic Models for Polyphonic
Music Retrieval
Jeremy Pickens
(University of Massachusetts Amherst) and Tim Crawford
(Kings College London, United Kingdom)
|
A Singer Identification Technique
for Content-Based Classification of MP3 Music Objects
Chih-Chin Liu
and Chuan-Sung Huang (Chung
Hua University, Taiwan)
|
Research Session 18: XML Constraints and the
SemanticWeb
|
Chair: Yugyung Lee (University of Missouri-Kansas City)
|
Room: Drew Room
|
|
XKvalidator: A constraint validator
for XML
Yi Chen, Susan Davidson, and Yifeng Zheng
(University of Pennsylvania)
|
Discovering Approximate Keys
in XML Data
Gosta Grahne
and Jianfei Zhu (Concordia
University, Canada)
|
Information Retrieval on the
Semantic Web
Urvi Shah, Tim Finin, Anupam
Joshi,R. Scott Cost (University of Maryland
Baltimore County) and James Mayfield (Johns
Hopkins University Applied Physics Laboratory)
|
|
7:00-10:00pm
|
Banquet and Cruise
|
Thursday,
November 7, 2002
|
9:00-10:40am
|
Research Session 19: Data Streams and Time-series
|
Chair: Arbee L. P. Chen (National Dong Hwa University, Taiwan)
|
Room: Einstein Room
|
|
RHist: Adaptive Summarization
over Continuous Data Streams
Lin Qiao, Divy Agrawal, and Amr
ElAbbadi (University of California, Santa
Barbara)
|
Efficient Query Monitoring
Using Adaptive Multiple Key Hashing
Kun-Lung Wu and Philip Yu
(IBM, T. J. Watson Research Center)
|
Evaluating Continuous Nearest
Neighbor Queriesfor Streaming Time Series via Pre-fetching
Like Gao, Zhengrong Yao, and
X.Sean Wang (George Mason University)
|
Mining Temporal Classes from
Time Series Data
Takao Miura, Masahiro Motoyoshi,
Kohei Watanabe (Hosei University, Japan),
and IsamuShioya (SANNO University,
Japan)
|
Research Session 20: Web Clustering
|
Chair: Jamie Callan (Carnegie Mellon University)
|
Room: Drew Room
|
|
Evaluating Contents-Link Coupled
Web Page Clusteringfor Web Search Results
Yitong Wang
and Masaru Kitsuregawa (The
University of Tokyo, Japan)
|
Inferring Hierarchical Descriptions
Eric J. Glover, David M. Pennock,
Steve Lawrence, and Robert Krovetz (NEC
Research Institute)
|
Evaluations of Algorithms
for Obtaining Hierarchical Clustering Solutions
Ying Zhao and George Karypis
(University of Minnesota)
|
Improved Categorization for
Web Hierarchies
Wahyu Wibowo and Hugh E. Williams
(RMIT University, Australia)
|
Industry Session 3: Database Performance and Interfaces
|
Chair: George Kasper (Virginia Commonwealth University)
|
Room: Curie Room
|
|
A Mapping Mechanism to Support
Bitmap Index and other Auxiliary Structures on Tables Stored as Primary
B+-trees
E. Chong, J. Srinivasan, S. Das, C. Freiwald, A. Yalamanchi, M. Jagannath, A. Tran,
R. Krishnan, and R. Jiang (Oracle)
|
Using Specification-Driven
Concepts for Distributed Data Management and Dissemination
M. Brian Blake (MITRE & Georgetown U.)
|
|
|
10:40-11:00am
|
BREAK
|
11:00-12:15pm
|
Research Session 21: Information Retrieval II
|
Chair: Eric Brown (IBM)
|
Room: Einstein Room
|
|
Knowledge-Based Extraction
ofNamed Entities
Jamie Callan
and Teruko Mitamura(Carnegie
MellonUniversity)
|
Condorcet Fusion for Improved
Retrieval
Mark Montague
and Javed A. Aslam (Dartmouth
College)
|
I/O-Efficient Techniques for
Computing Pagerank
Yen-Yu Chen, Qingqing Gan, and
TorstenSuel (Polytechnic University)
|
Research Session 22: Web Search II
|
Chair: Warren R. Greiff (MITRE Corpporation)
|
Room: Drew Room
|
|
Personalized Web Search by
Mapping User Queries to Categories
Fang Liu, Clement Yu
(University of Illinois at Chicago), and Weiyi
Meng (SUNY at Binghamton)
|
Using Micro Information Units
for Internet Search
Xiaoli Li, Bing Liu, Tong-Heng
Phang,and Minqing Hu (National University
ofSingapore)
|
Entropy-Based Link Analysis
for Mining Web Informative Structures
Hung-Yu Kao
(National Taiwan University), Shian-Hua Lin, Jan-Ming Ho
(Institute of Information Science, Academia
Sinica, Taiwan), and Ming-Syan Chen (National
Taiwan University)
|
Research Session 23: Clustering Algorithms
|
Chair: Walid G. Aref (Purdue University)
|
Room: Curie Room
|
|
COOLCAT: An Entropy-Based Algorithm
for Categorical Clustering
Daniel Barbara, Julia Couto,
and Yi Li (George Mason University)
|
FREM: Fast and Robust EM Clustering
for Large Data Sets
Carlos Ordonez
(Teradata, a division of NCR) and Edward Omiecinski
(Georgia Institute of Technology)
|
Alternatives to the k-means
Algorithm that Find Better Clusterings
Greg Hamerly
and Charles Elkan (University
of California, San Diego)
|
|
12:15-1:15pm
|
Lunch and Steering Committee Meeting
|
1:15-5:30pm
|
Birds-of-a-Feather Meetings
|
End of CIKM 2002
|