
Accepted Papers

Research Track: Core Database Technology

Title Name (Org)
Hexastore: Sextuple Indexing for Semantic Web Data Management Cathrin Weiss (University of Zurich), Panagiotis Karras (University of Zurich), Abraham Bernstein (University of Zurich)
Parallelizing Query Optimization Wook-Shin Han (Kyungpook National University), Wooseong Kwak, Jinsoo Lee, Guy Lohman (IBM Research Almaden, USA), Volker Markl (IBM Research Almaden, USA)
Selectivity Estimation of Set Similarity Selection Queries Marios Hadjieleftheriou (AT&T Labs Inc.), Xiaohui Yu (York University), Nick Koudas (U of Toronto), Divesh Srivastava (AT&T, USA)
Identifying Robust Plans through Plan Diagram Reduction Harish D (Indian Institute of Science), Pooja Darera (Indian Institute of Science), Jayant Haritsa (Indian Institute of Science)
Transaction Time Indexing with Version Compression David Lomet (Microsoft Research, USA), Mingsheng Hong (Cornell University), Rimma Nehme (Purdue University), Rui Zhang (University of Melbourne)
Constrained Physical Design Tuning Nicolas Bruno (Microsoft Research, USA), Surajit Chaudhuri (Microsoft Research, USA)
Fault-tolerant Stream Processing using a Distributed, Replicated File System YongChul Kwon (University of Washington), Magdalena Balazinska (University of Washington), Albert Greenberg (Microsoft Research)
Managing and Querying Transaction-time Databases under Schema Evolution Hyun Moon (UCLA), Carlo Curino (Politecnico di Milano), Alin Deutsch (UCSD), Chien-Yi Hou (UCSD), Carlo Zaniolo (UCLA)
Indexing Land Surface for Efficient kNN Query Cyrus Shahabi (Univ. of Southern California), Lu-An Tang (Univ. of Southern California), Songhua Xing (Univ. of Southern California)
StreamTX: Extracting Tuples from Streaming XML Data Wook-Shin Han (Kyungpook National University), Haifeng Jiang (Google), Howard Ho (IBM Almaden Research Center), Quanzhong Li (IBM)
LEEWAVE: Level-Wise Distribution of Wavelet Coefficients for Processing kNN Queries over Distributed Streams Mi-Yen Yeh (National Taiwan University), Kun-Lung Wu (IBM T. J. Watson Research Center), Philip Yu (University of Illinois at Chicago), Ming-Syan Chen (National Taiwan University, Taiwan)
RDF-3X: a RISC-style Engine for RDF Thomas Neumann (Max-Planck-Institut Informatik), Gerhard Weikum (MPI)
Keyword Query Cleaning Ken Pu (UOIT), Xiaohui Yu (York University)
Dependable Cardinality Forecasts for XQuery Jens Teubner (IBM T.J. Watson Research Center), Torsten Grust (Technische Universität München), Sebastian Maneth (NICTA), Sherif Sakr (NICTA)
Access Control over Uncertain Data Vibhor Rastogi (University of Washington), Dan Suciu (University of Washington and Microsoft), Evan Welbourne (University of Washington)
Efficient Network-Aware Search in Collaborative Tagging Sites Michael Benedikt (Oxford University), Sihem Amer Yahia (Yahoo Research, USA), Laks Lakshmanan (University of British Columbia), Julia Stoyanovich (Columbia University)
Approximate Lineage for Probabilistic Databases Chris Re (University of Washington), Dan Suciu (University of Washington and Microsoft)
A practical scalable distributed B-tree Marcos Aguilera (HP Labs), Wojciech Golab (University of Toronto), Mehul Shah (Hewlett-Packard Labs)
Ed-Join: An Efficient Algorithm for Similarity Joins With Edit Distance Constraints Chuan Xiao (University of New South Wales), Wei Wang (UNSW, Australia), Xuemin Lin (UNSW)
Rewriting Procedures for Batched Bindings Ravindra Guravannavar (IIT Bombay), S. Sudarshan (IIT Bombay, India)
Space-Efficient Synopses for Sliding-Window Top-k Queries on Uncertain Streams Cheqing Jin (ECUST), Ke Yi (Hong Kong University of Science and Technology), Lei Chen (HKUST), Jeffrey Xu Yu (Chin. U. HK), Xuemin Lin (UNSW)
Overcoming the I/O Bottleneck in BI Queries Lin Qiao (IBM Almaden Research Lab), Vijayshankar Raman (IBM Almaden Research Lab), Frederick Reiss (IBM Almaden Research Lab), Peter Haas (IBM Almaden Research Lab)
Tighter Estimation using Bottom-k Sketches Edith Cohen (AT&T), Haim Kaplan (Tel Aviv University)
Evita Raced: Metacompilation for Declarative Networks Tyson Condie (UC Berkeley), Joseph Hellerstein, Petros Maniatis, David Chu
Read-Optimized Databases, In-Depth Allison Holloway (University of Wisconsin), David DeWitt (UW - Madison)
Anonymizing Bipartite Graph Data using Safe Groupings Graham Cormode (AT&T Labs, USA), Divesh Srivastava (AT&T, USA), Ting Yu (North Carolina State University), Qing Zhang (North Carolina State University)
Conditioning Probabilistic Databases Christoph Koch (Cornell University), Dan Olteanu (Oxford University)
Out-of-Order Processing: A New Architecture for High-Performance Stream Systems Jin Li (Portland State University), Kristin Tufte, Vladislav Shkapenyuk (AT&T Labs - Research), Vassilis Papadimos, Theodore Johnson (AT&T Labs - Research), David Maier (Portland State University)
Rose: Compressed, log-structured replication Russell Sears (UC Berkeley), Mark Callaghan (Google), Eric Brewer (UC Berkeley)
Hash-based Subgraph Query Processing Method for Graph-structured XML Documents Hongzhi Wang (Harbin Institute of Technology), Jianzhong Li (Harbin Institute of Technology, China), Jizhou Luo (Harbin Institute of Technology)
Authenticating Query Results for Text Search Engines HweeHwa Pang (Singapore Management Univ), Kyriakos Mouratidis (Singapore Management University)
Scalable Ad-hoc Entity Extraction from Text Collections Sanjay Agrawal (Microsoft Research), Kaushik Chakrabarti (Microsoft Research), Surajit Chaudhuri (Microsoft Research, USA), Venkatesh Ganti (Microsoft Research)
A Pay-As-You-Go Framework for Query Execution Feedback Surajit Chaudhuri (Microsoft Research, USA), Vivek Narasayya (Microsoft Research), Ravishankar Ramamurthy (Microsoft Research)
Efficient Top-K Processing over Query-Dependent Functions Lin Guo (Yahoo! Inc.), Sihem Amer Yahia (Yahoo Research, USA), Raghu Ramakrishnan (Yahoo), Jayavel Shanmugasundaram (Yahoo! Research), Utkarsh Srivastava (Yahoo! Research), Erik Vee (Yahoo! Research)
Row-wise Parallel Predicate Evaluation Ryan Johnson (Carnegie Mellon University), Vijayshankar Raman (IBM Almaden Research Lab), Richard Sidle (IBM Research (Almaden)), Garret Swart (Oracle Corporation)
Generating XML Structure Using Examples and Constraints Sara Cohen (Hebrew University of Jerusalem)
Efficient Skyline Querying with Variable User Preferences on Nominal Attributes Raymond Chi-Wing Wong (The Chinese University of HK), Ada WaiChee Fu (The Chinese University of Hong Kong), Jian Pei (Simon Fraser University), Yip Sing Ho (The Chinese University of HK), Tai Wong (The Chinese University of HK), Yubao Liu (Sun Yat-Sen University)
BayeStore: Managing Large, Uncertain Data Repositories with Probabilistic Graphical Models Daisy Zhe Wang (UC Berkeley), Eirinaios Michelakis (UC Berkeley), Minos Garofalakis (Yahoo Research, USA), Joseph Hellerstein
Flashing Up The Storage Layer Ioannis Koltsidas (University of Edinburgh), Stratis Viglas (University of Edinburgh)
Output Perturbation with Query Relaxation Xiaokui Xiao (Chinese University of Hong Kong), Yufei Tao (CUHK)
Exploiting Shared Correlations in Probabilistic Databases Prithviraj Sen (University of Maryland), Amol Deshpande (University of Maryland, College Park, USA), Lise Getoor (Maryland)
Efficient Search for the Top-k Probable Nearest Neighbors in Uncertain Databases George Beskales (University of Waterloo), Mohamed Soliman (University of Waterloo), Ihab Francis Ilyas (University of Waterloo, Canada)
Privacy Preserving Serial Data Publishing by Role Composition Yingyi Bu (The Chinese University of HK), Ada WaiChee Fu (The Chinese University of Hong Kong), Raymond Chi-Wing Wong (The Chinese University of HK), Lei Chen (HKUST)
Reasoning and Identifying Relevant Matches for XML Keyword Search Ziyang Liu (Arizona State University), Yi Chen (Arizona State University, USA)
Structural Signatures for Tree Data Structures Ashish Kundu (Computer Science, Purdue Univ), Elisa Bertino (Purdue University, USA)
On Efficiently Searching Archival Data for Historical Similarities Reza Sherkat (University of Alberta), Davood Rafiei (University of Alberta)

Research Track: Infrastructure and Information Systems

Title Name (Org)
Discovery of Convoys in Trajectory Databases Hoyoung Jeung (University of Queensland), Man Lung Yiu (Aalborg University), Xiaofang Zhou (University of Queensland), Christian Jensen (Aalborg University), Heng Tao Shen (University of Queensland)
Discovering Data Quality Rules Fei Chiang (University of Toronto), Renee Miller (University of Toronto)
Mining Non-Redundant High Order Correlations in Binary Data Xiang Zhang (University of North Carolina), Feng Pan (University of North Carolina), Wei Wang (UNC), Andrew Nobel
Type Inference and Type Checking for Queries on Execution Traces Daniel Deutch (Tel Aviv University), Tova Milo (Tel Aviv University)
TraClass: Trajectory Classification Using Hierarchical Region-Based and Trajectory-Based Clustering Jae-Gil Lee (UIUC), Jiawei Han (Illinois), Xiaolei Li (UIUC), Hector Gonzalez (UIUC)
Dynamic Active Probing of Helpdesk Databases Shenghuo Zhu (NEC Lab), Tao Li (Florida International University), Zhiyuan Chen (UMBC), Dingding Wang (Florida International University), Yihong Gong (NEC Lab)
Scheduling Shared Scans of Large Data Files Parag Agrawal (Stanford University), Daniel Kifer (Yahoo! Research), Christopher Olston (Yahoo! Research)
Cleaning Uncertain Data with Quality Guarantees Reynold Cheng (Hong Kong Polytechnic University), Jinchuan Chen (Hong Kong Polytechnic University), Xike Xie (Hong Kong Polytechnic University)
End-to-End Support for Joins in Large-Scale Publish/Subscribe Systems Badrish Chandramouli (Duke University), Jun Yang (Duke)
The V-Diagram: a Query-Dependent Method for Moving kNN Queries Sarana Nutanong (The University of Melbourne), Rui Zhang (University of Melbourne), Egemen Tanin, Lars Kulik
Taming Verification Hardness: an efficient algorithm for testing subgraph isomorphism Haichuan Shang (UNSW), Ying Zhang (UNSW), Xuemin Lin (UNSW), Jeffrey Xu Yu (Chin. U. HK)
Mining Search Engine Query Logs via Suggestion Sampling Maxim Gurevich (Technion), Ziv Bar-Yossef (Google and Technion)
Clustera: An Integrated Computation and Data Management System David DeWitt (UW - Madison), Eric Robinson (UW - Madison), Srinath Shankar (UW - Madison), Erik Paulson (UW - Madison), Jeffrey Naughton (UW - Madison), Andrew Krioukov (UW - Madison), Joshua Royalty (UW - Madison)
Scalable Multi-Query Optimization for Exploratory Queries over Federated Scientific Databases Dieter Van de Craen (Hasselt University), Frank Neven (Hasselt University), Anastasios Kementsietsidis (IBM T.J. Watson Research Center), Stijn Vansummeren (Hasselt University)
On Generating Near-Optimal Tableaux for Conditional Functional Dependencies Lukasz Golab (AT&T Labs - Research), Howard Karloff (AT&T Labs - Research), Flip Korn (AT&T Labs - Research), Divesh Srivastava (AT&T, USA), Bei Yu (Singapore-MIT Alliance (SMA), Singapore)
Maintaining Dynamic Channel Profiles on the Web Haggai Roitman (IBM), David Carmel (IBM-Haifa Research Lab), Elad Yom-Tov (IBM-Haifa Research Lab)
STMark: Towards a Benchmark for Mapping Systems Bogdan Alexe (UC Santa Cruz), Wang-Chiew Tan (UCSC), Yannis Velegrakis (University of Trento)
Plan-based Complex Event Detection across Distributed Sources Mert Akdere (Brown University), Ugur Cetintemel (Brown University), Nesime Tatbul (ETH Zurich)
Finding Relevant Patterns in Bursty Sequences Alexander Lachmann (Cornell University), Mirek Riedewald (Cornell University)
Dynamic Partitioning of the Cache Hierarchy in Shared Data Centers Gokul Soundararajan (University of Toronto), Jin Chen, Mohamed Sharaf (University of Toronto), Cristiana Amza
WebTables: Exploring the Power of Tables on the Web Michael Cafarella (University of Washington), Alon Halevy (Google, Inc.), Daisy Zhe Wang (UC Berkeley), Eugene Wu (MIT), Yang Zhang (MIT)
Resisting Structural Identification in Anonymized Social Networks Michael Hay (University of Massachusetts), Gerome Miklau (UMass Amherst), David Jensen (University of Massachusetts Amherst), Don Towsley (University of Massachusetts Amherst)
Learning to Extract Form Labels Hoa Nguyen (University of Utah), Thanh Nguyen (University of Utah), Juliana Freire (University of Utah)
Graceful Database Schema Evolution: the PRISM Workbench Carlo Curino (Politecnico di Milano), Hyun Moon (UCLA), Carlo Zaniolo (UCLA)
Toward Analyzing and Revising Mediated Schemas to Improve Their Matchability Xiaoyong Chai (University of Wisconsin-Madison), Mayssam Sayyadian (University of Wisconsin-Madison), AnHai Doan (Wisconsin), Arnon Rosenthal (The MITRE Corporation), Len Seligman (The MITRE Corporation)
Learning to Create Data-Integrating Queries Partha Talukdar (University of Pennsylvania), Marie Jacob (University of Pennsylvania), Mohammad Mehmood (University of Pennsylvania), Koby Crammer (University of Pennsylvania), Zachary Ives (University of Pennsylvania, USA), Fernando Pereira (University of Pennsylvania), Sudipto Guha (University of Pennsylvania)
Simrank++: Query Rewriting through Link Analysis of the Click Graph Ioannis Antonellis (Stanford University), Hector Garcia-Molina (Stanford University), Chi-Chao Chang (Yahoo!)
WYSIWYG Development of Data Driven Web Applications Fan Yang (Yahoo), Chavdar Botev (Cornell University), Nitin Gupta (Cornell University), Elizabeth Churchill (Yahoo! Research), Levchenko George, Jayavel Shanmugasundaram (Yahoo! Research)
On the Provenance of Non-Answers to Queries over Extracted Data Jiansheng Huang (Univ. of Wisconsin-Madison), Ting Chen (Univ. of Wisconsin-Madison), AnHai Doan (Wisconsin), Jeffrey Naughton (UW - Madison)
Interactive Source Registration in GLAV-based Information Integration Yannis Katsis (UC San Diego), Alin Deutsch (UCSD), Yannis Papakonstantinou (University of California, San Diego, USA)
Data Exchange with Data-Metadata Translations Mauricio Hernandez (IBM Almaden Research Center), Paolo Papotti (Universita Roma Tre), Wang-Chiew Tan (UCSC)
Online Maintenance of Very Large Random Samples on Flash Storage Suman Nath (Microsoft Research), Phillip Gibbons (Intel Research, Pittsburgh, USA)
A Skip-list Approach for Efficiently Processing Forecasting Queries Tingjian Ge (Brown University), Stan Zdonik (Brown University)
Automated Creation of a Forms-based Database Query Interface Magesh Jayapandian (University of Michigan), H V Jagadish (University of Michigan, Ann Arbor)
Finch: Evaluating Reverse k-Nearest-Neighbor Queries on Location Data Wei Wu (NUS), Fei Yang (NUS), Chee-Yong Chan (National University of Singapore), Kian-Lee Tan (Singapore)
Scalable Ranked Publish/Subscribe Ashwin Machanavajjhala (Cornell University), Erik Vee (Yahoo! Research), Minos Garofalakis (Yahoo Research, USA), Jayavel Shanmugasundaram (Yahoo! Research)
Keyword Search on External Memory Data Graphs Bhavana Dalvi (IIT Bombay), Meghana Kshirsagar (IIT Bombay), S. Sudarshan (IIT Bombay, India)
A Request-Routing Framework for SOA-Based Enterprise Computing Thomas Phan, Wen-Syan Li (SAP Research Center - China)
Multidimensional Content eXploration Alkis Simitsis (Stanford University), Akanksha Baid (University of Wisconsin-Madison), Yannis Sismanis (IBM Research Almaden, USA), Berthold Reinwald (IBM Almaden Research Center)
Sorting Hierarchical Data in External Memory for Archiving Ioannis Koltsidas (University of Edinburgh), Heiko Mueller (University of Edinburgh), Stratis Viglas (University of Edinburgh)
Propagating Functional Dependencies with Conditions Shuai Ma (University of Edinburgh), Wenfei Fan (University of Edinburgh, UK), Yanli Hu (University of Edinburgh), Jie Liu (Chinese Academy of Sciences), Yinghui Wu (University of Edinburgh)
Accuracy Estimate and Optimization Techniques for SimRank Computation Dmitry Lizorkin (ISP RAS), Pavel Velikhov (ISP RAS), Maxim Grinev (ISP RAS), Denis Turdakov (ISP RAS)
Privacy-preserving Anonymization of Set-valued Data Manolis Terrovitis (University of Hong Kong), Nikos Mamoulis (University of Hong Kong), Panos Kalnis (National University of Singapore)
Performance Profiling with EndoScope, an Acquisitional Software Monitoring Framework Alvin Cheung (MIT), Samuel Madden (MIT CSAIL, USA)
Constrained Locally Weighted Clustering Hao Cheng (University of Central Florida), Kien Hua (University of Central Florida), Khanh Vu (University of Central Florida)
Web Page Language Identification Based on URLs Eda Baykan, Monika Henzinger, Ingmar Weber (EPF Lausanne)
Scalable Query Result Caching for Web Applications Charles Garrod (Carnegie Mellon University), Amit Manjhi (Carnegie Mellon University), Bruce Maggs (Carnegie Mellon University), Todd Mowry (Carnegie Mellon University), Anthony Tomasic (Carnegie Mellon University), Christopher Olston (Yahoo! Research), Anastasia Ailamaki (Carnegie Mellon University)
Relaxation in Text Search using Taxonomies Marcus Fontoura, Vanja Josifovski, Ravi Kumar (Yahoo! Research), Christopher Olston (Yahoo! Research), Sergei Vassilvitskii, Andrew Tomkins
Optimization of Multi-Domain Queries on the Web Daniele Braga (Politecnico di Milano), Stefano Ceri (Milan), Florian Daniel (Politecnico di Milano), Davide Martinenghi (Politecnico di Milano)

Industrial Applications and Experience

Title Name (Org)
Industry-Scale Duplicate Detection Melanie Weis (Hasso-Plattner-Institut), Felix Naumann (Hasso-Plattner-Institute), Ulrich Jehle (Schufa), Holger Schuster (Schufa), Jens Lufter (Schufa)
Energy Cost, The Key Challenge of Today’s Datacenters - A Power Analysis of TPC-C Benchmark Results Meikel Poess (Oracle USA), Raghunath Nambiar (Hewlett Packard)
Towards a Physical XML independent XQuery/SQL/XML Engine Zhen Hua Liu (Oracle), Thomas Baby (Oracle), Sivasankaran Chandrasekar (Oracle), Hui Chang (Oracle)
SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets Ronnie Chaiken (Microsoft), Bob Jenkins (Microsoft), Paul Larson (Microsoft Research, USA), Bill Ramsey (Microsoft), Darren Shakib (Microsoft), Simon Weaver (Microsoft), Jingren Zhou (Microsoft Research, USA)
Relational Support for Flexible Schema Scenarios Srini Acharya (Microsoft Corp.), Peter Carlin (Microsoft Corp.), Cesar Galindo-Legaria (Microsoft Corp.), Krzysztof Kozielczyk (Microsoft Corp.), Pawel Terlecki (Microsoft Corp.), Peter Zabback (Microsoft Corp.)
Oracle Securefiles System Niloy Mukherjee (Oracle USA Inc.), Bharath Aleti, Amit Ganesh, Krishna Kunchithapadam, Scott Lynn, Sujatha Muthulingam, Kam Shergill, Shaoyu Wang, Wei Zhang
Optimizer Plan Change Management: Improved Stability and Performance in Oracle 11g Mohamed Ziauddin (Oracle), Dinesh Das (Oracle), Hong Su (Oracle), Yali Zhu (Oracle), Khaled Yagoub (Oracle)
Closing The Query Processing Loop in Oracle 11g Mohamed Zait (Oracle), Allison Lee (Oracle)
Brighthouse: An Analytic Data Warehouse for Ad-hoc Queries Dominik Slezak (INFOBRIGHT), Jakub Wroblewski (INFOBRIGHT), Victoria Eastwood (INFOBRIGHT), Piotr Synak (INFOBRIGHT)
Towards a Streaming SQL Standard Stan Zdonik (Brown University), Namit Jain (Oracle), Shailendra Mishra (Oracle), Anand Srinivasan (Oracle), Johannes Gehrke (Cornell University, USA), Jennifer Widom (Stanford University), Hari Balakrishnan (MIT), Mitch Cherniack (Brandeis University), Ugur Cetintemel (Brown University), Richard Tibbetts (Streambase, Inc.)
Efficiently Approximating Query Optimizer Plan Diagrams Atreyee Dey (Indian Institute of Science), Sourjya Bhaumik (Indian Institute of Science), Harish D (Indian Institute of Science), Jayant Haritsa (Indian Institute of Science)
SLEUTH: Single-pubLisher attack dEtection Using correlaTion Hunting Ahmed Metwally (UCSB), Fatih Emekci (UCSB), Divyakant Agrawal (UCSB), Amr El Abbadi (UCSB)
Surfacing the Deep Web Jayant Madhavan (Google, USA), David Ko (Google Inc.), Lucja Kot (Cornell University), Vignesh Ganapathy (Google Inc.), Alex Rasmussen (University of California - San Diego), Alon Halevy (Google, Inc.)
PNUTS: Yahoo!'s hosted data serving platform Brian Cooper, Raghu Ramakrishnan, Utkarsh Srivastava, Adam Silberstein, Phil Bohannon, Hans-Arno Jacobsen, Nick Puz, Daniel Weaver, Ramana Yerneni
Efficient implementation of sorting on multi-core SIMD CPU architecture William Macy, Akram Baransi, Anthony Nguyen, Jatin Chhugani, Mostafa Hagog, Sanjeev Kumar, Victor Lee, Yen-Kuang Chen, Pradeep Dubey


Title Name (Org)
Capri/MR: Exploring Protein Databases from a Structural and Physicochemical Point of View Eric Paquet (National Research Council), Herna Viktor (University of Ottawa)
EasyTicket: A Ticket Routing Recommendation Engine for Enterprise Problem Resolution Qihong Shao (Arizona State University), Yi Chen (Arizona State University, USA), Shu Tao (IBM T.J. Watson Research Center), Xifeng Yan (IBM T.J. Watson Research Center), Nikos Anerousis (IBM T.J. Watson Research Center)
AJAXSearch: Crawling, Indexing and Searching Web 2.0 Applications Cristian Duda (ETH Zurich), Gianni Frey (ETH Zurich), Donald Kossman (ETH Zurich), Chong Zhou (Huazhong University of Science and Technology, China)
ManyAspects: A System for Highlighting Diverse Concepts in Documents Kun Liu (IBM Almaden Research Center), Evimaria Terzi (IBM Almaden), Tyrone Grandison (IBM Almaden Research)
Xnippet: Generating Query Biased Result Snippet for XML Search Yu Huang (Arizona State University), Ziyang Liu (Arizona State University), Yi Chen (Arizona State University, USA)
QueryScope: Visualizing Queries for Repeatable Database Tuning Ling Hu (Northeastern University), Yuan-chi Chang (IBM T J Watson), Christian Lang (IBM Research), Kenneth Ross (Columbia University), Donghui Zhang (Northeastern)
Large-Scale Collaborative Analysis and Extraction of Web Data Felix Weigel (Cornell University), Biswanath Panda (Cornell University), Mirek Riedewald (Cornell University), Johannes Gehrke (Cornell University, USA)
C-DEM: A Multi-Modal Query System for Drosophila Embryo Databases Fan Guo (Carnegie Mellon University), Lei Li (Carnegie Mellon University), Eric Xing (Carnegie Mellon University), Christos Faloutsos (Carnegie Mellon University, USA)
Language-Integrated Querying of XML Data in SQL Server James Terwilliger (Portland State University), Sergey Melnik (Microsoft), Philip Bernstein (Microsoft)
AuditGuard: A system for database auditing under retention restrictions Wentian Lu (University of Massachusetts), Gerome Miklau (UMass Amherst)
An Effective and Versatile Keyword Search Engine on Heterogeneous Data Sources Guoliang Li (Tsinghua University), Jianhua Feng, Jianyong Wang (Tsinghua, China), Lizhu Zhou (Tsinghua University)
XTCcmp: XQuery Compilation on XTC Christian Mathis (University of Kaiserslautern), Andreas Weiner (University of Kaiserslautern), Theo Härder (TU Kaiserslautern), Caesar Ralf Franz Hoppen (University of Kaiserslautern)
When is it Time to Rethink the Aggregate Configuration of Your OLAP Server? Katja Hose (TU Ilmenau), Daniel Klan (TU Ilmenau), Matthias Marx (TU Ilmenau), Kai-Uwe Sattler (TU Ilmenau)
DBPubs: Multidimensional Exploration of Database Publications Akanksha Baid (University of Wisconsin-Madison), Andrey Balmin (IBM Almaden Research Center), Heasoo Hwang (UC San Diego), Erik Nijkamp (IBM Germany), Jun Rao (IBM Almaden Research Center), Berthold Reinwald (IBM Almaden Research Center), Alkis Simitsis (Stanford University), Yannis Sismanis (IBM Research Almaden, USA), Frank Van Ham (IBM Cambridge)
RIDE: A Tool for Interactive Source Registration in GLAV-based Information Integration Yannis Katsis (UC San Diego), Alin Deutsch (UCSD), Yannis Papakonstantinou (University of California, San Diego, USA), Keliang Zhao (UC San Diego)
Comparing and Evaluating Mapping Systems with STMark Bogdan Alexe (UC Santa Cruz), Wang-Chiew Tan (UCSC), Yannis Velegrakis (University of Trento)
DObjects: Enabling Distributed Data Services for Metacomputing Platforms Pawel Jurczyk (Emory University), Li Xiong (Emory University)
Organizing and Indexing Non-Convex Regions Eric Perlman (Johns Hopkins University), Randal Burns (Johns Hopkins University), Michael Kazhdan (Johns Hopkins University)
Periscope/GQ: A Graph Querying Toolkit Yuanyuan Tian (University of Michigan), Jignesh Patel (Michigan), Viji Nair (University of Michigan), Sebastian Martini (University of Michigan), Matthias Kretzler (University of Michigan)
H-Store: A High-Performance, Distributed Main Memory Transaction Processing System Robert Kallman (Brown University), Jonathan Natkins (Brown University), Hideaki Kimura, Andrew Pavlo (Brown University), Alexander Rasin (Brown University), Stan Zdonik (Brown University), Evan Jones (MIT), Samuel Madden (MIT CSAIL, USA), Michael Stonebraker (MIT), Daniel Abadi (Yale University, USA)
SEDA: A System for Search, Exploration, Discovery, and Analysis of XML Data Andrey Balmin (IBM Almaden Research Center), Latha Colby (IBM Almaden Research Center, USA), Emiran Curtmola (UC, San Diego), Quanzhong Li (IBM), Fatma Ozcan (IBM Almaden Research Center), Sharath Srinivas (University of Maryland, College Park), Zografoula Vagena (Microsoft Research, Cambridge)
Making SENSE: Socially Enhanced Search and Exploration Tom Crecelius (MPI Informatik), Mouna Kacimi (MPI Informatik), Sebastian Michel (EPFL), Thomas Neumann (Max-Planck-Institut Informatik), Josiane Xavier Parreira, Ralf Schenkel (Max-Planck Institute of Computer Science, Germany), Gerhard Weikum
P3N: Profiling the Potential of a Peer-based Data Management System Mihai Lupu (NUS), Y. C. Tay (National University of Singapore)
P2P Logging and Timestamping for Reconciliation Mounir TLILI (Atlas team, INRIA and LINA), Kokou DEDZOE (INRIA), Esther Pacitti (LINA), Patrick Valduriez (INRIA and LINA, University of Nantes), Reza Akbarinia (University of Waterloo)
AlvisP2P: Scalable Peer-to-Peer Text Retrieval in a Structured P2P Network Toan Luu (EPFL), Gleb Skobeltsyn (EPFL), Fabius Klemm (EPFL), Maroje Puh (University of Zagreb), Ivana Podnar Zarko (University of Zagreb), Martin Rajman (EPFL), Karl Aberer (EPFL Lausanne)
Ad-Hoc Data Processing in the Cloud Dionysios Logothetis (UCSD), Kenneth Yocum (UCSD)
WebContent: Efficient P2P Warehousing of Web Data Serge Abiteboul (INRIA), Tristan Allard (Univ. Versailles, France), Philippe Chatalic (INRIA Saclay--Île-de-France), Georges Gardarin (Univ. Versailles, France), Anca Ghitescu (INRIA Saclay--Île-de-France), Francois Goasdoué (LRI, Université Paris Sud and INRIA Saclay--Île-de-France), Ioana Manolescu (INRIA, France), Benjamin Nguyen (Univ. Versailles, France), Mohamed Ouazara (INRIA Saclay--Île-de-France), Aditya Somani (IIT Bombay, India), Nicolas Travers (Conservatoire National des Arts et Métiers), Gabriel Vasile (INRIA Saclay--Île-de-France), Spyros Zoupanos (INRIA Saclay--Île-de-France)
XTreeNet: Democratic Community Search Emiran Curtmola (UC, San Diego), Alin Deutsch (UCSD), Kadangode Ramakrishnan (AT&T Research), Divesh Srivastava (AT&T, USA), Kenneth Yocum (UCSD), Dionysios Logothetis (UCSD)
Process Spaceship: Discovering and Exploring Process Views from Event Logs in Data Spaces Hamid Reza Motahari Nezhad (UNSW), Boualem Benatallah (UNSW), Fabio Casati (University of Trento), Periklis Andritsos (University of Trento, Italy), Regis Saint-Paul (CREATE-NET)
Semandaq: A Data Quality System Based on Conditional Functional Dependencies Wenfei Fan (University of Edinburgh, UK), Floris Geerts (University of Edinburgh, UK), Xibei Jia (The University Of Edinburgh)

Experiments and Analyses

Title Name (Org)
Finding Frequent Items in Data Streams Graham Cormode (AT&T Labs, USA), Marios Hadjieleftheriou (AT&T Labs Inc.)
A Benchmark for Evaluating Moving Objects Indexes Su Chen (NUS), Dan Lin (Purdue University, USA), Christian Jensen (Aalborg University)
Column-Store Support for RDF Data Management Lefteris Sidirourgos (CWI, Amsterdam, The Netherlands), Romulo Goncalves (CWI, Amsterdam, The Netherlands), Martin Kersten (CWI, Amsterdam, The Netherlands), Niels Nes (CWI, Amsterdam, The Netherlands), Stefan Manegold (CWI)
Dwarfs in the Rearview Mirror: How Big are they Really? Jens Dittrich (ETH Zurich), Lukas Blunschi (ETH Zurich), Marcos Antonio Vaz Salles (ETH Zurich)
Prefix based numbering schemes for XML : Techniques, Applications and Performances Virginie Sans (ETIS/CNRS laboratory)
Querying and Mining of Time Series Data: Experimental Comparison of Representations and Distance Hui Ding (Northwestern University), Goce Trajcevski (Northwestern University), Peter Scheuermann (Northwestern University), Xiaoyue Wang (University of California, Riverside), Eamonn Keogh (UCR)


Title Name (Org)
Querying and Monitoring Distributed Business Processes Tova Milo and Daniel Deutch (Tel Aviv University, Israel)
Dataspaces Michael Franklin (University of California, Berkeley, USA), Alon Halevy (Google) and David Maier (Portland State University, USA)
A Revival of Integrity Constraints for Data Cleaning Wenfei Fan (University of Edinburgh, UK and Bell Labs, USA) and Floris Geerts (University of Edinburgh, UK)
Ontologies and Databases: myths and challenges Enrico Franconi (Free University of Bozen-Bolzano, Italy)
Systems Aspects of Probabilistic Data Management Magdalena Balazinska, Christopher Re and Dan Suciu (University of Washington, USA)
XML Structural Summaries Mirella M. Moro (Univ. Fed. Rio Grande do Sul, Brazil), Zografoula Vagena (Microsoft Research, UK) and Vassilis J. Tsotras (University of California Riverside, USA)
Detecting Clusters in Moderate-to-High Dimensional Data: Subspace Clustering, Pattern-based Clustering, and Correlation Clustering Hans-Peter Kriegel, Peer Kröger, Arthur Zimek (Ludwig-Maximilians-Universität München, Germany)
Scheduling Continuous Queries in Data Stream Management Systems Mohamed A. Sharaf (University of Toronto, Canada), Alexandros Labrinidis (University of Pittsburgh, USA), Panos K. Chrysanthis (University of Pittsburgh, USA)

PhD Workshop

Title Author
Studying Interaction Methodologies in Video Retrieval Hopfgartner, Frank
XML-Document-Filtering Automaton Silvasti, Panu
Towards Efficient Main-Memory Use For Optimum Tree Index Update Biveinis, Laurynas
Implementing Filesystems by tree-aware DBMSs Holupirek, Alexander
Adaptive Workflow Scheduling Under Resource Allocation Constraints and Network Dynamics Avanes, Artin
Incompleteness in Information Integration Kharlamov, Evgeny
GS-TMS: A Global Stream-based Threat Monitor System Miao, Jiajia
Community-Driven Data Grids Scholl, Tobias
Challenges and Techniques for Effective and Efficient Similarity Search in Large Video Databases Shao, Jie
Querying Web-Based Applications Under Models of Uncertainty Deutch, Daniel
Privacy Preserving Document Indexing Infrastructure for a Distributed Environment Zerr, Sergej
Mining Patterns and Rules for Software Specification Discovery Lo, David