Table of Contents
Invited Talks
Tradeoffs between Parallel Database Systems, Hadoop, and HadoopDB as Platforms for Petabyte-Scale Analysis Daniel J. Abadi 1
Emerging Trends and Converging Technologies in Data Intensive Scalable Computing Roger S. Barga 4
Query Processing
Deriving Spatio-temporal Query Results in Sensor Networks Markus Bestehorn Klemens Böhm Patrick Bradley Erik Buchmann 6
Efficient and Adaptive Distributed Skyline Computation George Valkanas Apostolos N. Papadopoulos 24
On the Efficient Construction of Multislices from Recurrences Romans Kasperovics Michael H. Böhlen Johann Gamper 42
Optimizing Query Processing in Cache-Aware Wireless Sensor Networks Mario A. Nascimento Romulo A.E. Alencar Angelo Brayner 60
Approximate Query Answering and Result Refinement on XML Data Katja Seidler Eric Peukert Gregor Hackenbroich Wolfgang Lehner 78
Efficient and Scalable Method for Processing Top-k Spatial Boolean Queries Ariel Cary Ouri Wolfson Naphtali Rishe 87
Scientific Data Management and Analysis
A Framework for Moving Sensor Data Query and Retrieval of Dynamic Atmospheric Events Shen-Shyang Ho Wenqing Tang W. Timothy Liu Markus Schneider 96
Client + Cloud: Evaluating Seamless Architectures for Visual Data Analytics in the Ocean Sciences Keith Grochow Bill Howe Mark Stoermer Roger Barga Ed Lazowska 114
Scalable Clustering Algorithm for N-Body Simulations in a Shared-Nothing Cluster YongChul Kwon Dylan Nunley Jeffrey P. Gardner Magdalena Balazinska Bill Howe Sarah Loebman 132
Database Design for High-Resolution LIDAR Topography Data Viswanath Nandigam Chaitan Baru Christopher Crosby 151
PetaScope: An Open-Source Implementation of the OGC WCS Geo Service Standards Suite Andrei Aiordachioaie Peter Baumann 160
Towards Archaeo-Informatics: Scientific Data Management for Archaeobiology Hans-Peter Kriegel Peer Kröger Christiaan Hendrikus van der Meijden Henriette Obermaier Joris Peters Matthias Renz 169
Data Mining
DESSIN: Mining Dense Subgraph Patterns in a Single Graph Shirong Li Shijie Zhang Jiong Yang 178
Discovery of Evolving Convoys Htoo Htet Aung Kian-Lee Tan 196
Finding Top-k Similar Pairs of Objects Annotated with Terms from an Ontology Arnab Bhattacharya Abhishek Bhowmick Ambuj K. Singh 214
Identifying the Most Influential User Preference from an Assorted Collection Hua Lu Linhao Xu 233
MC-Tree: Improving Bayesian Anytime Classification Philipp Kranen Stephan Günnemann Sergej Fries Thomas Seidl 252
Non-intrusive Quality Analysis of Monitoring Data Mark Brightwell Anastasia Ailamaki Anna Suwalska 270
Visual Decision Support for Ensemble Clustering Martin Hahmann Dirk Habich Wolfgang Lehner 279
Indexes and Data Representation
An Indexing Scheme for Fast and Accurate Chemical Fingerprint Database Searching Zeyar Aung See-Kiong Ng 288
BEMC: A Searchable, Compressed Representation for Large Seismic Wavefields Julio López Leonardo Ramírez-Guzmán Jacobo Bielak David O'Hallaron 306
Dynamic Data Reorganization for Energy Savings in Disk Storage Systems Ekow Otoo Doron Rotem Shih-Chiang Tsao 322
Organization of Data in Non-convex Spatial Domains Eric Perlman Randal Burns Michael Kazhdan Rebecca R. Murphy William P. Ball Nina Amenta 342
PrefIndex: An Efficient Supergraph Containment Search Technique Gaoping Zhu Xuemin Lin Wenjie Zhang Wei Wang Haichuan Shang 360
Supporting Web-Based Visual Exploration of Large-Scale Raster Geospatial Data Using Binned Min-Max Quadtree Jianting Zhang Simin You 379
Scientific Workflow and Provenance
Bridging Workflow and Data Provenance Using Strong Links David Koop Emanuele Santos Bela Bauer Matthias Troyer Juliana Freire Cláudio T. Silva 397
LIVE: A Lineage-Supported Versioned DBMS Anish Das Sarma Martin Theobald Jennifer Widom 416
Optimizing Resource Allocation for Scientific Workflows Using Advance Reservations Christoph Langguth Heiko Schuldt 434
A Fault-Tolerance Architecture for Kepler-Based Distributed Scientific Workflows Pierre Mouallem Daniel Crawl Ilkay Altintas Mladen Vouk Ustun Yildiz 452
Provenance Context Entity (PaCE): Scalable Provenance Tracking for Scientific RDF Data Satya S. Sahoo Olivier Bodenreider Pascal Hitzler Amit Sheth Krishnaprasad Thirunarayan 461
Taverna, Reloaded Paolo Missier Stian Soiland-Reyes Stuart Owen Wei Tan Alexandra Nenadic Ian Dunlop Alan Williams Tom Oinn Carole Goble 471
Similarity
Can Shared-Neighbor Distance Defeat the Curse of Dimensionality? Michael E. Houle Hans-Peter Kriegel Peer Kröger Erich Schubert Arthur Zimek 482
Optimizing All-Nearest Neighbor Queries with Trigonometric Pruning Tobias Emrich Franz Graf Hans-Peter Kriegel Matthias Schubert Marisa Thoma 501
Prefix Tree Indexing for Similarity Search and Similarity Joins on Genomic Data Astrid Rheinländer Martin Knobloch Nicky Hochmuth Ulf Leser 519
Similarity Estimation Using Bayes Ensembles Tobias Emrich Franz Graf Hans-Peter Kriegel Matthias Schubert Marisa Thoma 537
Subspace Similarity Search: Efficient k-NN Queries in Arbitrary Subspaces Thomas Bernecker Tobias Emrich Franz Graf Hans-Peter Kriegel Peer Kröger Matthias Renz Erich Schubert Arthur Zimek 555
Data Stream Processing
Continuous Skyline Monitoring over Distributed Data Streams Hua Lu Yongluan Zhou Jonas Haustad 565
Propagation of Densities of Streaming Data within Query Graphs Michael Daum Frank Lauterwald Philipp Baumgärtel Klaus Meyer-Wegener 584
Spatio-temporal Event Stream Processing in Multimedia Communication Systems Mingyan Gao Xiaoyan Yang Ramesh Jain Beng Chin Ooi 602
Stratified Reservoir Sampling over Heterogeneous Data Streams Mohammed Al-Kateb Byung Suk Lee 621
Tree Induction over Perennial Objects Zaigham Faraz Siddiqui Myra Spiliopoulou 640
Author Index 659