
Database Research Group (CIS)
Penn's database group is one of the top database research groups in the US and has made many fundamental contributions to the field, particularly in scientific data management, Web data management, and data provenance. Our research ranges from theory to systems and applications, and connects to other research areas within Penn's CIS Department (such as machine learning, programming languages, logic and computation, and approximation algorithms). Within the University, we also collaborate frequently with researchers in bioinformatics and genomics.
Papers from 2009
Containment of Conjunctive Queries on Annotated Relations, Todd J. Green
Reconcilable Differences, Todd J. Green, Zachary G. Ives, and Val Tannen
Modeling and Analysis of Multi-hop Control Networks, Rajeev Alur, Alessandro D'Innocenzo, Karl H. Johansson, George James Pappas, and Gera Weiss
Papers from 2008
Annotated XML: Queries and Provenance, John N. Foster, Todd J. Green, and Val Tannen
Sideways Information Passing for Push-Style Query Processing, Zachary G. Ives and Nicholas E. Taylor
A Substrate for In-Network Sensor Data Integration, Svilen Mihaylov, Marie Jacob, Zachary G. Ives, and Sudipto Guha
Papers from 2007
Addressing the Provenance Challenge using ZOOM, Sarah Cohen-Boulakia, Olivier Biton, Shirley Cohen, and Susan B. Davidson
BioGuideSRS: Querying Multiple Sources with a user-centric perspective, Sarah Cohen-Boulakia, Olivier Biton, Susan B. Davidson, and Christine Froidevaux
Provenance Semirings, Todd J. Green, Grigoris Karvounarakis, and Val Tannen
Orchestra: Facilitating Collaborative Data Sharing, Todd J. Green, Grigoris Karvounarakis, Nicholas E. Taylor, Olivier Biton, Zachary G. Ives, and Val Tannen
Papers from 2006
Implementing Mapping Composition, Philip A. Bernstein, Todd J. Green, Sergey Melnik, and Alan Nash
Path-based systems to guide scientists in the maze of biological data sources, Sarah Cohen-Boulakia, Susan B. Davidson, Christine Froidevaux, Zoe Lacroix, and Maria-Esther Vidal
Selecting Biological Data Sources and Tools with XPR, a Path Language for RDF, Sarah Cohen-Boulakia, Christine Froidevaux, and Emmanuel Pietriga
Towards a Model of Provenance and User Views in Scientific Workflows, Shirley Cohen, Sarah Cohen-Boulakia, and Susan B. Davidson
Models for Incomplete and Probabilistic Information, Todd J. Green and Val Tannen
Reconciling while Tolerating Disagreement in Collaborative Data Sharing, Nicholas E. Taylor and Zachary G. Ives
Papers from 2005
Extending XPath to Support Linguistic Queries, Steven Bird, Yi Chen, Susan B. Davidson, Haejoong Lee, and Yifeng Zheng
A User-centric Framework for Accessing Biological Sources and Tools, Sarah Cohen-Boulakia, Susan B. Davidson, and Christine Froidevaux
Selecting biomedical data sources according to user preferences, Sarah Cohen-Boulakia, Severine Lair, Nicolas Stransky, Stephane Graziani, Francois Radvanyi, Emmanuel Barillot, and Christine Froidevaux
Papers from 2004
BLAS: An Efficient XPath Processing System, Yi Chen, Susan B. Davidson, and Yifeng Zheng
EXPedite: A System for Encoded XML Processing, Yi Chen, George A. Mihaila, Susan B. Davidson, and Sriram Padmanabhan
L-Tree: a Dynamic Labeling Structure for Ordered XML Data, Yi Chen, George Mihaila, Rajesh Bordawekar, and Sriram Padmanabhan
Optimizing Taxonomic Semantic Web Queries using Labeling Schemes, Vassilis Christophides, Grigoris Karvounarakis, Dimitris Plexousakis, Michel Scholl, and Sotiris Tourtounis
Processing XML Streams with Deterministic Automata and Stream Indexes, Todd J. Green, Ashish Gupta, Gerome Miklau, Makoto Onizuka, and Dan Suciu
The Piazza peer data management system, Alon Y. Halevy, Zachary G. Ives, Jayant Madhavan, Peter Mork, Dan Suciu, and Igor Tatarinov
Reasoning about functional and key dependencies in hierarchically structured data, Carmem Satie Hara
Piazza: Mediation and Integration Infrastructure for Semantic Web Data, Zachary G. Ives, Alon Y. Halevy, Peter Mork, and Igor Tatarinov
Adapting to Source Properties in Processing Data Integration Queries, Zachary G. Ives, Alon Y. Halevy, and Daniel S. Weld
Papers from 2003
The ICS-FORTH SWIM: A Powerful Semantic Web Integration Middleware, Vassilis Christophides, Grigoris Karvounarakis, I. Koffina, G. Kokkinidis, A. Magkanaraki, Dimitris Plexousakis, G. Serfiotis, and Val Tannen
Propagating XML Constraints to Relations, Susan B. Davidson, Wenfei Fan, Carmem Hara, and Jing Qin
MARS: A System for Publishing XML from Mixed and Redundant Storage, Alin Deutsch and Val Tannen
Crossing the Structure Chasm, Oren Etzioni, Alon Halevy, Anhai Doan, Zachary G. Ives, Jayant Madhavan, Luke McDowell, and Igor Tatarinov
Processing XML Streams with Deterministic Automata, Todd J. Green, Gerome Miklau, Makoto Onizuka, and Dan Suciu
Schema Mediation in Peer Data Management Systems, Alon Halevy, Zachary G. Ives, Dan Suciu, and Igor Tatarinov
Piazza: Data Management Infrastructure for Semantic Web Applications, Alon Y. Halevy, Zachary G. Ives, Peter Mork, and Igor Tatarinov
The Piazza Peer Data Management Project, Igor Tatarinov, Zachary G. Ives, Jayant Madhavan, Alon Halevy, Dan Suciu, Nilesh Dalvi, Xin (Luna) Dong, Yana Kadiyska, Gerome Miklau, and Peter Mork
Papers from 2002
XMLTK: An XML Toolkit for Scalable XML Stream Processing, Iliana Avila-Campillo, Todd J. Green, Ashish Gupta, Makoto Onizuka, Demian Raven, and Dan Suciu
Archiving Scientific Data, Peter Buneman, Sanjeev Khanna, Keishi Tajima, and Wang-Chiew Tan
On Propagation of Deletions and Annotations Through Views, Peter Buneman, Sanjeev Khanna, and Wang-Chiew Tan
Validating Constraints in XML, Yi Chen, Susan B. Davidson, and Yifeng Zheng
XKvalidator: A Constraint Validator For XML, Yi Chen, Susan B. Davidson, and Yifeng Zheng
What Are Real DTDs Like?, Byron Choi
XML query reformulation over mixed and redundant storage, Alin Bernard Deutsch
Querying XML With Mixed and Redundant Storage, Alin Deutsch and Val Tannen
An XML Query Engine for Network-Bound Data, Zachary G. Ives, Alon Y. Halevy, and Daniel S. Weld
Interviewing During a Tight Job Market, Zachary G. Ives and Qiong Luo
ubQL: A distributed query language to program distributed query systems, Arnaud Sahuguet
Data annotations, provenance, and archiving, Wang-Chiew Tan
Papers from 2001
Reasoning about Keys for XML, Peter Buneman, Susan B. Davidson, Wenfei Fan, Carmem Hara, and Wang-Chiew Tan
Why and Where: A Characterization of Data Provenance, Peter Buneman, Sanjeev Khanna, and Wang-Chiew Tan
Indexing Keys in Hierarchical Data, Yi Chen, Susan B. Davidson, and Yifeng Zheng
Beyond Discrete E-Services: Composing Session-oriented Services in Telecommunications, Vassilis Christophides, Richard Hull, Grigoris Karvounarakis, Akhil Kumar, Geliang Tong, and Ming Xiong
K2/Kleisli and GUS: Experiments in Integrated Access to Genomic Data Sources, Susan B. Davidson, Jonathan Crabtree, Brian P. Brunk, Jonathan Schug, Val Tannen, Chris Overton, and Christian J. Stoeckert
Containment and Integrity Constraints for XPath Fragments, Alin Deutsch and Val Tannen
Integrating Network-Bound XML Data, Zachary G. Ives, Alon Y. Halevy, and Daniel S. Weld
On Computing Functions with Uncertainty, Sanjeev Khanna and Wang-Chiew Tan
Building Intelligent Web Applications Using Lightweight Wrappers, Arnaud Sahuguet and Fabien Azavant
ubQL, a Language for Programming Distributed Query Systems, Arnaud Sahuguet and Val Tannen
Updating XML, Igor Tatarinov, Zachary G. Ives, Alon Y. Halevy, and Daniel S. Weld
Papers from 2000
Towards A Query Language for Annotation Graphs, Steven Bird, Peter Buneman, and Wang-Chiew Tan
Reasoning About Keys for XML, Peter Buneman, Susan B. Davidson, Wenfei Fan, Carmem Hara, and Wang-Chiew Tan
Data Provenance: Some Basic Issues, Peter Buneman, Sanjeev Khanna, and Wang-Chiew Tan
Adaptive Query Processing for Internet Applications, Zachary G. Ives, Alon Y. Levy, Daniel S. Weld, Daniela Florescu, and Marc Friedman
View Maintenance for Hierarchical Semistructured Data, Hartmut Liefke and Susan B. Davidson
Object/relational query optimization with chase and backchase, Lucian Popa
A Chase Too Far?, Lucian Popa, Alin Deutsch, Arnaud Sahuguet, and Val Tannen
Papers from 1999
Taming Web Sources with "Minute-Made" Wrappers, Fabien Azavant and Arnaud Sahuguet
Physical Data Independence, Constraints and Optimization with Universal Plans, Alin Deutsch, Lucian Popa, and Val Tannen
Path constraints for databases with or without schemas, Wenfei Fan
An Adaptive Query Execution System for Data Integration, Zachary G. Ives, Daniela Florescu, Marc Friedman, Alon Levy, and Daniel S. Weld
Web Ecology: Recycling HTML pages as XML documents using W4F, Arnaud Sahuguet and Fabien Azavant
WysiWyg Web Wrapper Factory (W4F), Arnaud Sahuguet and Fabien Azavant
HOLON/CADSE: Integrating Open Software Standards and Formal Methods to Generate Guideline-Based Decision Support Agents, Barry G. Silverman, Alex Wong, Lance Lang, Allan Khoury, Keith Campbell, Val Tannen, Arnaud Sahuguet, and Chen Qiang
An Equational Chase for Path-Conjunctive Queries, Constraints, and Views, Val Tannen and Lucian Popa
Papers from 1998
Beyond XML Query Languages, Peter Buneman, Alin Deutsch, Wenfei Fan, Hartmut Liefke, Arnaud Sahuguet, and Wang-Chiew Tan
Semantics of Database Transformations, Susan Davidson, Peter Buneman, and Anthony S. Kosky
Papers from 1997
Adding Structure to Unstructured Data, Peter Buneman, Susan B. Davidson, Mary Fernandez, and Dan Suciu
WOL: A Language for Database Transformations and Constraints, Susan B. Davidson and Anthony S. Kosky
Papers from 1996
Effecting Database Transformations Using Morphase, Susan B. Davidson and Anthony S. Kosky
Papers from 1995
A Data Transformation System for Biological Data Sources, Peter Buneman, Susan B. Davidson, Kyle Hart, Chris Overton, and L. Wong
Transforming Databases with Recursive Data Structures, Anthony S. Kosky
Papers from 1994
Facilitating Transformations in a Human Genome Project Database, Susan B. Davidson, Anthony Kosky, and B. Eckman
Papers from 1992
Theoretical Aspects of Schema Merging, Peter Buneman, Susan B. Davidson, and Anthony Kosky
Papers from 1991
Modeling and Merging Database Schemas, Anthony S. Kosky
Papers from 1988
Partial Computation in Real-Time Database Systems, Susan B. Davidson and Aaron Watters