Departmental Papers (CIS)

Date of this Version


Document Type

Conference Paper


The Orchestra Collaborative Data Sharing System. ACM SIGMOD Record. Zachary Ives, Todd J. Green, Grigoris Karvounarakis, Nicholas Taylor, Val Tannen, Partha Pratim Talukdar, Marie Jacob, and Fernando Pereira (2008).

doi: 10.1145/1462571.1462577

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Copyright 2008, held by the authors.


Sharing structured data today requires standardizing upon a single schema, then mapping and cleaning all of the data. This results in a single queryable mediated data instance. However, for settings in which structured data is being collaboratively authored by a large community, e.g., in the sciences, there is often a lack of consensus about how it should be represented, what is correct, and which sources are authoritative. Moreover, such data is seldom static: it is frequently updated, cleaned, and annotated. The ORCHESTRA collaborative data sharing system develops a new architecture and consistency model for such settings, based on the needs of data sharing in the life sciences. In this paper we describe the basic architecture and implementation of the ORCHESTRA system, and summarize some of the open challenges that arise in this setting.



Date Posted: 24 July 2012