Provenance for Aggregate Queries

dc.contributor.author: Amsterdamer, Yael
dc.contributor.author: Deutch, Daniel
dc.contributor.author: Tannen, Val
dc.date: 2023-05-17T07:14:10.000
dc.date.accessioned: 2023-05-22T12:50:08Z
dc.date.available: 2023-05-22T12:50:08Z
dc.date.issued: 2011-06-13
dc.date.submitted: 2012-07-24T11:54:38-07:00
dc.description.abstract: In this paper we study provenance information for queries with aggregation. Provenance information was studied in the context of various query languages that do not allow for aggregation, and recent work has suggested capturing provenance by annotating the different database tuples with elements of a commutative semiring and propagating the annotations through query evaluation. We show that aggregate queries pose novel challenges that render this approach inapplicable. Consequently, we propose a new approach, where we annotate with provenance information not just tuples but also the individual values within tuples, using provenance to describe the values' computation. We realize this approach in a concrete construction, first for “simple” queries where the aggregation operator is the last one applied, and then for arbitrary (positive) relational algebra queries with aggregation; the latter queries are shown to be more challenging in this context. Finally, we use aggregation to encode queries with difference, and study the semantics obtained for such queries on provenance-annotated databases.
dc.description.comments: Yael Amsterdamer, Daniel Deutch, and Val Tannen. 2011. Provenance for aggregate queries. In Proceedings of the thirtieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems (PODS '11). ACM, New York, NY, USA, 153-164. © ACM, 2011. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in Proceedings of the thirtieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems (2011), doi: http://dx.doi.org/10.1145/1989284.1989302. Email: permissions@acm.org
dc.identifier.uri: https://repository.upenn.edu/handle/20.500.14332/6710
dc.legacy.articleid: 1683
dc.legacy.fulltexturl: https://repository.upenn.edu/cgi/viewcontent.cgi?article=1683&context=cis_papers&unstamped=1
dc.source.issue: 645
dc.source.journal: Departmental Papers (CIS)
dc.source.status: published
dc.subject.other: Computer Sciences
dc.title: Provenance for Aggregate Queries
dc.type: Presentation
digcom.contributor.author: Amsterdamer, Yael
digcom.contributor.author: Deutch, Daniel
digcom.contributor.author: isAuthorOfPublication|email:val@cis.upenn.edu|institution:University of Pennsylvania|Tannen, Val
digcom.identifier: cis_papers/645
digcom.identifier.contextkey: 3126837
digcom.identifier.submissionpath: cis_papers/645
digcom.type: conference
dspace.entity.type: Publication
relation.isAuthorOfPublication: 9ed4699c-5b2b-4655-8ddb-abd0c3312402
relation.isAuthorOfPublication.latestForDiscovery: 9ed4699c-5b2b-4655-8ddb-abd0c3312402
upenn.schoolDepartmentCenter: Departmental Papers (CIS)
Files
Original bundle
Name: 1101.1110v1.pdf
Size: 353.4 KB
Format: Adobe Portable Document Format
Collection