Crimson: A Data Management System to Support Evaluating Phylogenetic Tree Reconstruction Algorithms

Loading...
Thumbnail Image
Penn collection
Departmental Papers (CIS)
Degree type
Discipline
Subject
databases
phylogenetics
Funder
Grant number
License
Copyright date
Distributor
Related resources
Author
Zheng, Yifeng
Fisher, Stephen
Cohen, Shirley
Guo, Sheng
Kim, Junhyong
Contributor
Abstract

Evolutionary and systems biology increasingly rely on the construction of large phylogenetic trees which represent the relationships between species of interest. As the number and size of such trees increases, so does the need for efficient data storage and query capabilities. Although much attention has been focused on XML as a tree data model, phylogenetic trees differ from document-oriented applications in their size and depth, and their need for structure based queries rather than path-based queries. This paper focuses on Crimson, a tree storage system for phylogenetic trees used to evaluate phylogenetic tree reconstruction algorithms within the context of the NSF CIPRes project. A goal of the modeling component of the CIPRes project is to construct a huge simulation tree representing a "gold standard" of evolutionary history against which phylogenetic tree reconstruction algorithms can be tested. In this demonstration, we highlight our storage and indexing strategies and show how Crimson is used for benchmarking phylogenetic tree reconstruction algorithms. We also show how our design can be used to support more general queries over phylogenetic trees.

Advisor
Date of presentation
2006-09-01
Conference name
Departmental Papers (CIS)
Conference dates
2023-05-17T00:20:33.000
Conference location
Date Range for Data Collection (Start Date)
Date Range for Data Collection (End Date)
Digital Object Identifier
Series name and number
Volume number
Issue number
Publisher
Publisher DOI
Journal Issue
Comments
Postprint version. Published in 32nd International Conference on Very Large Data Bases, ACM 2006, September 12-15, 2006, pages 1231-1234. Publisher URL: http://www.vldb.org/dblp/db/conf/vldb/vldb2006.html
Recommended citation
Collection