Date of this Version
Background RNA molecules fold into complex three-dimensional shapes, guided by the pattern of hydrogen bonding between nucleotides. This pattern of base pairing, known as RNA secondary structure, is critical to their cellular function. Recently several diverse methods have been developed to assay RNA secondary structure on a transcriptome-wide scale using high-throughput sequencing. Each approach has its own strengths and caveats, however there is no widely available tool for visualizing and comparing the results from these varied methods. Methods To address this, we have developed Structure Surfer, a database and visualization tool for inspecting RNA secondary structure in six transcriptome-wide data sets from human and mouse (http://tesla.pcbi.upenn.edu/strucuturesurfer/). The data sets were generated using four different high-throughput sequencing based methods. Each one was analyzed with a scoring pipeline specific to its experimental design. Users of Structure Surfer have the ability to query individual loci as well as detect trends across multiple sites. Results Here, we describe the included data sets and their differences. We illustrate the database’s function by examining known structural elements and we explore example use cases in which combined data is used to detect structural trends. Conclusions In total, Structure Surfer provides an easy-to-use database and visualization interface for allowing users to interrogate the currently available transcriptome-wide RNA secondary structure information for mammals.
This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
Berkowitz, N. D., Silverman, I. M., Childress, D. M., Kazan, H., Wang, L., & Gregory, B. D. (2016). A Comprehensive Database of High-Throughput Sequencing-Based RNA Secondary Structure Probing Data (Structure Surfer). BMC Bioinformatics, 17 (215), http://dx.doi.org/10.1186/s12859-016-1071-0
Additional FilesFigure S1_A comprehensive database of high-throughput sequencing-based RNA.jpg (269 kB)
Figure 2_A comprehensive database of high-throughput sequencing-based RNA.jpg (115 kB)
Table S1_A comprehensive database of high-throughput sequencing-based RNA.docx (39 kB)
Table S2_A comprehensive database of high-throughput sequencing-based RNA.docx (59 kB)
Date Posted: 14 July 2017
This document has been peer reviewed.