Leung, Yuk Y

Email Address
ORCID
Disciplines
Research Projects
Organizational Units
Position
Introduction
Research Interests

Search Results

Now showing 1 - 4 of 4
  • Publication
    HAMR: High-Throughput Annotation of Modified Ribonucleotides
    (2013-12-01) Ryvkin, Paul; Leung, Yuk Y; Childress, Micah; Valladares, Otto; Gregory, Brian D; Wang, Li-San; Silverman, Ian M; Dragomir, Isabelle
    RNA is often altered post-transcriptionally by the covalent modification of particular nucleotides; these modifications are known to modulate the structure and activity of their host RNAs. The recent discovery that an RNA methyl-6 adenosine demethylase (FTO) is a risk gene in obesity has brought to light the significance of RNA modifications to human biology. These noncanonical nucleotides, when converted to cDNA in the course of RNA sequencing, can produce sequence patterns that are distinguishable from simple base-calling errors. To determine whether these modifications can be detected in RNA sequencing data, we developed a method that can not only locate these modifications transcriptome-wide with single nucleotide resolution, but can also differentiate between different classes of modifications. Using small RNA-seq data we were able to detect 92% of all known human tRNA modification sites that are predicted to affect RT activity. We also found that different modifications produce distinct patterns of cDNA sequence, allowing us to differentiate between two classes of adenosine and two classes of guanine modifications with 98% and 79% accuracy, respectively. To show the robustness of this method to sample preparation and sequencing methods, as well as to organismal diversity, we applied it to a publicly available yeast data set and achieved similar levels of accuracy. We also experimentally validated two novel and one known 3-methylcytosine (3mC) sites predicted by HAMR in human tRNAs. Researchers can now use our method to identify and characterize RNA modifications using only RNA-seq data, both retrospectively and when asking questions specifically about modified RNA.
  • Publication
    Changes in the Transcriptome of Human Astrocytes Accompanying Oxidative Stress-Induced Senescence
    (2016-08-01) Gregory, Brian D; Donahue, Greg; Crowe, Elizabeth P; Tuzer, Ferit; Leung, Yuk Y; Nativio, Raffaella; Gosai, Sager J; Berger, Shelley L; Cohen, Justin; Yetkin, Emre; Johnson, F. B; Torres, Claudio
    Aging is a major risk factor for many neurodegenerative disorders. A key feature of aging biology that may underlie these diseases is cellular senescence. Senescent cells accumulate in tissues with age, undergo widespread changes in gene expression, and typically demonstrate altered, pro-inflammatory profiles. Astrocyte senescence has been implicated in neurodegenerative disease, and to better understand senescence-associated changes in astrocytes, we investigated changes in their transcriptome using RNA sequencing. Senescence was induced in human fetal astrocytes by transient oxidative stress. Brain-expressed genes, including those involved in neuronal development and differentiation, were downregulated in senescent astrocytes. Remarkably, several genes indicative of astrocytic responses to injury were also downregulated, including glial fibrillary acidic protein and genes involved in the processing and presentation of antigens by major histocompatiblity complex class II proteins, while pro-inflammatory genes were upregulated. Overall, our findings suggest that senescence-related changes in the function of astrocytes may impact the pathogenesis of age-related brain disorders.
  • Publication
    CoRAL: Predicting Non-Coding RNAs from Small RNA-Sequencing Data
    (2013-08-01) Leung, Yuk Y; Ryvkin, Paul; Ungar, Lyle H; Gregory, Brian D; Wang, Li-San
    The surprising observation that virtually the entire human genome is transcribed means we know little about the function of many emerging classes of RNAs, except their astounding diversities. Traditional RNA function prediction methods rely on sequence or alignment information, which are limited in their abilities to classify the various collections of non-coding RNAs (ncRNAs). To address this, we developed Classification of RNAs by Analysis of Length (CoRAL), a machine learning-based approach for classification of RNA molecules. CoRAL uses biologically interpretable features including fragment length and cleavage specificity to distinguish between different ncRNA populations. We evaluated CoRAL using genome-wide small RNA sequencing data sets from four human tissue types and were able to classify six different types of RNAs with ∼80% cross-validation accuracy. Analysis by CoRAL revealed that microRNAs, small nucleolar and transposon-derived RNAs are highly discernible and consistent across all human tissue types assessed, whereas long intergenic ncRNAs, small cytoplasmic RNAs and small nuclear RNAs show less consistent patterns. The ability to reliably annotate loci across tissue types demonstrates the potential of CoRAL to characterize ncRNAs using small RNA sequencing data in less well-characterized organisms.
  • Publication
    DASHR: Database of Small Human Noncoding RNAs
    (2016-01-01) Leung, Yuk Y; Kuksa, Pavel P; Amlie-Wolf, Alexandre; Valladares, Otto; Ungar, Lyle H; Kannan, Sampath; Gregory, Brian D; Wang, Li-San
    Small non-coding RNAs (sncRNAs) are highly abundant RNAs, typically long, that act as key regulators of diverse cellular processes. Although thousands of sncRNA genes are known to exist in the human genome, no single database provides searchable, unified annotation, and expression information for full sncRNA transcripts and mature RNA products derived from these larger RNAs. Here, we present the Database of small human noncoding RNAs (DASHR) . DASHR contains the most comprehensive information to date on human sncRNA genes and mature sncRNA products. DASHR provides a simple user interface for researchers to view sequence and secondary structure, compare expression levels, and evidence of specific processing across all sncRNA genes and mature sncRNA products in various human tissues. DASHR annotation and expression data covers all major classes of sncRNAs including microRNAs (miRNAs), Piwi-interacting (piRNAs), small nuclear, nucleolar, cytoplasmic (sn-, sno-, scRNAs, respectively), transfer (tRNAs), and ribosomal RNAs (rRNAs). Currently, DASHR (v1.0) integrates 187 smRNA high-throughput sequencing (smRNA-seq) datasets with over 2.5 billion reads and annotation data from multiple public sources. DASHR contains annotations for ~48,000 human sncRNA genes and mature sncRNA products, 82% of which are expressed in one of more of the curated tissues. DASHR is available at http://lisanwanglab.org/DASHR.