Web supplement to
"A compendium of RNA-binding motifs for decoding gene regulation"

Debashish Ray[1]*, Hilal Kazan[2]*, Kate B. Cook[3]*, Matthew T. Weirauch[1]*#, Hamed S. Najafabadi[1,4]*, Xiao Li[3], Serge Guerrousov[1], Mihai Albu[1], Hong Zheng[1], Ally Yang[1], Hong Na[1], Manuel Irimia[1], Leah H. Matzat[5], Ryan K. Dale[5], Sarah A. Smith[6], Christopher A. Yarosh[6], Seth M. Kelly[7], Behnam Nabet[6], Desirea Mecenas[8], Weimin Li[9], Rakesh S. Laishram[9], Mei Qiao[10], Howard D. Lipshitz[3], Fabio Piano[8], Anita H. Corbett[7], Russ P. Carstens[6], Brendan J. Frey[4], Richard A. Anderson[9], Kristen W. Lynch[6], Luiz O.F. Penalva[10], Elissa P. Lei[5], Andrew G. Fraser[1,3], Benjamin J. Blencowe[1,3], Quaid D. Morris[1,2,3,4]**, and Timothy R. Hughes[1,3]**

[1] Donnelly Centre, [2] Department of Computer Science, [3] Department of Molecular Genetics, [4] Department of Electrical and Computer Engineering, University of Toronto
[5] Laboratory of Cellular and Developmental Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health
[6] Department of Medicine, Perelman School of Medicine at the University of Pennsylvania
[7] Department of Biochemistry, Emory University School of Medicine
[8] Department of Biology and Center for Genomics and Systems Biology, New York University
[9] Department of Pharmacology, School of Medicine and Public Health, University of Wisconsin-Madison
[10] Children's Cancer Research Institute, University of Texas Health Science Center at San Antonio
[#] Current address: Center for Autoimmune Genomics and Etiology (CAGE) and Divisions of Rheumatology and Biomedical Informatics, Cincinnati Children's Hospital Medical Center
[*] These authors made equal contributions to the manuscript.
[**] Communicating authors: and

Abstract

RNA-binding proteins are key regulators of gene expression, yet only a small fraction have been functionally characterized. Here we report a systematic analysis of the RNA motifs recognized by RNA-binding proteins, encompassing 205 distinct genes from 24 diverse eukaryotes. The sequence specificities of RNA-binding proteins display deep evolutionary conservation, and the recognition preferences for a large fraction of metazoan RNA-binding proteins can thus be inferred from their RNA-binding domain sequence. The motifs that we identify in vitro correlate well with in vivo RNA-binding data. Moreover, we can associate them with distinct functional roles in diverse types of post-transcriptional regulation, enabling new insights into the functions of RNA-binding proteins both in normal physiology and in human disease. These data provide an unprecedented overview of RNA-binding proteins and their targets, and constitute an invaluable resource for determining post-transcriptional regulatory mechanisms in eukaryotes.

RNA-binding protein library, experimental conditions, and pTH6838 vector map

Array data and information

7-mer data

Motifs

Literature data

RBP binding sites

RNAcompete data normalization code