Web supplement to
"Mapping and analysis of Caenorhabditis elegans transcription factor sequence specificities"

Kamesh Narasimhan1*, Samuel A. Lambert2,*, Ally W.H. Wang1*, Jeremy Riddell3, Sanie Mnaimneh1, Hong Zheng1, Mihai Albu1, Hamed S. Najafabadi1, John S. Reece-Hoyes4, Juan I. Fuxman Bass4, Albertha J.M. Walhout4, Matthew T. Weirauch5#, Timothy R. Hughes1,2,6#

1 Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Canada
2 Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
3 Department of Molecular and Cellular Physiology, Systems Biology and Physiology Program, University of Cincinnati, Cincinnati, OH, USA
4 Program in Systems Biology, University of Massachusetts Medical School, Worcester, MA, USA
5 Center for Autoimmune Genomics and Etiology (CAGE) and Divisions of Biomedical Informatics and Developmental Biology, Cincinnati Children’s Hospital Medical Center, Department of Pediatrics, University of Cincinnati, Cincinnati, OH, USA
6 Canadian Institutes For Advanced Research, Toronto, Canada

*these authors contributed equally

#To whom correspondance should be addressed: or

Abstract

Caenorhabditis elegans is a powerful model for studying gene regulation, as it has a compact genome and a wealth of genomic tools. However, identification of regulatory elements has been limited, as DNA-binding motifs are known for only ~10% of the estimated 763 sequence-specific transcription factors (TFs). To address this problem, we performed protein binding microarray experiments on representatives of canonical TF families in C. elegans, obtaining motifs for 129 TFs. Additionally, we predict motifs for 97 TFs that have DNA-binding domains similar to those already characterized, resulting in coverage of binding specificities for ~40% of all C. elegans TFs. These data highlight the diversification of binding motifs for the nuclear hormone receptor and C2H2 zinc finger families, and reveal unexpected diversity of motifs for T-box and DM families. Motif enrichment in promoters of functionally related genes is consistent with known biology, and also identifies putative regulatory roles for unstudied TFs.

All C. elegans data from Cis-BP (for reviewers). This includes motifs, logos, annotation information, and E-scores. Click to Download .zip file

Supplemental Files. Click to Download

Figure Source Data. Click to Download

Web Supplemental Files. Click to Download

PBM data are available in the Cis-BP database and Gene Expression Omnibus (GEO)