Prediction of synergistic transcription factors by function conservationReport as inadecuate

Prediction of synergistic transcription factors by function conservation - Download this document for free, or read online. Document in PDF available to download.

Genome Biology

, 8:R257

First Online: 05 December 2007Received: 28 May 2007Revised: 19 October 2007Accepted: 05 December 2007


BackgroundPrevious methods employed for the identification of synergistic transcription factors TFs are based on either TF enrichment from co-regulated genes or phylogenetic footprinting. Despite the success of these methods, both have limitations.

ResultsWe propose a new strategy to identify synergistic TFs by function conservation. Rather than aligning the regulatory sequences from orthologous genes and then identifying conserved TF binding sites TFBSs in the alignment, we developed computational approaches to implement the novel strategy. These methods include combinatorial TFBS enrichment utilizing distance constraints followed by enrichment of overlapping orthologous genes from human and mouse, whose regulatory sequences contain the enriched TFBS combinations. Subsequently, integration of function conservation from both TFBS and overlapping orthologous genes was achieved by correlation analyses. These techniques have been used for genome-wide promoter analyses, which have led to the identification of 51 homotypic TF combinations; the validity of these approaches has been exemplified by both known TF-TF interactions and function coherence analyses. We further provide computational evidence that our novel methods were able to identify synergistic TFs to a much greater extent than phylogenetic footprinting.

ConclusionFunction conservation based on the concordance of combinatorial TFBS enrichment along with enrichment of overlapping orthologous genes has been proven to be a successful means for the identification of synergistic TFs. This approach avoids the limitations of phylogenetic footprinting as it does not depend upon sequence alignment. It utilizes existing gene annotation data, such as those available in GO, thus providing an alternative method for functional TF discovery and annotation.

AbbreviationsDBTSSDatabase of Transcriptional Start Sites

EELenhancer element locator

GOGene Ontology

LODlog odds ratio

PWMposition weight matrices

TFtranscription factor

TFBStranscription factor binding site.

Electronic supplementary materialThe online version of this article doi:10.1186-gb-2007-8-12-r257 contains supplementary material, which is available to authorized users.

Download fulltext PDF

Author: Zihua Hu - Boyu Hu - James F Collins



Related documents