AliBiMotif: Integrating alignment and biclustering to unravel Transcription Factor Binding Sites in DNA sequences

Joana P. Goncalves, Yves Moreau, Sara C. Madeira

Research output: Contribution to journalArticleScientificpeer-review

2 Citations (Scopus)

Abstract

Transcription Factors (TFs) control transcription by binding to specific sites in the promoter regions of the target genes, which can be modelled by structured motifs. In this paper we propose AliBiMotif, a method combining sequence alignment and a biclustering approach based on efficient string matching techniques using suffix trees to unravel approximately conserved sets of blocks (structured motifs) while straightforwardly disregarding non-conserved stretches in-between. The ability to ignore the width of non-conserved regions is a major advantage of the proposed method over other motif finders, as the lengths of the binding sites are usually easier to estimate than the separating distances.
Original languageEnglish
Pages (from-to)196-215
Number of pages20
JournalInternational Journal of Data Mining and Bioinformatics
Volume6
Issue number2
DOIs
Publication statusPublished - 2012
Externally publishedYes

Keywords

  • biclustering
  • sequence alignment
  • motif finding algorithm
  • structured motif identification
  • binding site
  • TF
  • transcription factor
  • TFBS
  • cis-regulatory module discovery
  • promoter region
  • motif finder
  • integrative mining

Cite this