Frequency distribution of TATA box and extension sequences on human promoters
journal contribution
posted on 2006-12-12, 00:00authored byW Shi, Wanlei Zhou
TATA box is one of the most important transcription factor binding sites. But the exact sequences of TATA box are still not very clear yet. In this study, we conducted a dedicated analysis on the frequency distribution of TATA Box and its extension sequences on human promoters. Sixteen TATA elements derived from TATA Box motif, TATAWAWN, were classified into three distribution patterns: peak, bottom-peak and bottom. Fourteen TATA extension sequences (up to two base extensions) were predicted to be the new TATA Box elements because of their high motif factors, which indicate their statistical significance. Statistical analysis on the promoters of mouse, zebrafish and drosophila melanogaster verified seven of these elements. It was also observed that the distribution of TATA elements on the promoters of housekeeping genes are very similar with their distribution on the promoters of tissue specific genes in human.
This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Publication classification
C1 Refereed article in a scholarly journal; C Journal article