TATA box is one of the most important transcription factor binding sites. But the exact sequences of TATA box are still not very clear yet. In this study, we conducted a dedicated analysis on the frequency distribution of TATA Box and its extension sequences on human promoters. Sixteen TATA elements derived from TATA Box motif, TATAWAWN, were classified into three distribution patterns: peak, bottom-peak and bottom. Fourteen TATA extension sequences (up to two base extensions) were predicted to be the new TATA Box elements because of their high motif factors, which indicate their statistical significance. Statistical analysis on the promoters of mouse, zebrafish and drosophila melanogaster verified seven of these elements. It was also observed that the distribution of TATA elements on the promoters of housekeeping genes are very similar with their distribution on the promoters of tissue specific genes in human.
This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Unless expressly stated otherwise, the copyright for items in Deakin Research Online is owned by the author, with all rights reserved.
Every reasonable effort has been made to ensure that permission has been obtained for items included in DRO.
If you believe that your rights have been infringed by this repository, please contact firstname.lastname@example.org.