AGC box gene transcriptions: Difference between revisions

Revision as of 15:46, 20 October 2021

Editor-In-Chief: Henry A. Hoff

This is a digital photograph of Arabidopsis thaliana. Credit: Alberto Salguero Quiles en Getafe (Madrid), España.

"The GCC box, also referred to as the AGC box (10), GCC element (11), or AGCCGCC sequence (13), is an ethylene-responsive element found in the promoters of a large number of [pathogenesis related] PR genes whose expression is up-regulated following pathogen attack."^[1]

Consensus sequences

The AGC box has a consensus sequence as 3'-AGCCGCC-5' in the direction of transcription.^[2]

AGC

"AGC is a binding site for factors responding to pathogen attacks (Ohme-Takagi et al., 2000)".^[3]

Inverse copies

For "AGC, one copy in inverse orientation of the AGC box (AGCCGCC) [is] present as two copies (-1346 and -1314) in the ERE".^[2]

Enhancers

"Enhancer activity, ethylene responsiveness, and binding of nuclear proteins depend on the integrity of two copies of the AGC box, AGCCGCC, present in the promoters of several ethylene-responsive genes."^[2]

"The GLB enhancer contains two copies of the sequence AGCCGCC, which is conserved in several genes showing expression patterns similar to the GLB gene, as well as a sequence identical at 6 of 7 bp."^[4]

Glucanase promoters

"One common motif, AGCCGCC (AGC box), has been found to be present in nearly all chitinase and glucanase promoters so far analyzed (Ohme-Takagi and Shinshi 1990; Hart et al. 1993)."^[5]

DNA-binding proteins

"cDNA clones have been identified representing 4 novel DNA-binding proteins, called ethylene-responsive element binding proteins (EREBPs), that specifically bind the ERE AGC box".^[2]

Functional non-coding DNA

Functional "non-coding DNA is involved in the regulation of gene expression and thus in the evolution of novelties and adaptation between species [...] Functional non-coding sequences fall into two main categories: protein binding sites such as transcription factor binding sites (TFBSs), enhancers [such as the AGC box], and silencers, which are involved in the control of gene expression, and sequences that control chromatin organization such as insulators and matrix attachment regions".^[6]

Pathogenesis-related genes

"Genes of PR-1 and -5 proteins have now been identified in the genomes of various species of organisms, including humans and nematodes. PR proteins may contribute to the innate immunity of plants as well as to that of other organisms."^[7]

Ostreococcus

File:Ostreococcus RCC143 2.jpg

This is a photomicrograph of Ostreococcus. Credit: Wenche Eikrem and Jahn Throndsen, University of Oslo.

"Ocean-dwelling phytoplankton from the genus Ostreococcus emerge at the primitive root of the green plant lineage, dating back nearly 1.5 billion years. Today, these microscopic, free-living creatures, among the smallest eukaryotes ever characterized, barely a micron in diameter, contribute to a significant share of the world’s total photosynthetic activity. These “picophytoplankton”also exhibit great diversity that contrasts sharply with the dearth of ecological niches available to them in aquatic ecosystems. This observation, known as the “paradox of the plankton,” has long puzzled biologists."^[8]

"Plumbing the depths of molecular-level information of related species, genomics offers a novel glimpse into this paradox. The researchers compared the genomes of two Ostreococcus species, O. lucimarinus and O. tauri, and saw dramatic changes in genome structure and metabolic capabilities."^[8]

“We found several striking features of genome organization. Overlapping genes conserved across the species may enable them to cross-regulate their expression, while species-specific chromosomes with horizontally transferred genes can account for changes in the cell surface to adapt to different ecological niches.”^[8]

“This work builds on the community’s emerging understanding about how carbon fixation is carried out by picoplankton.”^[9]

“From an applied perspective, we are learning some of the tricks nature has employed to ‘engineer’ an extremely small eukaryote to thrive in nature–which may well find applications in bioengineering. It was particularly interesting to see the predicted use of selenium-containing enzymes as one of the tricks to maintain such tiny cells. There are many mechanisms that can account for species formation in photosynthetic phytoplankton, and this is just one of the major pieces to this long-standing puzzle for biologists.”^[9]

“Assimilation of atmospheric CO₂ by marine phytoplankton is a global-scale process that is responsible for about half of the biosphere net primary production. This active absorption of hundreds of millions of tons of carbon per day is essential for maintaining the control of the planet’s climate by counteracting greenhouse effects due to human activities. Clearly, this storage capacity is affected by changes in the photosynthetic efficiency of the algae, which in turn is linked to the environmental conditions experienced by these organisms in their environment.”^[10]

Nicotiana

The osmotin-like protein (OLP) "has no intron and ... its promoter region contains two AGCCGCC sequences that are conserved in most basic PR-protein genes."^[11]

The "AGCCGCC sequence(s) is a DNA element(s) responsive to ethylene. An EREBP2 protein, isolated as one of the proteins binding the AGCCGCC sequence of the tobacco rβ-1,3-glucanase gene, also was found to bind to the AGCCGCC sequence(s) of OLP gene. These results suggest that the ethylene-induced expression of OLP is regulated by trans-acting factor(s) common to basic PR-proteins."^[11]

"AGCCGCC sequences were found at -46 to -52 and -161 to -167. There was no repeated sequence (-938 to -903)".^[11]

"Expression of the osmotin gene is similar to that of the OLP gene. The osmotin gene also has several AGCCGCC sequences; a complete AGCCGCC (from -50 to -44), a slightly modified CGCCGCC (from -144 to -138), and an AGCCGCC sequence in reverse orientation (from -162 to -156)."^[11]

Arabidopsis

File:Arabidopsis thaliana inflorescencias.jpg

This is an image of the flowers of Arabidopsi thaliana, a specimen of about 15 cm, in the first week of March 2004. Credit: Alberto Salguero Quiles in Getafe (Madrid), Spain.

In Arabidopsis thaliana "an ethylene-inducible, GCC box DNA-binding protein interacts with an ocs element binding protein".^[1]

"In yeast and mammalian systems, it is well established that transcriptional down-regulation by DNA-binding repressors involves core histone deacetylation, mediated by their interaction within a complex containing histone deacetylase (e.g. HDA1), as well as various proteins (e.g. SIN3, SAP18, SAP30, and RhAp46). [An] Arabidopsis thaliana gene related in sequence to SAP18, designated AtSAP18, functions in transcription regulation in plants subjected to salt stress."^[12]

Evidence has been provided "that SAP18 and HDA1 function as transcriptional repressors. [Further] they associate with Ethylene-Responsive Element binding Factors (ERFs) to create a hormone-sensitive multimeric repressor complex under conditions of environmental stress."^[12]

"At the molecular level, the actions of ethylene upon gene expression involve Ethylene Responsive element binding Factors (ERFs), which display GCC box-specific binding activities in Arabidopsis (Ohme-Takagi and Shinshi, 1995). ERFs contain a highly conserved DNA binding domain (the EFR domain) consisting of 58-59 amino acids (Ohme-Takagi and Shinshi, 1995), which binds with high affinity to the GCC box (Hao et al., 1998)."^[12]

Peaches

"An AGC box (AGCCGCC) was found [from peach (Prunus persica L. Batsch cv. Loring)] between 886 and 892 bp upstream of the translation start site which has been shown in other ethylene-responsive PR genes to be a binding site for ethylene-responsive binding factor proteins (ERF proteins) (Ohme-Takagi and Shinshi, 1995; Sato et al., 1996; Jia and Martin, 1999; Fujimoto et al., 2000)."^[3]

"The peach ACO1 does have an AGC box that has been found to bind ethylene responsive elements in response to pathogen infections (Ohme-Takagi et al., 2000; Rushton et al., 2002). Only the apple ACO1 also contains this sequence. In addition, both PpACO1 and the apple ACO1 have a MADS box transcription factor binding site (CarG) (Tilly et al., 1998), but none of the other ACO genes do. "^[3]

E2F4

File:Protein E2F4 PDB 1cf7.png

Structure of the E2F4 protein shown is based on PyMOL rendering of PDB 1cf7. Credit: Emw.

Gene ID: 1874 - "The protein encoded by this gene is a member of the E2F family of transcription factors. The E2F family plays a crucial role in the control of cell cycle and action of tumor suppressor proteins and is also a target of the transforming proteins of small DNA tumor viruses. The E2F proteins contain several evolutionally conserved domains found in most members of the family. These domains include a DNA binding domain, a dimerization domain which determines interaction with the differentiation regulated transcription factor proteins (DP), a transactivation domain enriched in acidic amino acids, and a tumor suppressor protein association domain which is embedded within the transactivation domain. This protein binds to all three of the tumor suppressor proteins pRB, p107 and p130, but with higher affinity to the last two. It plays an important role in the suppression of proliferation-associated genes, and its gene mutation and increased expression may be associated with human cancer."^[13]

"The AGC triplet repeat in the coding region of the E2F-4 gene, a member of the family, has been reported to be mutated in colorectal cancers with a microsatellite instability (MSI) phenotype. We found a wider range variation of the repeat number in DNAs from tumors, the corresponding normal mucosa, and healthy individuals. A total of 5 repeat variants, ranging from 8 to 17 AGC repeats, was detected in 6 (9.7%) of the 62 healthy individuals and 8 (8.9%) of the 90 normal DNAs of the patients. The wild-type 13 repeat was present in all of these individuals. The variation of the AGC repeat number may be a polymorphism. Further, loss of heterozygosity (LOH) at the E2F-4 locus in the tumor tissues of 2 (25%) of the 8 informative cases was detected."^[14]

Hypotheses

An AGC box occurs in the human genome.

AGC box samplings

For the Basic programs (starting with SuccessablesAGC.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), including extending the number of nts from 958 to 4445, the programs are, are looking for, and found:

negative strand in the negative direction is SuccessablesAGC--.bas, looking for 3'-AGCCGCC-5', 0,
negative strand in the positive direction is SuccessablesAGC-+.bas, looking for 3'-AGCCGCC-5', 0,
positive strand in the negative direction is SuccessablesAGC+-.bas, looking for 3'-AGCCGCC-5', 0,
positive strand in the positive direction is SuccessablesAGC++.bas, looking for 3'-AGCCGCC-5', 0,
complement, negative strand, negative direction is SuccessablesAGCc--.bas, looking for 3'-TCGGCGG-5', 0,
complement, negative strand, positive direction is SuccessablesAGCc-+.bas, looking for 3'-TCGGCGG-5', 0,
complement, positive strand, negative direction is SuccessablesAGCc+-.bas, looking for 3'-TCGGCGG-5', 0,
complement, positive strand, negative direction is SuccessablesAGCc++.bas, looking for 3'-TCGGCGG-5', 0,
inverse complement, negative strand, negative direction is SuccessablesAGCci--.bas, looking for 3'-GGCGGCT-5', 0,
inverse complement, negative strand, positive direction is SuccessablesAGCci-+.bas, looking for 3'-GGCGGCT-5', 0,
inverse complement, positive strand, negative direction is SuccessablesAGCci+-.bas, looking for 3'-GGCGGCT-5', 1, 3'-GGCGGCT-5', 1754,
inverse complement, positive strand, positive direction is SuccessablesAGCci++.bas, looking for 3'-GGCGGCT-5', 0,
inverse, negative strand, negative direction, is SuccessablesAGCi--.bas, looking for 3'-CCGCCGA-5', 1, 3'-CCGCCGA-5', 1754,
inverse, negative strand, positive direction, is SuccessablesAGCi-+.bas, looking for 3'-CCGCCGA-5', 0,
inverse, positive strand, negative direction, is SuccessablesAGCi+-.bas, looking for 3'-CCGCCGA-5', 0,
inverse, positive strand, positive direction, is SuccessablesAGCi++.bas, looking for 3'-CCGCCGA-5', 0.

GCC box samplings

Copying 5'-GCCGCC-3' in "⌘F" yields one between ZSCAN22 and A1BG and two between ZNF497 and A1BG as can be found by the computer programs.

For the Basic programs (starting with SuccessablesGCC.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), including extending the number of nts from 958 to 4445, the programs are, are looking for, and found:

negative strand in the negative direction, looking for GCCGCC, 1, GCCGCC at 2727.
positive strand in the negative direction, looking for GCCGCC, 0.
positive strand in the positive direction, looking for GCCGCC, 1, GCCGCC at 356.
negative strand in the positive direction, looking for GCCGCC, 2, GCCGCC at 1757, GCCGCC at 904.
complement, negative strand, negative direction, looking for CGGCGG, 0.
complement, positive strand, negative direction, looking for CGGCGG, 1, CGGCGG at 2727.
complement, positive strand, positive direction, looking for CGGCGG, 2, GCCGCC at 1757, GCCGCC at 904.
complement, negative strand, positive direction, looking for CGGCGG, 1, CGGCGG at 356.
inverse complement, negative strand, negative direction, looking for GGCGGC, 0.
inverse complement, positive strand, negative direction, looking for GGCGGC, 1, GGCGGC at 1753.
inverse complement, positive strand, positive direction, looking for GGCGGC, 0.
inverse complement, negative strand, positive direction, looking for GGCGGC, 3, GGCGGC at 1902, GGCGGC at 1794, GGCGGC at 354.
inverse, negative strand, negative direction, looking for CCGCCG, 1, CCGCCG, 1753.
inverse, positive strand, negative direction, looking for CCGCCG, 0.
inverse, negative strand, positive direction, looking for CCGCCG, 0.
inverse, positive strand, positive direction, looking for CCGCCG, 3, CCGCCG at 1902, CCGCCG at 1794, CCGCCG at 354.

GCC proximal promoters

Negative strand, negative direction: GCCGCC at 2727.

GCC distal promoters

Positive strand, negative direction: GGCGGCT at 1753.

Negative strand, positive direction: GGCGGC at 1902, GGCGGC at 1794, GCCGCC at 1757, GCCGCC at 904, GGCGGC at 354.

Positive strand, positive direction: GCCGCC at 356.

GCC random dataset samplings

GCCr0: 3, GCCGCC at 3407, GCCGCC at 2380, GCCGCC at 1384.
GCCr1: 0.
GCCr2: 3, GCCGCC at 3586, GCCGCC at 2598, GCCGCC at 1966.
GCCr3: 3, GCCGCC at 4138, GCCGCC at 2792, GCCGCC at 1452.
GCCr4: 4, GCCGCC at 1092, GCCGCC at 1089, GCCGCC at 1022, GCCGCC at 80.
GCCr5: 1, GCCGCC at 4353.
GCCr6: 0.
GCCr7: 1, GCCGCC at 1770.
GCCr8: 2, GCCGCC at 2518, GCCGCC at 2473.
GCCr9: 3, GCCGCC at 2666, GCCGCC at 2449, GCCGCC at 1415.
RDr0ci: 0.
RDr1ci: 0.
RDr2ci: 0.
RDr3ci: 0.
RDr4ci: 0.
RDr5ci: 0.
RDr6ci: 0.
RDr7ci: 0.
RDr8ci: 0.
RDr9ci: 0.

GCCr UTRs

GCCr0: GCCGCC at 3407.
GCCr2: GCCGCC at 3586.

GCCr core promoters

GCCr5: GCCGCC at 4353.

GCCr proximal promoters

GCCr2: GCCGCC at 2598.

GCCr3: GCCGCC at 4138.

GCCr distal promoters

GCCr0: GCCGCC at 2380, GCCGCC at 1384.
GCCr2: GCCGCC at 1966.
GCCr4: GCCGCC at 1092, GCCGCC at 1089, GCCGCC at 1022, GCCGCC at 80.
GCCr8: 2GCCGCC at 2518, GCCGCC at 2473.

GCCr3: GCCGCC at 2792, GCCGCC at 1452.
GCCr7: GCCGCC at 1770.
GCCr9: GCCGCC at 2666, GCCGCC at 2449, GCCGCC at 1415.

Acknowledgements

The content on this page was first contributed by: Henry A. Hoff.

Initial content for this page in some instances came from Wikiversity.

References

↑ ^1.0 ^1.1 Michael Büttner and Karam B. Singh (May 27, 1997). "Arabidopsis thaliana ethylene-responsive element binding protein (AtEBP), an ethylene-inducible, GCC box DNA-binding protein interacts with an ocs element binding protein". Proceedings of the National Academy of Sciences of the United States of America. 94 (11): 5961–6. Retrieved 2014-05-02.
↑ ^2.0 ^2.1 ^2.2 ^2.3 Gerhard Leubner-Metzger, Luciana Petruzzelli, Rosa Waldvogel, Regina Vögeli-Lange, and Frederick Meins, Jr. (November 1998). "Ethylene-responsive element binding protein (EREBP) expression and the transcriptional regulation of class I β-1, 3-glucanase during tobacco seed germination". Plant Molecular Biology. 38 (5): 785–95. doi:10.1023/A:1006040425383. Retrieved 2014-05-02.
↑ ^3.0 ^3.1 ^3.2 Hangsik Moon and Ann M. Callahan (2004). "Developmental regulation of peach ACC oxidase promoter–GUS fusions in transgenic tomato fruits". Journal of Experimental Botany. 55 (402): 1519–28. doi:10.1093/jxb/erh162. Retrieved 2014-05-07.
↑ CM Hart, F. Nagy, and F. Meins Jr. (January 1993). "A 61 bp enhancer element of the tobacco beta-1,3-glucanase B gene interacts with one or more regulated nuclear proteins". Plant Molecular Biology. 21 (1): 121–31. PMID 8425042. Retrieved 2014-05-02.
↑ Imre E. Somssich (1994). L. Nover, ed. Regulatory Elements Governing Pathogenesis-Related (PR) Gene Expression, In: Plant Promoters and Transcription Factors. 20. Berlin: Springer-Verlag. pp. 163–79. doi:10.1007/978-3-540-48037-2_7. Retrieved 2014-05-07.
↑ Gwenael Piganeau, Klaas Vandepoele, Sébastien Gourbière, Yves Van de Peer, and Hervé Moreau (September 2009). "Unraveling cis-Regulatory Elements in the Genome of the Smallest Photosynthetic Eukaryote: Phylogenetic Footprinting in Ostreococcus". Journal of Molecular Evolution. 69 (3): 249–59. doi:10.1007/s00239-009-927I-0. Retrieved 2014-05-02.
↑ Sakihito Kitajima and Fumihiko Sato (1999). "Plant pathogenesis-related proteins: molecular mechanisms of gene expression and protein function". Journal of Biochemistry. 125 (1): 1–8. Retrieved 2016-01-07.
↑ ^8.0 ^8.1 ^8.2 Igor Grigoriev (April 30, 2007). Puzzling Plankton Yield Secrets to Role in Evolution/Global Photosynthesis. Washington, DC USA: Department of Energy. Retrieved 2014-05-06.
↑ ^9.0 ^9.1 Brian Palenik (April 30, 2007). Puzzling Plankton Yield Secrets to Role in Evolution/Global Photosynthesis. Washington, DC USA: Department of Energy. Retrieved 2014-05-06.
↑ Hervé Moreau (April 30, 2007). Puzzling Plankton Yield Secrets to Role in Evolution/Global Photosynthesis. Washington, DC USA: Department of Energy. Retrieved 2014-05-06.
↑ ^11.0 ^11.1 ^11.2 ^11.3 Fumihiko Sato, Sakihito Kitajima and Tomotsugu Koyama (1996). "Ethylene-Induced Gene Expression of Osmotin-Like Protein, a Neutral Isoform of Tobacco PR-5, is Mediated by the AGCCGCC eft-Sequence". Plant and Cell Physiology. 37 (3): 249–55. Retrieved 2014-05-07.
↑ ^12.0 ^12.1 ^12.2 Chun-Peng Song and David W. Galbraith (January 2006). "AtSAP18, an orthologue of human SAP18, is involved in the regulation of salt stress and mediates transcriptional repression in Arabidopsis". Plant Molecular Biology. 60 (2): 241–57. doi:10.1007/s11103-005-3880-9. Retrieved 2016-01-07.
↑ RefSeqJuly2008 (25 December 2016). E2F4 E2F transcription factor 4 [ Homo sapiens (human) ]. U.S. National Library of Medicine, 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information. Retrieved 2017-01-08.
↑ X. Zhong, H. Hemmi, J. Koike, K. Tsujita, H. Shimatake (March 2000). "Various AGC repeat numbers in the coding region of the human transcription factor gene E2F-4". Human Mutation. 15 (3): 296–7. doi:10.1002/(SICI)1098-1004(200003)15:3<296::AID-HUMU18>3.0.CO;2-X. PMID 10679953. Retrieved 2017-01-08.

External links

[Buttner-1] 1.0 ^1.1 Michael Büttner and Karam B. Singh (May 27, 1997). "Arabidopsis thaliana ethylene-responsive element binding protein (AtEBP), an ethylene-inducible, GCC box DNA-binding protein interacts with an ocs element binding protein". Proceedings of the National Academy of Sciences of the United States of America. 94 (11): 5961–6. Retrieved 2014-05-02.

[Metzger-2] 2.0 ^2.1 ^2.2 ^2.3 Gerhard Leubner-Metzger, Luciana Petruzzelli, Rosa Waldvogel, Regina Vögeli-Lange, and Frederick Meins, Jr. (November 1998). "Ethylene-responsive element binding protein (EREBP) expression and the transcriptional regulation of class I β-1, 3-glucanase during tobacco seed germination". Plant Molecular Biology. 38 (5): 785–95. doi:10.1023/A:1006040425383. Retrieved 2014-05-02.

[Moon-3] 3.0 ^3.1 ^3.2 Hangsik Moon and Ann M. Callahan (2004). "Developmental regulation of peach ACC oxidase promoter–GUS fusions in transgenic tomato fruits". Journal of Experimental Botany. 55 (402): 1519–28. doi:10.1093/jxb/erh162. Retrieved 2014-05-07.

[Hart-4] CM Hart, F. Nagy, and F. Meins Jr. (January 1993). "A 61 bp enhancer element of the tobacco beta-1,3-glucanase B gene interacts with one or more regulated nuclear proteins". Plant Molecular Biology. 21 (1): 121–31. PMID 8425042. Retrieved 2014-05-02.

[Somssich-5] Imre E. Somssich (1994). L. Nover, ed. Regulatory Elements Governing Pathogenesis-Related (PR) Gene Expression, In: Plant Promoters and Transcription Factors. 20. Berlin: Springer-Verlag. pp. 163–79. doi:10.1007/978-3-540-48037-2_7. Retrieved 2014-05-07.

[Piganeau-6] Gwenael Piganeau, Klaas Vandepoele, Sébastien Gourbière, Yves Van de Peer, and Hervé Moreau (September 2009). "Unraveling cis-Regulatory Elements in the Genome of the Smallest Photosynthetic Eukaryote: Phylogenetic Footprinting in Ostreococcus". Journal of Molecular Evolution. 69 (3): 249–59. doi:10.1007/s00239-009-927I-0. Retrieved 2014-05-02.

[Kitajima-7] Sakihito Kitajima and Fumihiko Sato (1999). "Plant pathogenesis-related proteins: molecular mechanisms of gene expression and protein function". Journal of Biochemistry. 125 (1): 1–8. Retrieved 2016-01-07.

[Grigoriev-8] 8.0 ^8.1 ^8.2 Igor Grigoriev (April 30, 2007). Puzzling Plankton Yield Secrets to Role in Evolution/Global Photosynthesis. Washington, DC USA: Department of Energy. Retrieved 2014-05-06.

[Palenik-9] 9.0 ^9.1 Brian Palenik (April 30, 2007). Puzzling Plankton Yield Secrets to Role in Evolution/Global Photosynthesis. Washington, DC USA: Department of Energy. Retrieved 2014-05-06.

[Moreau-10] Hervé Moreau (April 30, 2007). Puzzling Plankton Yield Secrets to Role in Evolution/Global Photosynthesis. Washington, DC USA: Department of Energy. Retrieved 2014-05-06.

[Sato-11] 11.0 ^11.1 ^11.2 ^11.3 Fumihiko Sato, Sakihito Kitajima and Tomotsugu Koyama (1996). "Ethylene-Induced Gene Expression of Osmotin-Like Protein, a Neutral Isoform of Tobacco PR-5, is Mediated by the AGCCGCC eft-Sequence". Plant and Cell Physiology. 37 (3): 249–55. Retrieved 2014-05-07.

[Song-12] 12.0 ^12.1 ^12.2 Chun-Peng Song and David W. Galbraith (January 2006). "AtSAP18, an orthologue of human SAP18, is involved in the regulation of salt stress and mediates transcriptional repression in Arabidopsis". Plant Molecular Biology. 60 (2): 241–57. doi:10.1007/s11103-005-3880-9. Retrieved 2016-01-07.

[RefSeqJuly2008-13] RefSeqJuly2008 (25 December 2016). E2F4 E2F transcription factor 4 [ Homo sapiens (human) ]. U.S. National Library of Medicine, 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information. Retrieved 2017-01-08.

[Zhong-14] X. Zhong, H. Hemmi, J. Koike, K. Tsujita, H. Shimatake (March 2000). "Various AGC repeat numbers in the coding region of the human transcription factor gene E2F-4". Human Mutation. 15 (3): 296–7. doi:10.1002/(SICI)1098-1004(200003)15:3<296::AID-HUMU18>3.0.CO;2-X. PMID 10679953. Retrieved 2017-01-08.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

@@ Line 303: / Line 303: @@
 Positive strand, negative direction: GGCGGCT at 1753.
-Negative strand, positive direction: GCCGCC at 1757, GCCGCC at 904.
-Positive strand, positive direction: CCGCCG at 1902, CCGCCG at 1794, GCCGCC at 356, CCGCCG at 354.
+Negative strand, positive direction: GGCGGC at 1902, GGCGGC at 1794, GCCGCC at 1757, GCCGCC at 904, GGCGGC at 354.
+Positive strand, positive direction: GCCGCC at 356.
 ==GCC random dataset samplings==

v t e Gene project
Articles	Complex locus A1BG and ZNF497 Grainyhead-like Genes in Regulating Development and Genetic Defects Lysenin Lysine: biosynthesis, catabolism and roles RIG-I like receptors ShK toxin: history, structure and therapeutic applications for autoimmune diseases
Categories	Biochemistry Biology Genetics Medicine
Laboratories	AGC box gene transcription laboratory ATA box gene transcription laboratory C and D boxes gene transcription laboratory CArG box gene transcription laboratory CGCG box gene transcription laboratory CRE box gene transcription laboratory E2 box gene transcription laboratory Enhancer box gene transcription laboratory Factor II B recognition element gene transcription laboratory GA responsive complex gene transcription laboratory GC box gene transcription laboratory H box gene transcription laboratory HNF6 gene transcription laboratory HY box gene transcription laboratory Initiator element gene transcription laboratory Metal responsive element gene transcription laboratory STAT5 gene transcription laboratory TATA box gene transcription laboratory
Lessons	A1BG gene transcription programming Amino Acids Enzymes Enzyme catalysis Enzyme structure and function Eukaryotic transcription Gene regulation in prokaryotes
Lists	Biomolecules
Modules	Module:Infobox gene Module:InfoboxImage
Original research	Gene project
Projects	Biochemistry Gene project History of biology Molecular Biology Molecular evolution Topobiology
Proposals	Gene expressions/Cost sharing and research products Gene expressions in human exploration beyond low earth orbits Gene expressions/Project narrative
Resources	5' cap Acid-base homeostasis Actins Adenines Allergies Alpha-1-B glycoprotein Ammonoids Original research/Amino acids Amphiphiles Anabolism Animal physiology Anomeric carbons Autocatalytic reactions Autonomously replicating sequences Base pairs Biology Biodegradation Biosynthesis Biosynthesis of a human protein Biosynthesis of amino acids Blood Bodily fluids Botany Brain box Calcium signaling Capping enzymes Carbohydrates Carcinoembryonic antigen gene family Catabolism Catalysis Cells Cell signaling Centrosomes Chromatins Chromoboxes Coactivators Corepressors Cofactors Consensus sequences Cytogenetics Cytokinesis Cytosines Deoxyribonucleic acids Digestion Disaccharides Dispersed promoters Dominant group metagenomes Downregulations Endochondral ossification Enzyme inhibitors Enzymology Epigenetics Epigenomes Esters Esterification Eukaryotes Eukaryotic initiation factors Evolution Exaptation Excision repair cross-complementing Factors Fatty acids Ferredoxin Foldings Foods Forkhead boxes Functional groups Genome surveillance complexes Genealogy Genes Genetics Gene transcriptions Genomes Genomics Glycoproteins Glycosides Glycosidic bond Guanines Hair color gene expressions Helicases Heredity History of agriculture Greek and Roman histories of biology Homeostasis Human amino acid synthesis Human DNAs Human genes Human RNA Human teeth Human temperatures Immunoglobulin domain cl11960 Immunoglobulin domain genes Immunoglobulin like domain cd05751 Immunoglobulin like domain pfam13895 Immunoglobulin like domain smart00410 Immunoglobulin receptor superfamily genes Immunoglobulin supergene family Inhibitory peptides Insulators Intranuclear localizations Introduction to Cell Biology Introduction to polymer chemistry Lamarckism Leucine zipper Localization Major histocompatibility complex class I gene family Major histocompatibility complex class II gene family Major histocompatibility complex class III gene family Mammalogy Mathematical molecular biology Mediator complexes Medicine Melanocytes Membranes Metagenomes Molecular biology Molecular genetics Nitrogen metabolism Nucleotide Synthesis Origin of life Orthomolecular medicine Osteoarthritis Paleanthropology Paleontology Phosphate biochemistry Phosphate budgets Phosphate reactions Post translational modifications Principles of biosynthesis Protein isoform Proteins Proteomics Regulations Ribonucleotides Ribosomes RNA polymerases RNA polymerase II holoenzymes RNA polymerase II holoenzyme complexes RNA translations Salinity Stroke management Teeth TFIIA Transports Vascular endothelial growth factor A What is a human? Upregulations Upstream and downstream ZSCAN22 Zoology
Transcription resources	A1BG gene transcription core promoters A1BG gene transcriptions A1BG regulatory elements and regions A1BG response element gene transcriptions A1BG response element negative results A1BG response element positive results ABA-response element gene transcriptions Abf1 regulatory factor gene transcriptions A box gene transcriptions Abscisic acid-responsive elements ACGT-containing element gene transcriptions Activating protein gene transcriptions Activating transcription factor gene transcriptions Adenylate–uridylate rich element gene transcriptions Adr1p gene transcriptions Aft1p gene transcriptions AGC box gene transcriptions AGCE gene transcriptions Alpha-amylase conserved element gene transcriptions Amino acid response element gene transcriptions AARE-like Androgen response element gene transcriptions Angiotensinogen core promoter element gene transcriptions Antioxidant-electrophile responsive element gene transcriptions ATA box gene transcriptions Auxin response factor gene transcriptions B box gene transcriptions Bioinformatics tool gene transcriptions Box gene transcriptions B recognition element upstream Bridge gene transcriptions CAAT box gene transcriptions CACA elements CadC binding domain gene transcriptions Calcineurin-responsive transcription factor gene transcriptions Calcium-response element gene transcriptions cAMP response element gene transcriptions C and D boxes gene transcriptions Carbohydrate response element gene transcriptions Carbon source-responsive element gene transcriptions Carcinoembryonic antigen gene family CARE gene transcriptions CArG box gene transcriptions CAT box gene transcriptions Cat8p gene transcriptions Cbf1 regulatory factor gene transcriptions C box gene transcriptions CCCTC-binding factor gene transcriptions C-EBP box gene transcriptions Cell-cycle box gene transcriptions Cell cycle regulation gene transcriptions CENP-B box gene transcriptions CGCG box gene transcriptions Circadian control element gene transcriptions "Class C" (Leal) samplings Cold-responsive element gene transcriptions Complement copy gene transcriptions Complement-inverse copy gene transcriptions Consensus sequence gene transcriptions Copper response element gene transcriptions Core promoter gene transcriptions Coupling element gene transcriptions CRE box gene transcriptions Cytokinin response regulator gene transcriptions Cytoplasmic polyadenylation element gene transcriptions DAF-16-associated element gene transcriptions DAF-16 binding element gene transcriptions D box gene transcriptions Defense and stress-responsive element gene transcriptions Degenerate nucleotide gene transcriptions Dispersed promoter gene transcriptions Distal promoter gene transcriptions DNA melting gene transcriptions DNA damage response element gene transcriptions DNA replication-related element gene transcriptions Downstream core element gene transcriptions Downstream promoter element gene transcriptions Downstream TFIIB recognition element gene transcriptions DREB box gene transcriptions E2 box gene transcriptions EIF4E basal element gene transcriptions EIN3 binding site gene transcriptions Enhancer activity copy gene transcriptions E box gene transcriptions Element gene transcriptions Endoplasmic reticulum stress response element gene transcriptions Endosperm expression gene transcriptions Enhancer box gene transcriptions Estrogen response element gene transcriptions Ethylene responsive element gene transcriptions Families of TATA box genes F box gene transcriptions Focused promoter gene transcriptions Forkhead box gene transcriptions Fur box gene transcriptions GAAC element gene transcriptions Gal4p gene transcriptions Γ-interferon activated sequence gene transcriptions GARE gene transcriptions GA responsive complex gene transcriptions GATA gene transcriptions G box gene transcriptions GC box gene transcriptions GCC box gene transcriptions Gcn4p gene transcriptions Gcr1p gene transcriptions Gene expressions General factor II D gene transcriptions General regulatory factors General transcription factor II A gene transcriptions General transcription factor II B gene transcriptions General transcription factor II D gene transcriptions General transcription factor II F gene transcriptions General transcription factor II H gene transcriptions General transcription factor gene transcriptions Gene transcriptions GGC triplet gene transcriptions Gibberellin responsive element gene transcriptions GLM box gene transcriptions Glucocorticoid response element gene transcriptions Grainy head gene transcriptions Grainy head transcription factor gene transcriptions Growth hormone response element gene transcriptions GT boxes Hac1p gene transcriptions Hair color gene expressions H and ACA box gene transcriptions H box gene transcriptions Heat-responsive element gene transcriptions Heat shock elements Hex sequence gene transcriptions HMG box gene transcriptions HNF gene transcriptions Homeobox gene transcriptions Hsf1p gene transcriptions HY box gene transcriptions Hybrid C, A boxes Hybrid C, G boxes Hybrid C, T boxes Hypoxia-inducible factor gene transcriptions Hypoxia response elements I box gene transcriptions Immunoglobulin like domain containing family Initiator element gene transcriptions Initiator-like element gene transcriptions Inositol/choline-responsive elements Interaction gene transcriptions Interferon regulatory factors Inverse copy gene transcriptions Jasmonic acid-responsive element gene transcriptions K-boxes Kozak sequence gene transcriptions Kruppel-associated box gene transcriptions Krüppel-like factor gene transcriptions L box gene transcriptions Leu3 gene transcriptions M35 box gene transcriptions MADS box gene transcriptions Maf recognition element gene transcriptions M box gene transcriptions Mcm1 regulatory factor gene transcriptions Met3 gene transcriptions Met31p box gene transcriptions Metal responsive element gene transcriptions Middle sporulation element gene transcriptions Mig1p gene transcriptions Model samplings Motif ten element gene transcriptions Msn2,4p gene transcriptions Musashi binding element gene transcriptions MYB recognition element gene transcriptions Myelocytomatosis transcription factor gene transcriptions Myocyte enhancer factor gene transcriptions N-boxes Nanos/Pumilio response element Ndt80p gene transcriptions Nuclear factor 1 Nuclear factor 𝜿B Nuclear factor gene transcriptions Nuclear factor of activated T cell gene transcriptions (NFAT) Nuclear factor Y gene transcriptions Nutrient-sensing response element gene transcriptions Oaf1p gene transcriptions ORE1 binding site gene transcriptions p53 response element gene transcriptions P63 DNA-binding site gene transcriptions P box gene transcriptions Pdr1,3p gene transcriptions Peroxisome proliferator hormone response element gene transcriptions Phosphate starvation-response transcription factor gene transcriptions Pollen1 element gene transcriptions Polycomb response element gene transcriptions Preinitiation complex Preinitiation complex gene transcriptions Pribnow box gene transcriptions Prolamin box gene transcriptions Promoter gene transcriptions Proximal promoter gene transcriptions Promoter occurrence gene transcriptions Pyrimidine box gene transcriptions Q element gene transcriptions Rap1 regulatory factor gene transcriptions Reb1 general regulatory factor gene transcriptions Retinoblastoma control element gene transcriptions Retinoic acid response element gene transcriptions Rgt1p gene transcriptions Rlm1p gene transcriptions RNA polymerase II gene transcriptions RNA polymerase II holoenzyme complex Root specific element gene transcriptions ROR-response element gene transcriptions Rox1p gene transcriptions Rpn4p gene transcriptions R response element gene transcriptions SARE gene transcriptions Seed-specific element gene transcriptions Serum response element gene transcriptions Servenius sequence gene transcriptions Shoot specific element gene transcriptions Sip4p gene transcriptions Smp1p gene transcriptions Sp1 gene transcriptions Spaceflight gene expressions Specificity protein gene transcriptions STAT gene transcriptions Ste12p gene transcriptions Sterol response element gene transcriptions Sucrose box gene transcriptions Synaptic Activity-Responsive Elements TACTAAC box gene transcriptions TAGteam gene transcriptions Tapetum box gene transcriptions TATA binding protein associated factor gene transcriptions TATA binding protein gene transcriptions TATA box actin/cytoskeleton/contractile family TATA box albumin family TATA box aldolase family TATA box alkaline phosphatase family TATA box annexin family TATA box cytochrome superfamily TATA box gene transcriptions TATA box heat shock family TATA box histone family TATA box human genes TATA box keratin family TATA box lipase family TATA box peptidase family TATA box platelet-derived growth factor family TATA box selenium-binding family TATA box serine protease family TATA box serpin superfamily TATA box transforming growth factor superfamily TATA box trefoil family TATA box tumor necrosis factor superfamily TATA box zinc metalloenzyme family TAT box gene transcriptions TATC box gene transcriptions Tbf1 regulatory factor gene transcriptions T box gene transcriptions TCCACCATA element gene transcriptions TC element gene transcriptions TCT gene transcriptions TEA consensus sequence gene transcriptions Tec1p gene transcriptions Telomeric repeat DNA-binding factor gene transcriptions Tetradecanoylphorbol-13-acetate response element gene transcriptions TGF-β control elements (TCEs) TGF-β inhibitory elements (TIEs) Thyroid hormone response element gene transcriptions Transcriptional regulation Transcription bubble gene transcriptions Transcription factor gene transcriptions Transcription factor 3 gene transcriptions Transcription factory gene transcriptions Transcription start site gene transcriptions Translational control sequence gene transcriptions Tryptophan residue U box gene transcriptions Unfolded protein response element gene transcriptions Upstream response element gene transcriptions Upstream stimulatory factor gene transcriptions UTR promoter gene transcriptions V and P box gene transcriptions V box gene transcriptions Vhr1p gene transcriptions Vitamin D response element gene transcriptions W box gene transcriptions X box gene transcriptions Xbp1p gene transcriptions X core promoter element gene transcriptions Xenobiotic response element gene transcriptions Xenobiotic responsive element gene transcriptions Yap1p,2p gene transcriptions Y box gene transcriptions YY1 gene transcriptions Zap1p gene transcriptions Z box gene transcriptions Zinc responsive element gene transcriptions