TATA box immunoglobulin like domain containing family: Difference between revisions

Jump to navigation Jump to search
Marshallsumter (talk | contribs)
Marshallsumter (talk | contribs)
 
(One intermediate revision by the same user not shown)
Line 515: Line 515:
===Gene ID: 1 TATA boxes===
===Gene ID: 1 TATA boxes===


====TATA (4560-2846) UTRs====
====TATA(A/T)A(A/T)(A/G) (4560-2846) UTRs====


# Inverse complement, negative strand, negative direction: TTTATATA at 2871 or +18 TTTATATA +25.
# Inverse complement, negative strand, negative direction: TTTATATA at 2871 or +18 TTTATATA +25.
# Positive strand, negative direction: TATATAAA at 2874 or +21 TATATAAA +28.
# Positive strand, negative direction: TATATAAA at 2874 or +21 TATATAAA +28.


====TATA negative direction (2596-1) distal promoters====
====TATA(A/T)A(A/T)(A/G) negative direction (2596-1) distal promoters====


# Negative strand, negative direction: TATATAAA at 1602 or -1251 TATATAAA -1244, TATATATA at 1600 or -1249 TATATATA -1246.
# Negative strand, negative direction: TATATAAA at 1602 or -1251 TATATAAA -1244, TATATATA at 1600 or -1249 TATATATA -1246.

Latest revision as of 19:05, 4 August 2024

The TATA box (also called Goldberg-Hogness box)[1] is a DNA sequence (cis-regulatory element) found in the promoter region of genes in archaea and eukaryotes;[2] approximately 24% of human genes contain a TATA box within the core promoter.[3]

Human genes

"TATA-containing genes are more often highly regulated, such as by biotic or stress stimuli."[4] Only "∼10% of these TATA-containing promoters have the canonical TATA box (TATAWAWR)."[4]

"SRF-regulated genes of the actin/cytoskeleton/contractile family tend to have a TATA box."[5]

Different "TATA box sequences have different abilities to convey the activating signals of certain enhancers and activators in mammalian cells [...] and in yeast [...]."[5]

"SRF is a well established master regulator of the specific family of genes encoding the actin cytoskeleton and contractile apparatus [...], and we found that ~40% of the core promoters for these genes contain a TATA box, which is a significant enrichment compared to the low overall frequency of TATA-containing promoters in human and mouse genomes (...)."[5] "Global frequencies of core promoter types for human [9010 orthologous mouse-human promoter pairs with 1848 TATA-containing or 7162 TATA-less][6] genes with experimentally validated transcription start sites [are known from 2006]."[5] "The TATA box [...] has a consensus sequence of TATAWAAR [...]."[5] W = A or T and R = A or G. We "estimate that ~17% of promoters contain a TATA box".[6]

Gene ID: 1

"The protein encoded by this gene is a plasma glycoprotein of unknown function. The protein shows sequence similarity to the variable regions of some immunoglobulin supergene family member proteins."[7]

Gene ID: 2205

"The immunoglobulin epsilon receptor (IgE receptor) is the initiator of the allergic response. When two or more high-affinity IgE receptors are brought together by allergen-bound IgE molecules, mediators such as histamine that are responsible for allergy symptoms are released. This receptor is comprised of an alpha subunit, a beta subunit, and two gamma subunits. The protein encoded by this gene represents the alpha subunit."[8] In version AB059236 there is an INR-, whereas version L14075 has -34 "AAGCCTA" -28 as a TATA-, an INR- and a DPE+ downstream.[6]

Gene ID: 2209

"This gene encodes a protein that plays an important role in the immune response. This protein is a high-affinity Fc-gamma receptor. The gene is one of three related gene family members located on chromosome 1."[9] In version CR591922 there is an apparent TATA+ at -31 "TATGAGC" -25, with an INR- and MTE+ downstream, whereas version X14355 has an INR- and a MTE+ downstream, or an INR+ and a DPE+.[6]

Gene ID: 2213

"The protein encoded by this gene is a low affinity receptor for the Fc region of immunoglobulin gamma complexes. The encoded protein is involved in the phagocytosis of immune complexes and in the regulation of antibody production by B-cells. Variations in this gene may increase susceptibilty to systemic lupus erythematosus (SLE). Several transcript variants encoding different isoforms have been found for this gene."[10] This gene version CR592864 has an alleged TATA box at -35 "GAACATT" -29 with an INR- and MTE+ downstream, whereas version M90736 has an INR- and DPE-.[6]

Gene ID: 3383

"This gene encodes a cell surface glycoprotein which is typically expressed on endothelial cells and cells of the immune system. It binds to integrins of type CD11a / CD18, or CD11b / CD18 and is also exploited by Rhinovirus as a receptor."[11] This gene has a TATA box at -14 "TATAAAG" -8 and a MTE+.[6]

Gene ID: 3386

"This gene encodes the Landsteiner-Wiener (LW) blood group antigen(s) that belongs to the immunoglobulin (Ig) superfamily, and that shares similarity with the intercellular adhesion molecule (ICAM) protein family. This ICAM protein contains 2 Ig-like C2-type domains and binds to the leukocyte adhesion LFA-1 protein. The molecular basis of the LW(A)/LW(B) blood group antigens is a single aa variation at position 100; Gln-100=LW(A) and Arg-100=LW(B). Alternative splicing results in multiple transcript variants encoding distinct isoforms."[12] This gene has no TATA box in its core promoter but has an INR+ and a MTE-.[6]

Gene ID: 3556

"This gene encodes a component of the interleukin 1 receptor complex, which initiates signalling events that result in the activation of interleukin 1-responsive genes. Alternative splicing of this gene results in membrane-bound and soluble isoforms differing in their C-terminus. The ratio of soluble to membrane-bound forms increases during acute-phase induction or stress."[13] This gene has no TATA box in its core promoter but does have BRE+, INR- and MTE+.[6]

Gene ID: 3570

"This gene encodes a subunit of the interleukin 6 (IL6) receptor complex. Interleukin 6 is a potent pleiotropic cytokine that regulates cell growth and differentiation and plays an important role in the immune response. The IL6 receptor is a protein complex consisting of this protein and interleukin 6 signal transducer (IL6ST/GP130/IL6-beta), a receptor subunit also shared by many other cytokines. Dysregulated production of IL6 and this receptor are implicated in the pathogenesis of many diseases, such as multiple myeloma, autoimmune diseases and prostate cancer. Alternatively spliced transcript variants encoding distinct isoforms have been identified in this gene. A pseudogene of this gene is found on chromosome 9."[14] This gene has no TATA box in its core promoter but does have an INR-.[6]

Gene ID: 3590

"Interleukin 11 is a stromal cell-derived cytokine that belongs to a family of pleiotropic and redundant cytokines that use the gp130 transducing subunit in their high affinity receptors. This gene encodes the IL-11 receptor, which is a member of the hematopoietic cytokine receptor family. This particular receptor is very similar to ciliary neurotrophic factor, since both contain an extracellular region with a 2-domain structure composed of an immunoglobulin-like domain and a cytokine receptor-like domain. Multiple alternatively spliced transcript variants have been found for this gene."[15] This gene has no TATA box in its core promoter but in version NM_004512 it does have a BRE+ and an INR-, whereas in version AK075530 it has an INR- and a MTE+.[6]

Gene ID: 3593

"This gene encodes a subunit of interleukin 12, a cytokine that acts on T and natural killer cells, and has a broad array of biological activities. Interleukin 12 is a disulfide-linked heterodimer composed of the 40 kD cytokine receptor like subunit encoded by this gene, and a 35 kD subunit encoded by IL12A. This cytokine is expressed by activated macrophages that serve as an essential inducer of Th1 cells development. This cytokine has been found to be important for sustaining a sufficient number of memory/effector Th1 cells to mediate long-term protection to an intracellular pathogen. Overexpression of this gene was observed in the central nervous system of patients with multiple sclerosis (MS), suggesting a role of this cytokine in the pathogenesis of the disease. The promoter polymorphism of this gene has been reported to be associated with the severity of atopic and non-atopic asthma in children."[16] This gene has a TATA box in its core promoter at -29 "TATATAAG" -22 and an INR-.[6]

Gene ID: 3815

"This gene encodes a receptor tyrosine kinase. This gene was initially identified as a homolog of the feline sarcoma viral oncogene v-kit and is often referred to as proto-oncogene c-Kit. The canonical form of this glycosylated transmembrane protein has an N-terminal extracellular region with five immunoglobulin-like domains, a transmembrane region, and an intracellular tyrosine kinase domain at the C-terminus. Upon activation by its cytokine ligand, stem cell factor (SCF), this protein phosphorylates multiple intracellular proteins that play a role in in the proliferation, differentiation, migration and apoptosis of many cell types and thereby plays an important role in hematopoiesis, stem cell maintenance, gametogenesis, melanogenesis, and in mast cell development, migration and function. This protein can be a membrane-bound or soluble protein. Mutations in this gene are associated with gastrointestinal stromal tumors, mast cell disease, acute myelogenous leukemia, and piebaldism. Multiple transcript variants encoding different isoforms have been found for this gene."[17] There is no TATA box but there are a BRE+, an INR+, and a MTE+.[6]

Gene ID: 4063

"LY9 belongs to the SLAM family of immunomodulatory receptors (see SLAMF1; MIM 603492) and interacts with the adaptor molecule SAP (SH2D1A; MIM 300490) (Graham et al., 2006 [PubMed 16365421])."[18] There is no TATA box but there is an INR+.[6]

Gene ID: 4914

"This gene encodes a member of the neurotrophic tyrosine kinase receptor (NTKR) family. This kinase is a membrane-bound receptor that, upon neurotrophin binding, phosphorylates itself and members of the MAPK pathway. The presence of this kinase leads to cell differentiation and may play a role in specifying sensory neuron subtypes. Mutations in this gene have been associated with congenital insensitivity to pain, anhidrosis, self-mutilating behavior, cognitive disability and cancer. Alternate transcriptional splice variants of this gene have been found, but only three have been characterized to date."[19] There is no TATA box but there are an INR+ and a MTE-.[6]

Gene ID: 5175

"The protein encoded by this gene is found on the surface of platelets, monocytes, neutrophils, and some types of T-cells, and makes up a large portion of endothelial cell intercellular junctions. The encoded protein is a member of the immunoglobulin superfamily and is likely involved in leukocyte migration, angiogenesis, and integrin activation."[20] There is no TATA box but there are an INR+ and a MTE-.[6]

Gene ID: 7010

"This gene encodes a receptor that belongs to the protein tyrosine kinase Tie2 family. The encoded protein possesses a unique extracellular region that contains two immunoglobulin-like domains, three epidermal growth factor (EGF)-like domains and three fibronectin type III repeats. The ligand angiopoietin-1 binds to this receptor and mediates a signaling pathway that functions in embryonic vascular development. Mutations in this gene are associated with inherited venous malformations of the skin and mucous membranes. Alternative splicing results in multiple transcript variants. Additional alternatively spliced transcript variants of this gene have been described, but their full-length nature is not known."[21] There is an annotated TATA- box -29 "AGGAACT" -23 with an INR- and a MTE-.[6]

Gene ID: 7070

"This gene encodes a cell surface glycoprotein and member of the immunoglobulin superfamily of proteins. The encoded protein is involved in cell adhesion and cell communication in numerous cell types, but particularly in cells of the immune and nervous systems. The encoded protein is widely used as a marker for hematopoietic stem cells. This gene may function as a tumor suppressor in nasopharyngeal carcinoma. Alternative splicing results in multiple transcript variants."[22] There is an annotated TATA- box -29 "CGTCCTT" -23 with an INR- and a MTE-.[6]

Gene ID: 7075

"This gene encodes a member of the tyrosine protein kinase family. The encoded protein plays a critical role in angiogenesis and blood vessel stability by inhibiting angiopoietin 1 signaling through the endothelial receptor tyrosine kinase Tie2. Ectodomain cleavage of the encoded protein relieves inhibition of Tie2 and is mediated by multiple factors including vascular endothelial growth factor. Alternatively spliced transcript variants encoding multiple isoforms have been observed for this gene."[23] There is no TATA box, but there are an INR+ and a DPE-.[6]

Gene ID: 8482

"This gene encodes a member of the semaphorin family of proteins. The encoded preproprotein is proteolytically processed to generate the mature glycosylphosphatidylinositol (GPI)-anchored membrane glycoprotein. The encoded protein is found on activated lymphocytes and erythrocytes and may be involved in immunomodulatory and neuronal processes. The encoded protein carries the John Milton Hagen (JMH) blood group antigens. Mutations in this gene may be associated with reduced bone mineral density (BMD). Alternative splicing results in multiple transcript variants, at least one of which encodes an isoform that is proteolytically processed."[24] There is no TATA box but there are an INR- and a DPE+.[6]

Gene ID: 8483

"Major alterations in the composition of the cartilage extracellular matrix occur in joint disease, such as osteoarthrosis. This gene encodes the cartilage intermediate layer protein (CILP), which increases in early osteoarthrosis cartilage. The encoded protein was thought to encode a protein precursor for two different proteins; an N-terminal CILP and a C-terminal homolog of NTPPHase, however, later studies identified no nucleotide pyrophosphatase phosphodiesterase (NPP) activity. The full-length and the N-terminal domain of this protein was shown to function as an IGF-1 antagonist. An allelic variant of this gene has been associated with lumbar disc disease."[25] This gene has a TATA box at -30 "TATATAA" -24, INR- and MTE+.[6]

CILP and A1BG are in the HGNC gene group (Immunoglobulin like domain containing) of the Immunoglobulin superfamily domain containing.[26]

Gene ID: 9214

"Fc receptors specifically bind to the Fc region of immunoglobulins (Igs) to mediate the unique functions of each Ig class. FAIM3 encodes an Fc receptor for IgM (see MIM 147020) (Kubagawa et al., 2009 [PubMed 19858324]; Shima et al., 2010 [PubMed 20042454])."[27] In version BC006401, at the TATA box position there is no TATA box and downstream an INR- and a DPE-, whereas version NM_005449 has -29 "CTCTCTT" -23 as TATA- and an INR- downstream.[6]

Gene ID: 9723

"Semaphorins are a large family of conserved secreted and membrane associated proteins which possess a semaphorin (Sema) domain and a PSI domain (found in plexins, semaphorins and integrins) in the N-terminal extracellular portion. Based on sequence and structural similarities, semaphorins are put into eight classes: invertebrates contain classes 1 and 2, viruses have class V, and vertebrates contain classes 3-7. Semaphorins serve as axon guidance ligands via multimeric receptor complexes, some (if not all) containing plexin proteins. This gene encodes a class 4 semaphorin. This gene encodes a class 3 semaphorin. Multiple transcript variants encoding different isoforms have been found for this gene."[28] At the TATA box position there is -29 "AATACAG" -23 and downstream an INR+ and a DPE+.[6]

Gene ID: 10068

"The protein encoded by this gene functions as an inhibitor of the proinflammatory cytokine, IL18. It binds IL18, prevents the binding of IL18 to its receptor, and thus inhibits IL18-induced IFN-gamma production, resulting in reduced T-helper type 1 immune responses. This protein is constitutively expressed and secreted in mononuclear cells. Elevated level of this protein is detected in the intestinal tissues of patients with Crohn's disease. Alternatively spliced transcript variants encoding different isoforms have been described for this gene."[29] There is no TATA box in the core promoter but there is an INR- and MTE-.[6]

Gene ID: 10223

"The glycoprotein encoded by this gene is a cell surface antigen that is expressed in greater than 95% of human colon cancers. The open reading frame encodes a 319-amino acid polypeptide having a putative secretory signal sequence and 3 potential glycosylation sites. The predicted mature protein has a 213-amino acid extracellular region, a single transmembrane domain, and a 62-amino acid intracellular tail. The sequence of the extracellular region contains 2 domains characteristic of the CD2 subgroup of the immunoglobulin (Ig) superfamily."[30] In version BC069789 there is no TATA box, but there is an INR-, a MTE+ and DPE+.[6] In another version of the same version, there is an apparent TATA box at -34 "TATTATT" -28 with an INR+ and MTE- downstream.[6]

Gene ID: 11055

"ZPBP is one of several proteins that are thought to participate in secondary binding between acrosome-reacted sperm and the egg-specific extracellular matrix, the zona pellucida (McLeskey et al., 1998 [PubMed 9378618])."[31] There is no TATA box, but there is a MTE+.[6]

Gene ID: 51599

"Predicted to be involved in several processes, including establishment of skin barrier; protein localization to tricellular tight junction; and tricellular tight junction assembly. Located in extracellular exosome."[32] There is no TATA box but there are an INR+ and a MTE+.[6]

Gene ID: 57611

"Predicted to be involved in positive regulation of axon extension. Predicted to be located in plasma membrane. Predicted to be integral component of membrane. Predicted to be active in cell surface."[33] No TATA box is present but there are an INR+, MTE+ and a DPE-.[6]

Gene ID: 57715

"Semaphorins are a large family of conserved secreted and membrane associated proteins which possess a semaphorin (Sema) domain and a PSI domain (found in plexins, semaphorins and integrins) in the N-terminal extracellular portion. Based on sequence and structural similarities, semaphorins are put into eight classes: invertebrates contain classes 1 and 2, viruses have class V, and vertebrates contain classes 3-7. Semaphorins serve as axon guidance ligands via multimeric receptor complexes, some (if not all) containing plexin proteins. Multiple transcript variants encoding different isoforms have been found for this gene."[34] No TATA box is present but there are a BRE+, an INR-, and a MTE+.[6]

Gene ID: 59307

"Predicted to enable NAD+ nucleosidase activity. Involved in negative regulation of DNA-binding transcription factor activity. Located in membrane."[35] No TATA box is present but there is a MTE-.[6]

Gene ID: 84824

"This gene encodes a protein similar to receptors for the Fc fragment of gamma immunoglobulin (IgG). These receptors, referred to as FCGRs, mediate the destruction of IgG-coated antigens and of cells induced by antibodies. This encoded protein is selectively expressed in B cells, and may be involved in their development. This protein may also be involved in the development of lymphomas. Multiple alternatively spliced transcript variants that encode different protein isoforms have been described for this gene."[36] This gene has no TATA box in its core promoter but it does have an INR-, MTE+ and a DPE+.[6]

Gene ID: 115350

"This gene encodes a member of the immunoglobulin receptor superfamily and is one of several Fc receptor-like glycoproteins clustered on the long arm of chromosome 1. The encoded protein contains three extracellular C2-like immunoglobulin domains, a transmembrane domain and a cytoplasmic domain with two immunoreceptor-tyrosine activation motifs. This protein may play a role in the regulation of cancer cell growth. Alternative splicing results in multiple transcript variants."[37] This gene has an INR+ and a MTE+.[6]

Butler 2002 TATA boxes

Gene ID: 1 TATA boxes

TATAAA UTRs (4560-2846)

  1. Negative strand, negative direction: TATAAA at 2852 or +2 TATAAA +7.
  2. Inverse complement, negative strand, negative direction: TTTATA at 2869 or +18 TTTATA +23.
  3. Positive strand, negative direction: TATAAA at 2874 or +23 TATAAA +28.

TATAAA proximal promoters (2811-2596)

  1. Inverse complement, negative strand, negative direction: TTTATA at 2638 or -213 TTTATA -208.

TATAAA distal promoters (2596-1)

  1. Negative strand, negative direction: TATAAA at 1602 or -1117 TATAAA -1112.
  2. Inverse complement, negative strand, negative direction: TTTATA at 1740 or -1111 TTTATA -1106.
  3. Positive strand, negative direction: TATAAA at 221 or -2630 TATAAA -2625, TATAAA at 182 or -2669 TATAAA -2664.
  4. Inverse complement, positive strand, negative direction: TTTATA at 219 or -2632 TTTATA -2627.

TATAAA distal promoters (4050-1)

  1. Inverse complement, negative strand, positive direction: TTTATA at 2588 or -1467 TTTATA -1462.

Gene ID: 3383 TATA boxes

  1. -14 TATAAA -9 nts from the TSS.[6]

Carninci 2006 TATA boxes

Gene ID: 1 TATA boxes

TATAAAA (4560-2846) UTRs

  1. Negative strand, negative direction: TATAAAA at 2853 or +2 TATAAAA +8.
  2. Inverse complement, negative strand, negative direction: TTTTATA at 2869 or +17 TTTTATA +23.

TATAAAA negative direction (2596-1) distal promoters

  1. Inverse complement, negative strand, negative direction: TTTTATA at 1740 or -1112 TTTTATA -1106.
  2. Positive strand, negative direction: TATAAAA at 222 or -2630 TATAAAA -2624, TATAAAA at 183 or -2669 TATAAAA -2663.
  3. Inverse complement, positive strand, negative direction: TTTTATA at 219 or -2633 TTTTATA -2627.

Watson 2014 TATA boxes

Gene ID: 1 TATA boxes

TATA(A/T)A(A/T) (4560-2846) UTRs

  1. Negative strand, negative direction: TATATAT at 2872 or +20 TATATAT +26, TATAAAA at 2853 or +2 TATAAAA +8.
  2. Inverse complement, negative strand, negative direction: TTATATA at 2871 or +19 TTATATA +25, TTTTATA at 2869 or +17 TTTTATA +23.
  3. Positive strand, negative direction: TATATAA at 2873 or +21 TATATAA +27.

TATA(A/T)A(A/T) negative direction (2811-2596) proximal promoters

  1. Inverse complement, negative strand, negative direction: ATTTATA at 2638 or -214 ATTTATA -208.

TATA(A/T)A(A/T) negative direction (2596-1) distal promoters

  1. Negative strand, negative direction: TATATAA at 1601 or -1251 TATATAA -1245, TATATAT at 1599 -1253 TATATAT -1247.
  2. Inverse complement, negative strand, negative direction: TTTTATA at 1740 or -1112 TTTTATA -1106.
  3. Positive strand, negative direction: TATATAT at 1600 or -1252 TATATAA -1246, TATAAAA at 222 -2630 TATAAAA -2624, TATAAAA at 183 or -2669 TATAAAA -2663.
  4. Inverse complement, positive strand, negative direction: TTTTATA at 219 or -2633 TTTTATA -2627.

Gene ID: 3593 TATA boxes

  1. TATA box -28 TATATATA -21.[6]

Gene ID: 8483 TATA boxes

  1. TATA box -30 TATATAAG -23 nts.[6]

Juven-Gershon 2010 TATA boxes

Gene ID: 1 TATA boxes

TATA(A/T)AA(A/G) (4560-2846) UTRs

  1. Inverse complement, negative strand, negative direction: TTTATATA at 2871 or +18 TTTATATA +25.
  2. Positive strand, negative direction: TATATAAA at 2874 or +21 TATATAAA +28.

TATA(A/T)AA(A/G) negative direction (2596-1) distal promoters

  1. Negative strand, negative direction: TATATAAA at 1602 or -1251 TATATAAA -1244.
  2. Positive strand, negative direction: TATAAAAG at 223 or -2630 TATAAAAG -2623, TATAAAAG at 184 or -2669 TATAAAAG -2662.
  3. Inverse complement, positive strand, negative direction: TTTTTATA at 219 or -2634 TTTTTATA -2627.

Basehoar 2004 TATA boxes

Gene ID: 1 TATA boxes

TATA(A/T)A(A/T)(A/G) (4560-2846) UTRs

  1. Inverse complement, negative strand, negative direction: TTTATATA at 2871 or +18 TTTATATA +25.
  2. Positive strand, negative direction: TATATAAA at 2874 or +21 TATATAAA +28.

TATA(A/T)A(A/T)(A/G) negative direction (2596-1) distal promoters

  1. Negative strand, negative direction: TATATAAA at 1602 or -1251 TATATAAA -1244, TATATATA at 1600 or -1249 TATATATA -1246.
  2. Positive strand, negative direction: TATAAAAG at 223 or -2630 TATAAAAG -2623, TATAAAAG at 184 or -2669 TATAAAAG -2662.
  3. Inverse complement, positive strand, negative direction: TTTTTATA at 219 or -2634 TTTTTATA -2627.

Families of TATA box genes

Acknowledgements

The content on this page was first contributed by: Henry A. Hoff.

References

  1. R. P. Lifton, M. L. Goldberg, R. W. Karp, and D. S. Hogness (1978). "The organization of the histone genes in Drosophila melanogaster: functional and evolutionary implications". Cold Spring Harbor Symposia on Quantitative Biology. 42: 1047–51. doi:10.1101/SQB.1978.042.01.105. PMID 98262.
  2. Stephen T. Smale and James T. Kadonaga (July 2003). "The RNA Polymerase II Core Promoter" (PDF). Annual Review of Biochemistry. 72 (1): 449–79. doi:10.1146/annurev.biochem.72.121801.161520. PMID 12651739. Retrieved 2012-05-07.
  3. C Yang, E Bolotin, T Jiang, FM Sladek, E Martinez (March 2007). "Prevalence of the initiator over the TATA box in human and yeast genes and identification of DNA motifs enriched in human TATA-less core promoters". Gene. 389 (1): 52–65. doi:10.1016/j.gene.2006.09.029. PMID 17123746.
  4. 4.0 4.1 Chuhu Yang, Eugene Bolotin, Tao Jiang, Frances M. Sladek, and Ernest Martinez (10 October 2006). "Prevalence of the Initiator over the TATA box in human and yeast genes and identification of DNA motifs enriched in human TATA-less core promoters". Gene. 389 (1): 52–65. doi:10.1016/j.gene.2006.09.029. PMID 17123746. Retrieved 2024-06-07.
  5. 5.0 5.1 5.2 5.3 5.4 Muyu Xu, Elsie Gonzalez-Hurtado, and Ernest Martinez (April 2016). "Core promoter-specific gene regulation: TATA box selectivity and Initiator-dependent bi-directionality of serum response factor-activated transcription". Biochimica et Biophysica Acta (BBA) - Gene Regulatory Mechanisms. 1859 (4): 553–563. doi:10.1016/j.bbagrm.2016.01.005. Retrieved 2024-06-08.
  6. 6.00 6.01 6.02 6.03 6.04 6.05 6.06 6.07 6.08 6.09 6.10 6.11 6.12 6.13 6.14 6.15 6.16 6.17 6.18 6.19 6.20 6.21 6.22 6.23 6.24 6.25 6.26 6.27 6.28 6.29 6.30 6.31 6.32 6.33 6.34 Victor X Jin, Gregory AC Singer, Francisco J Agosto-Pérez, Sandya Liyanarachchi, and Ramana V Davuluri (2006). "Genome-wide analysis of core promoter elements from conserved human and mouse orthologous pairs". BMC Bioinformatics. 7: 114. doi:10.1186/1471-2105-7-114. Retrieved 2024-06-09.
  7. RefSeq (July 2008). "A1BG alpha-1-B glycoprotein [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-07-31.
  8. RefSeq (August 2011). "FCER1A Fc epsilon receptor Ia [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-07-31.
  9. RefSeq (July 2008). "FCGR1A Fc gamma receptor Ia [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-07-31.
  10. RefSeq (June 2010). "FCGR2B Fc gamma receptor IIb [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-07-31.
  11. RefSeq (July 2008). "ICAM1 intercellular adhesion molecule 1 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-08-01.
  12. RefSeq (July 2008). "ICAM4 intercellular adhesion molecule 4 (Landsteiner-Wiener blood group) [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-08-01.
  13. RefSeq (July 2018). "IL1RAP interleukin 1 receptor accessory protein [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-08-01.
  14. RefSeq (August 2020). "IL1RAP interleukin 1 receptor accessory protein [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-08-01.
  15. RefSeq (June 2012). "IL11RA interleukin 11 receptor subunit alpha [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-08-01.
  16. RefSeq (July 2008). "IL12B interleukin 12B [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-08-01.
  17. RefSeq (May 2020). "KIT KIT proto-oncogene, receptor tyrosine kinase [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-08-02.
  18. OMIM (March 2008). "LY9 lymphocyte antigen 9 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-08-02.
  19. RefSeq (July 2008). "NTRK1 neurotrophic receptor tyrosine kinase 1 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-08-02.
  20. RefSeq (May 2010). "PECAM1 platelet and endothelial cell adhesion molecule 1 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-08-02.
  21. RefSeq (February 2014). "TEK TEK receptor tyrosine kinase [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-08-02.
  22. RefSeq (July 2015). "THY1 Thy-1 cell surface antigen [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-08-02.
  23. RefSeq (November 2011). "TIE1 tyrosine kinase with immunoglobulin like and EGF like domains 1 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-08-02.
  24. RefSeq (February 2016). "SEMA7A semaphorin 7A (JohnMiltonHagen blood group) [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-08-02.
  25. RefSeq (September 2010). "CILP cartilage intermediate layer protein [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-07-31.
  26. HGNC (2024). "CILP cartilage intermediate layer protein [ Homo sapiens ]". Farmington, Connecticut, USA: genenames.org. Retrieved 2024-07-31.
  27. OMIM (July 2010). "FCMR Fc mu receptor [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-07-31.
  28. RefSeq (May 2010). "SEMA3E semaphorin 3E [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-08-02.
  29. RefSeq (February 2011). "IL18BP interleukin 18 binding protein [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-08-02.
  30. RefSeq (July 2008). "GPA33 glycoprotein A33 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-08-01.
  31. OMIM (August 2008). "GPA33 glycoprotein A33 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-08-01.
  32. Alliance of Genome Resources (April 2022). "LSR lipolysis stimulated lipoprotein receptor [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-08-02.
  33. Alliance of Genome Resources (April 2022). "ISLR2 immunoglobulin superfamily containing leucine rich repeat 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-08-02.
  34. RefSeq (February 2011). "SEMA4G semaphorin 4G [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-08-02.
  35. Alliance of Genome Resources (April 2022). "SIGIRR single Ig and TIR domain containing [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-08-02.
  36. RefSeq (August 2011). "FCRLA Fc receptor like A [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-07-31.
  37. RefSeq (April 2009). "FCRL1 Fc receptor like 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-07-31.

External links