H and ACA box gene transcriptions
Editor-In-Chief: Henry A. Hoff
Consensus sequences
"The box H/ACA snoRNAs were most recently recognized as a small RNA family by virtue of an ACA trinucleotide located 3 nt upstream of the mature snoRNA 3' end (41). In addition to this ACA box, they have the consensus H box sequence (5'-ANANNA-3') but have no other primary sequence identity. Despite this lack of primary sequence conservation, the H and ACA boxes are embedded in an evolutionarily conserved hairpin-hinge-hairpin-tail core secondary structure with the H box in the single-stranded hinge region and the ACA box in the single-stranded tail (5, 16)."[1]
The "3' end of mature hTR (45) has an ACA trinucleotide 3 nt upstream of its 3' end. In addition, the 3' region of hTR contains a single H box consensus sequence (5'-AGAGGA-3')."[1]
"Comparison with the murine telomerase RNA (mTR) (7) suggests that the snoRNA-like features of hTR are evolutionarily conserved. The mTR 3' end (nt 169 to 397 as numbered in reference 25) has ~76% sequence identity with the corresponding region of hTR (nt 211 to 451) and includes consensus H (5'-ACAGGA-3') and ACA box sequences."[1]
An H box has a consensus sequence of 3'-ACACCA-5'.[2]
The combined consensus sequence is 5'-ACAGGA-3'.[1]
H and ACA boxes in promoters of A1BG
For the Basic programs (starting with SuccessablesHACA.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:
- negative strand, negative direction is SuccessablesHACA--.bas, looking for 5'-ACAGGA-3', 0.
- negative strand, positive direction is SuccessablesHACA-+.bas, looking for 5'-ACAGGA-3', 1, 5'-ACAGGA-3' at 3572.
- positive strand, negative direction is SuccessablesHACA+-.bas, looking for 5'-ACAGGA-3', 1, 5'-ACAGGA-3' at 2690.
- positive strand, positive direction is SuccessablesHACA++.bas, looking for 5'-ACAGGA-3', 1, 5'-ACAGGA-3' at 3620.
- complement, negative strand, negative direction is SuccessablesHACAc--.bas, looking for 5'-TGTCCT-3', 1, 5'-TGTCCT-3' at 2690.
- complement, negative strand, positive direction is SuccessablesHACAc-+.bas, looking for 5'-TGTCCT-3', 1, 5'-TGTCCT-3' at 3620.
- complement, positive strand, negative direction is SuccessablesHACAc+-.bas, looking for 5'-TGTCCT-3', 0.
- complement, positive strand, positive direction is SuccessablesHACAc++.bas, looking for 5'-TGTCCT-3', 1, 5'-TGTCCT-3' at 3572.
- inverse complement, negative strand, negative direction is SuccessablesHACAci--.bas, looking for 5'-TCCTGT-3', 3, 5'-TCCTGT-3' at 3389, 5'-TCCTGT-3' at 3756, 5'-TCCTGT-3' at 4468.
- inverse complement, negative strand, positive direction is SuccessablesHACAci-+.bas, looking for 5'-TCCTGT-3', 2, 5'-TCCTGT-3' at 144, 5'-TCCTGT-3' at 3622.
- inverse complement, positive strand, negative direction is SuccessablesHACAci+-.bas, looking for 5'-TCCTGT-3', 1, 5'-TCCTGT-3' at 1911.
- inverse complement, positive strand, positive direction is SuccessablesHACAci++.bas, looking for 5'-TCCTGT-3', 3, 5'-TCCTGT-3' at 2460, 5'-TCCTGT-3' at 3131, 5'-TCCTGT-3' at 4252.
- inverse, negative strand, negative direction, is SuccessablesHACAi--.bas, looking for 5'-AGGACA-3', 1, 5'-AGGACA-3' at 1911.
- inverse, negative strand, positive direction, is SuccessablesHACAi-+.bas, looking for 5'-AGGACA-3', 3, 3'-AGGACA-3' at 2460, 3'-AGGACA-3' at 3131, 3'-AGGACA-3' at 4252.
- inverse, positive strand, negative direction, is SuccessablesHACAi+-.bas, looking for 5'-AGGACA-3', 3, 5'-AGGACA-3' at 3389, 5'-AGGACA-3' at 3756, 5'-AGGACA-3' at 4468.
- inverse, positive strand, positive direction, is SuccessablesHACAi++.bas, looking for 5'-AGGACA-3', 2, 5'-AGGACA-3' at 144, 5'-AGGACA-3' at 3622.
HACA UTRs
- Negative strand, negative direction: TCCTGT at 4468, TCCTGT at 3756, TCCTGT at 3389.
HACA proximal promoters
- Positive strand, positive direction: TCCTGT at 4252.
HACA distal promoters
- Positive strand, negative direction: ACAGGA at 2690, TCCTGT at 1911.
- Negative strand, positive direction: TCCTGT at 3622, ACAGGA at 3572, TCCTGT at 144.
- Positive strand, positive direction: TCCTGT at 4252, ACAGGA at 3620, TCCTGT at 3131, TCCTGT at 2460.
H and ACA box random dataset samplings
- HACAr0: 0.
- HACAr1: 1, ACAGGA at 3275.
- HACAr2: 0.
- HACAr3: 1, ACAGGA at 1344.
- HACAr4: 0.
- HACAr5: 0.
- HACAr6: 1, ACAGGA at 2593.
- HACAr7: 1, ACAGGA at 2793.
- HACAr8: 0.
- HACAr9: 0.
- RDr0ci: 0.
- RDr1ci: 0.
- RDr2ci: 0.
- RDr3ci: 0.
- RDr4ci: 0.
- RDr5ci: 0.
- RDr6ci: 0.
- RDr7ci: 0.
- RDr8ci: 0.
- RDr9ci: 0.
RDr UTRs
RDr core promoters
RDr proximal promoters
HACAr distal promoters
- HACAr6: ACAGGA at 2593.
- HACAr1: ACAGGA at 3275.
- HACAr3: ACAGGA at 1344.
- HACAr7: ACAGGA at 2793.
Response element analysis and results
The combined consensus sequence is 5'-ACAGGA-3'.[1]
Reals or randoms | Promoters | direction | Numbers | Strands | Occurrences | Averages (± 0.1) |
---|---|---|---|---|---|---|
Reals | UTR | negative | 0 | 2 | 0 | 0 |
Randoms | UTR | arbitrary negative | 0 | 10 | 0 | 0 |
Randoms | UTR | alternate negative | 0 | 10 | 0 | 0 |
Reals | Core | negative | 0 | 2 | 0 | 0 |
Randoms | Core | negative | 0 | 10 | 0 | 0 |
Reals | Core | positive | 0 | 2 | 0 | 0 |
Randoms | Core | positive | 0 | 10 | 0 | 0 |
Reals | Proximal | negative | 0 | 2 | 0 | 0 |
Randoms | Proximal | negative | 0 | 10 | 0 | 0 |
Reals | Proximal | positive | 0 | 2 | 0 | 0 |
Randoms | Proximal | positive | 0 | 10 | 0 | 0 |
Reals | Distal | negative | 0 | 2 | 0 | 0 |
Randoms | Distal | negative | 0 | 10 | 0 | 0 |
Reals | Distal | positive | 0 | 2 | 0 | 0 |
Randoms | Distal | positive | 0 | 10 | 0 | 0 |
Comparison:
The occurrences of real responsive element consensus sequences are larger than the randoms. This suggests that the real responsive element consensus sequences are likely active or activable.
Acknowledgements
The content on this page was first contributed by: Henry A. Hoff.
Initial content for this page in some instances came from Wikiversity.
See also
References
- ↑ 1.0 1.1 1.2 1.3 1.4 James R. Mitchell, Jeffrey Cheng, ang Kathleen Collins (January 1999). "A Box H/ACA Small Nucleolar RNA-Like Domain at the Human Telomerase RNA 3' End" (PDF). Molecular and Cellular Biology. 19 (1): 567–576. Retrieved 5 November 2018.
- ↑ Timofey S. Rozhdestvensky, Thean Hock Tang, Inna V. Tchirkova, Jürgen Brosius, Jean‐Pierre Bachellerie and Alexander Hüttenhofer (2003). "Binding of L7Ae protein to the K‐turn of archaeal snoRNAs: a shared RNA binding motif for C/D and H/ACA box snoRNAs in Archaea". Nucleic Acids Research. 31 (3): 869–77. doi:10.1093/nar/gkg175. Retrieved 2014-06-08.