H and ACA box gene transcriptions
Editor-In-Chief: Henry A. Hoff
Consensus sequences
"The box H/ACA snoRNAs were most recently recognized as a small RNA family by virtue of an ACA trinucleotide located 3 nt upstream of the mature snoRNA 3' end (41). In addition to this ACA box, they have the consensus H box sequence (5'-ANANNA-3') but have no other primary sequence identity. Despite this lack of primary sequence conservation, the H and ACA boxes are embedded in an evolutionarily conserved hairpin-hinge-hairpin-tail core secondary structure with the H box in the single-stranded hinge region and the ACA box in the single-stranded tail (5, 16)."[1]
The "3' end of mature hTR (45) has an ACA trinucleotide 3 nt upstream of its 3' end. In addition, the 3' region of hTR contains a single H box consensus sequence (5'-AGAGGA-3')."[1]
"Comparison with the murine telomerase RNA (mTR) (7) suggests that the snoRNA-like features of hTR are evolutionarily conserved. The mTR 3' end (nt 169 to 397 as numbered in reference 25) has ~76% sequence identity with the corresponding region of hTR (nt 211 to 451) and includes consensus H (5'-ACAGGA-3') and ACA box sequences."[1]
An H box has a consensus sequence of 3'-ACACCA-5'.[2]
The combined consensus sequence is 5'-ACAGGA-3'.[1]
H and ACA boxes in promoters of A1BG
For the Basic programs (starting with SuccessablesHACA.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:
- negative strand, negative direction is SuccessablesHACA--.bas, looking for 5'-ACAGGA-3', 0.
- positive strand, negative direction is SuccessablesHACA+-.bas, looking for 5'-ACAGGA-3', 1, 5'-ACAGGA-3' at 2690.
- negative strand, positive direction is SuccessablesHACA-+.bas, looking for 5'-ACAGGA-3', 1, 5'-ACAGGA-3' at 3572.
- positive strand, positive direction is SuccessablesHACA++.bas, looking for 5'-ACAGGA-3', 1, 5'-ACAGGA-3' at 3620.
- inverse complement, negative strand, negative direction is SuccessablesHACAci--.bas, looking for 5'-TCCTGT-3', 3, 5'-TCCTGT-3' at 3389, 5'-TCCTGT-3' at 3756, 5'-TCCTGT-3' at 4468.
- inverse complement, negative strand, positive direction is SuccessablesHACAci-+.bas, looking for 5'-TCCTGT-3', 2, 5'-TCCTGT-3' at 144, 5'-TCCTGT-3' at 3622.
- inverse complement, positive strand, negative direction is SuccessablesHACAci+-.bas, looking for 5'-TCCTGT-3', 1, 5'-TCCTGT-3' at 1911.
- inverse complement, positive strand, positive direction is SuccessablesHACAci++.bas, looking for 5'-TCCTGT-3', 3, 5'-TCCTGT-3' at 2460, 5'-TCCTGT-3' at 3131, 5'-TCCTGT-3' at 4252.
HACA (4560-2846) UTRs
- Negative strand, negative direction: TCCTGT at 4468, TCCTGT at 3756, TCCTGT at 3389.
HACA negative direction (2811-2596) proximal promoters
- Positive strand, negative direction: ACAGGA at 2690.
HACA positive direction (4265-4050) proximal promoters
- Positive strand, positive direction: TCCTGT at 4252.
HACA negative direction (2596-1) distal promoters
- Positive strand, negative direction: TCCTGT at 1911.
HACA positive direction (4050-1) distal promoters
- Negative strand, positive direction: TCCTGT at 3622, ACAGGA at 3572, TCCTGT at 144.
- Positive strand, positive direction: ACAGGA at 3620, TCCTGT at 3131, TCCTGT at 2460.
H and ACA box random dataset samplings
- HACAr0: 0.
- HACAr1: 1, ACAGGA at 3275.
- HACAr2: 0.
- HACAr3: 1, ACAGGA at 1344.
- HACAr4: 0.
- HACAr5: 0.
- HACAr6: 1, ACAGGA at 2593.
- HACAr7: 1, ACAGGA at 2793.
- HACAr8: 0.
- HACAr9: 0.
- HACAr0ci: 3, TCCTGT at 2253, TCCTGT at 1551, TCCTGT at 1009.
- HACAr1ci: 1, TCCTGT at 1802.
- HACAr2ci: 3, TCCTGT at 2879, TCCTGT at 1204, TCCTGT at 725.
- HACAr3ci: 1, TCCTGT at 350.
- HACAr4ci: 1, TCCTGT at 801.
- HACAr5ci: 0.
- HACAr6ci: 0.
- HACAr7ci: 1, TCCTGT at 4129.
- HACAr8ci: 1, TCCTGT at 3449.
- HACAr9ci: 0.
HACAr UTRs
- HACAr2ci: TCCTGT at 2879.
- HACAr8ci: TCCTGT at 3449.
HACAr proximal promoters
- HACAr7ci: TCCTGT at 4129.
HACAr distal promoters
- HACAr6: ACAGGA at 2593.
- HACAr0ci: TCCTGT at 2253, TCCTGT at 1551, TCCTGT at 1009.
- HACAr2ci: TCCTGT at 1204, TCCTGT at 725.
- HACAr4ci: TCCTGT at 801.
- HACAr1: ACAGGA at 3275.
- HACAr3: ACAGGA at 1344.
- HACAr7: ACAGGA at 2793.
- HACAr1ci: TCCTGT at 1802.
- HACAr3ci: TCCTGT at 350.
H and ACA boxes analysis and results
The combined consensus sequence is ACAGGA.[1]
Reals or randoms | Promoters | direction | Numbers | Strands | Occurrences | Averages (± 0.1) |
---|---|---|---|---|---|---|
Reals | UTR | negative | 3 | 2 | 1.5 | 1.5 |
Randoms | UTR | arbitrary negative | 2 | 10 | 0.2 | 0.2 |
Randoms | UTR | alternate negative | 2 | 10 | 0.2 | 0.2 |
Reals | Core | negative | 0 | 2 | 0 | 0 |
Randoms | Core | negative | 0 | 10 | 0 | 0 |
Reals | Core | positive | 0 | 2 | 0 | 0 |
Randoms | Core | positive | 0 | 10 | 0 | 0 |
Reals | Proximal | negative | 0 | 2 | 0 | 0.25 |
Randoms | Proximal | negative | 0 | 10 | 0 | 0.05 |
Reals | Proximal | positive | 1 | 2 | 0.5 | 0.25 |
Randoms | Proximal | positive | 1 | 10 | 0.1 | 0.05 |
Reals | Distal | negative | 2 | 2 | 1.0 | 2.25 |
Randoms | Distal | negative | 8 | 10 | 0.8 | 0.65 |
Reals | Distal | positive | 7 | 2 | 3.5 | 2.25 |
Randoms | Distal | positive | 5 | 10 | 0.5 | 0.65 |
Comparison:
The occurrences of real H and ACA box consensus sequences are greater than the randoms. This suggests that the real H and ACA box consensus sequences are likely active or activable.
Acknowledgements
The content on this page was first contributed by: Henry A. Hoff.
Initial content for this page in some instances came from Wikiversity.
See also
References
- ↑ 1.0 1.1 1.2 1.3 1.4 James R. Mitchell, Jeffrey Cheng, ang Kathleen Collins (January 1999). "A Box H/ACA Small Nucleolar RNA-Like Domain at the Human Telomerase RNA 3' End" (PDF). Molecular and Cellular Biology. 19 (1): 567–576. Retrieved 5 November 2018.
- ↑ Timofey S. Rozhdestvensky, Thean Hock Tang, Inna V. Tchirkova, Jürgen Brosius, Jean‐Pierre Bachellerie and Alexander Hüttenhofer (2003). "Binding of L7Ae protein to the K‐turn of archaeal snoRNAs: a shared RNA binding motif for C/D and H/ACA box snoRNAs in Archaea". Nucleic Acids Research. 31 (3): 869–77. doi:10.1093/nar/gkg175. Retrieved 2014-06-08.