CadC binding domain gene transcriptions
Associate Editor(s)-in-Chief: Henry A. Hoff
"Dimerization of [cadaverine C-terminal] CadC enables the binding of two DBDs to the two Cad1 consensus target sites."[1]
Consensus sequences
"Altogether, the specific contacts observed suggest a consensus binding motif of 5′-T-T-A-x-x-x-x-T-3′."[1]
"The DNA consensus sequence 5′-T-T-A-x-x-x-x-T-3′ is present once in the quasi-palindromic Cad1 17-mer DNA, consistent with the formation of a 1:1 complex. However, a second consensus facilitates the formation of the 2:1 complex of CadC with Cad1 41-mer DNA as evidenced by the CadC model with the minimal Cad1 26-mer DNA that spans the two AT-rich regions, i.e. consensus sites."[1]
Hypotheses
- A1BG has no CadC binding domain in either promoter.
- A1BG is not transcribed by a CadC binding domain.
- CadC binding domain does not participate in the transcription of A1BG.
Cadaverine C samplings
Copying the cadaverine C-terminal binding domain consensus sequence 5'-T-T-A-x-x-x-x-T-3' and putting the sequence in "⌘F" finds one location between ZNF497 and A1BG or four locations between ZSCAN22 and A1BG as can be found by the computer programs.
For the Basic programs testing consensus sequence 5'-TTANNNNT-3' (starting with SuccessablesCadC.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:
- Negative strand, negative direction: 24, TTATTAAT at 4227, TTATTATT at 4224, TTATCTTT at 4081, TTATTTAT at 4074, TTAGGGTT at 3978, TTACCCTT at 3662, TTATGACT at 3542, TTAACTAT at 3360, TTATTTGT at 3337, TTACCGAT at 3007, TTACGAAT at 2935, TTATATAT at 2872, TTATATGT at 2641, TTATCATT at 2501, TTATGTTT at 2306, TTATGTTT at 1879, TTATCTCT at 1712, TTACGGTT at 1636, TTATGTCT at 1567, TTAGTCCT at 985, TTATGTTT at 770, TTATGTTT at 636, TTATGCTT at 493, TTAAGATT at 9.
- Negative strand, positive direction: 9, TTATAATT at 4169, TTATTGAT at 4162, TTAATCAT at 4148, TTATTAAT at 4145, TTATCACT at 4126, TTATGACT at 3029, TTAGGGCT at 2768, TTAAAATT at 2447, TTAAACAT at 2139.
- Positive strand, negative direction: 6, TTATTAAT at 4540, TTAAAAAT at 3355, TTAGATAT at 2982, TTATCTTT at 1733, TTAAGTGT at 322, TTAAGAAT at 196.
- Positive strand, positive direction: 5, TTACTCCT at 4097, TTATACCT at 3163, TTATCTTT at 2629, TTAAAATT at 2443, TTACACTT at 231.
- inverse complement, negative strand, negative direction: 10, ATAATTAA at 4541, AAGTGTAA at 4533, ATTATTAA at 4226, ATCTATAA at 3467, ATTTTTAA at 3356, ATTTTTAA at 3174, ATGGATAA at 2997, ATCTCTAA at 1714, ACCCTTAA at 1694, ATATATAA at 1601.
- inverse complement, positive strand, negative direction: 28, AATAATAA at 4224, AAAAATAA at 4221, ATAAATAA at 4075, AAAAATAA at 4071, ACCTGTAA at 3972, AGGGATAA at 3656, ATTACTAA at 3472, ATTGATAA at 3361, ACAAATAA at 3334, AAACATAA at 3169, AAAACTAA at 3030, ATGGCTAA at 3008, ATGCTTAA at 2936, ATATATAA at 2873, AATAGTAA at 2501, ATCTTTAA at 1735, AAACCTAA at 1591, ACACTTAA at 1546, ACCTGTAA at 1133, ACCTGTAA at 803, ACACGTAA at 533, ACCTGTAA at 396, AGTGCTAA at 337, AACCTTAA at 318, AGATGTAA at 247, AGGTATAA at 181, ACACCTAA at 64, AATTCTAA at 9
- inverse complement, negative strand, positive direction: 2, AATTTTAA at 2443, ACAGTTAA at 2135.
- inverse complement, positive strand, positive direction: 10, AATATTAA at 4169, ATAACTAA at 4163, ACTAATAA at 4159, AGAACTAA at 4133, ATCCATAA at 2643, AACCCTAA at 2545, AATTTTAA at 2447, ACAAATAA at 2347, ACCGATAA at 1976, AAACATAA at 115.
CadC (4560-2846) UTRs
- Negative strand, negative direction: ATAATTAA at 4541, AAGTGTAA at 4533, TTATTAAT at 4227, ATTATTAA at 4226, TTATTATT at 4224, TTATCTTT at 4081, TTATTTAT at 4074, TTAGGGTT at 3978, TTACCCTT at 3662, TTATGACT at 3542, ATCTATAA at 3467, TTAACTAT at 3360, ATTTTTAA at 3356, TTATTTGT at 3337, ATTTTTAA at 3174, TTACCGAT at 3007, ATGGATAA at 2997, TTACGAAT at 2935, TTATATAT at 2872.
- Positive strand, negative direction: TTATTAAT at 4540, AATAATAA at 4224, AAAAATAA at 4221, ATAAATAA at 4075, AAAAATAA at 4071, ACCTGTAA at 3972, AGGGATAA at 3656, ATTACTAA at 3472, ATTGATAA at 3361, TTAAAAAT at 3355, ACAAATAA at 3334, AAACATAA at 3169, AAAACTAA at 3030, ATGGCTAA at 3008, TTAGATAT at 2982, ATGCTTAA at 2936, ATATATAA at 2873.
CadC negative direction (2811-2596) proximal promoters
- Negative strand, negative direction: TTATATGT at 2641.
CadC positive direction (4265-4050) proximal promoters
- Negative strand, positive direction: TTATAATT at 4169, TTATTGAT at 4162, TTAATCAT at 4148, TTATTAAT at 4145, TTATCACT at 4126.
- Positive strand, positive direction: AATATTAA at 4169, ATAACTAA at 4163, ACTAATAA at 4159, AGAACTAA at 4133, TTACTCCT at 4097.
CadC negative direction (2596-1) distal promoters
- Negative strand, negative direction: TTATCATT at 2501, TTATGTTT at 2306, TTATGTTT at 1879, ATCTCTAA at 1714, TTATCTCT at 1712, ACCCTTAA at 1694, TTACGGTT at 1636, ATATATAA at 1601, TTATGTCT at 1567, TTAGTCCT at 985, TTATGTTT at 770, TTATGTTT at 636, TTATGCTT at 493, TTAAGATT at 9.
- Positive strand, negative direction: AATAGTAA at 2501, ATCTTTAA at 1735, TTATCTTT at 1733, AAACCTAA at 1591, ACACTTAA at 1546, ACCTGTAA at 1133, ACCTGTAA at 803, ACACGTAA at 533, ACCTGTAA at 396, AGTGCTAA at 337, TTAAGTGT at 322, AACCTTAA at 318, AGATGTAA at 247, TTAAGAAT at 196, AGGTATAA at 181, ACACCTAA at 64, AATTCTAA at 9.
CadC positive direction (4050-1) distal promoters
- Negative strand, positive direction: TTATGACT at 3029, TTAGGGCT at 2768, TTAAAATT at 2447, AATTTTAA at 2443, TTAAACAT at 2139, ACAGTTAA at 2135.
- Positive strand, positive direction: TTATACCT at 3163, ATCCATAA at 2643, TTATCTTT at 2629, AACCCTAA at 2545, AATTTTAA at 2447, TTAAAATT at 2443, ACAAATAA at 2347, ACCGATAA at 1976, TTACACTT at 231, AAACATAA at 115.
CadC binding domain random dataset samplings
- CadCr0: 24, TTAGGGGT at 4395, TTATCGCT at 4009, TTAATTTT at 3695, TTACGTTT at 3685, TTATCTGT at 3620, TTACTGGT at 3610, TTACTGGT at 2972, TTATCCGT at 2898, TTAAAAAT at 2696, TTATATTT at 2144, TTATGTCT at 2115, TTAGTTGT at 1668, TTAGATAT at 1466, TTAAGAAT at 1190, TTAGGGAT at 761, TTACAGCT at 715, TTATAGTT at 562, TTAGTTAT at 558, TTACCACT at 551, TTATGGCT at 487, TTATAACT at 431, TTATACGT at 397, TTATGCCT at 342, TTAAAACT at 238.
- CadCr1: 22, TTACCCGT at 4417, TTAAGGTT at 4383, TTAGGGCT at 3616, TTAGGATT at 3583, TTACTGAT at 3284, TTAAACGT at 3164, TTATCACT at 3149, TTATTTAT at 3097, TTACGTTT at 3029, TTACCCGT at 2966, TTAATTGT at 2903, TTAAACCT at 2538, TTACCCCT at 2414, TTAGCAGT at 1877, TTACGGCT at 1465, TTAGCTCT at 1089, TTAATTGT at 1009, TTAAACTT at 977, TTAGGCGT at 589, TTATGTGT at 535, TTACACTT at 529, TTAGGGTT at 198.
- CadCr2: 14, TTAGGTGT at 4110, TTACCCAT at 4037, TTAAGGTT at 3933, TTACTGGT at 3490, TTAATCCT at 3232, TTATCCTT at 2856, TTAGCCGT at 2525, TTAAGTGT at 2493, TTAAATTT at 1770, TTAGAAAT at 1332, TTAGGAGT at 383, TTATGGGT at 362, TTAGCTGT at 342, TTAGGATT at 134.
- CadCr3: 24, TTAATTTT at 4350, TTAACGCT at 4329, TTAGTTCT at 4219, TTAAATTT at 4194, TTAATGCT at 4152, TTAACAAT at 4062, TTAACTTT at 3537, TTACTGCT at 3180, TTAAGGAT at 3037, TTAGTCCT at 3028, TTAGTTTT at 3022, TTACTCCT at 2978, TTACCTGT at 2668, TTAACCGT at 2156, TTAACGTT at 1934, TTAAAACT at 1471, TTAGATAT at 941, TTAGTAAT at 917, TTAAAAAT at 537, TTAGTGCT at 465, TTACAGCT at 458, TTAGCCGT at 364, TTAAGGTT at 327, TTAACTAT at 206.
- CadCr4: 24, TTACTTTT at 4524, TTACGCGT at 4441, TTAATGAT at 4378, TTATCCCT at 4305, TTAATATT at 4162, TTAAACCT at 4152, TTAATCTT at 3985, TTATGTTT at 3868, TTACCATT at 3757, TTATATTT at 3722, TTACTAAT at 3670, TTAAACCT at 3375, TTAGTACT at 3214, TTAGATCT at 3133, TTAGGAGT at 2775, TTAAGATT at 2721, TTAGAACT at 1747, TTAGACGT at 1452, TTACGAAT at 1305, TTAGGTCT at 980, TTACATAT at 932, TTAAACAT at 833, TTAGGGGT at 408, TTAAAACT at 174.
- CadCr5: 27, TTAAGGTT at 4459, TTAAGATT at 4089, TTACACGT at 4067, TTAGGGAT at 4060, TTAACACT at 3935, TTAACTTT at 3619, TTACCGGT at 2982, TTAGAGGT at 2960, TTACCAAT at 2635, TTACCCAT at 2514, TTAAAGGT at 2294, TTAGAGCT at 2117, TTAGGCCT at 1799, TTACCATT at 1607, TTATAAAT at 1564, TTAACCAT at 1364, TTATCCCT at 1277, TTAAAGAT at 1141, TTATTCCT at 1115, TTACCGGT at 1070, TTATCGTT at 953, TTATTTAT at 857, TTAATTAT at 853, TTACAGGT at 696, TTATCCAT at 652, TTATGGCT at 244, TTAATGCT at 117.
- CadCr6: 24, TTAAGTAT at 4543, TTATTCCT at 3766, TTATCATT at 3759, TTAGGATT at 3753, TTACCATT at 3688, TTATTCTT at 3473, TTAGCAAT at 3440, TTAACCTT at 3397, TTACCTTT at 3288, TTAGTTAT at 2806, TTACCGGT at 2734, TTAGGTAT at 2555, TTAAATAT at 2237, TTAATTTT at 1655, TTAAGGCT at 1536, TTATGGTT at 1418, TTATGTCT at 1291, TTACTAGT at 488, TTAGTACT at 453, TTAGCCGT at 334, TTACCATT at 328, TTACGGTT at 212, TTAGGTTT at 137, TTAAGATT at 93.
- CadCr7: 15, TTAGGGAT at 4529, TTATAATT at 4085, TTACGGCT at 4016, TTAAAAAT at 3919, TTAACTAT at 3626, TTAAAGGT at 2962, TTAAAAAT at 2512, TTATAATT at 2455, TTATTACT at 2336, TTAAGTAT at 1658, TTATAAGT at 1320, TTAACCCT at 598, TTAAGCGT at 335, TTATGGAT at 313, TTAATGTT at 247.
- CadCr8: 22, TTAGTTTT at 4398, TTAGGAGT at 4366, TTACTATT at 4308, TTAGAAGT at 3917, TTAAAAGT at 3684, TTAAAGAT at 3576, TTAGAATT at 3487, TTAAGCCT at 3179, TTAGTATT at 2965, TTACTTAT at 2566, TTATGGAT at 2088, TTATGAGT at 1964, TTAAAGGT at 1930, TTAGTTTT at 1851, TTACAAGT at 1727, TTACCGCT at 1625, TTATCGTT at 1079, TTAGTACT at 734, TTAACTTT at 611, TTAGGTAT at 360, TTATGGGT at 96, TTATCACT at 81.
- CadCr9: 15, TTAAGAAT at 4411, TTAGACTT at 4060, TTATCGTT at 2897, TTACATGT at 2411, TTACTTGT at 2293, TTATATTT at 2125, TTACCTAT at 1948, TTAGACAT at 1631, TTATGAAT at 1401, TTAACCGT at 1235, TTAAAAAT at 1165, TTACACCT at 921, TTAGTGCT at 908, TTAAGGAT at 238, TTAAAAGT at 33.
- CadCr0ci: 18, AAACCTAA at 3803, AGCCATAA at 3797, AGCTTTAA at 3790, ACCAATAA at 2735, ATGGGTAA at 2607, ATTCTTAA at 2359, AAAAATAA at 2261, AGCAGTAA at 2220, AAAGTTAA at 2082, ACAAATAA at 1790, ATAAATAA at 1497, AATAATAA at 1493, AGGAATAA at 1490, ATAACTAA at 433, ATACGTAA at 399, AAGGGTAA at 321, AATAATAA at 147, ATTTCTAA at 133.
- CadCr1ci: 24, AGCAGTAA at 4316, ACGATTAA at 4043, ACTGGTAA at 4026, AGGGATAA at 3936, ATTTCTAA at 3588, AATGATAA at 3445, ATGCCTAA at 3309, AAACGTAA at 3166, ATCACTAA at 3151, AAGAATAA at 3082, AAATGTAA at 3076, AACATTAA at 2912, AAACCTAA at 2540, AAGGCTAA at 2524, ACAGTTAA at 2500, AGTACTAA at 2374, AGCGCTAA at 1930, AGCAGTAA at 1879, ACGACTAA at 1830, AAGCCTAA at 1240, AAGCCTAA at 1044, AAATTTAA at 1005, ATCCGTAA at 804, ATGTGTAA at 537.
- CadCr2ci: 23, AAGGATAA at 4471, AAAGCTAA at 4370, AGCCGTAA at 4183, AAAAGTAA at 4157, AACTATAA at 3822, AATTGTAA at 3446, AGAATTAA at 3031, AGCTGTAA at 2946, AGCATTAA at 2911, AGCCATAA at 2582, AATCCTAA at 2515, AAATTTAA at 2132, AGAGGTAA at 2066, AGAGCTAA at 1957, AAAACTAA at 1853, ATTCGTAA at 1078, ACACTTAA at 903, ATCTATAA at 517, AGGAGTAA at 385, AGCTGTAA at 344, AACGATAA at 307, ACATGTAA at 275, AAGTATAA at 239.
- CadCr3ci: 19, AACGTTAA at 4451, ACCTATAA at 4443, ACTCTTAA at 4427, AAAATTAA at 4346, AGCACTAA at 4090, AGTTCTAA at 3990, ACTGATAA at 3901, AGGGATAA at 3865, AGGAATAA at 3672, AGGAGTAA at 3601, AGCAATAA at 3445, ATCTATAA at 3273, AACTGTAA at 2399, ACGCGTAA at 2213, AGAGGTAA at 2003, AACGTTAA at 1936, AAGGGTAA at 1812, AGCGATAA at 1288, ACTATTAA at 209.
- CadCr4ci: 20, AATATTAA at 4164, ACCGTTAA at 4050, ATATTTAA at 3724, ACCTATAA at 3684, AATCCTAA at 3634, ATCGGTAA at 3588, AGTACTAA at 3216, AGATCTAA at 3135, ATCACTAA at 2813, AATCTTAA at 2449, ACGTCTAA at 2317, ATTAGTAA at 2301, ACCAGTAA at 2189, AAGAATAA at 1695, AGCTATAA at 1436, ATGTATAA at 774, ATGTATAA at 732, ATGTTTAA at 704, AGCAATAA at 493, AGACCTAA at 262.
- CadCr5ci: 13, AGCGTTAA at 3615, AACTATAA at 3517, ACCCATAA at 3212, ACCCATAA at 2516, AACCTTAA at 2481, AGGTGTAA at 1960, AATTATAA at 1562, AGCCTTAA at 1252, AAGCGTAA at 1242, ATTTGTAA at 1207, AAAGATAA at 1143, ATGCGTAA at 1061, ATGGCTAA at 246.
- CadCr6ci: 19, ACGAATAA at 4134, ATGCTTAA at 4095, AGTGGTAA at 3976, AACCATAA at 3150, ATGGTTAA at 3109, AGTTTTAA at 2779, ATGGCTAA at 2561, AACAGTAA at 2381, AATACTAA at 2375, AAATATAA at 2239, AATTTTAA at 2233, AATTTTAA at 1651, AGCTATAA at 1131, ACATCTAA at 812, AGGTGTAA at 413, AACTCTAA at 238, AAAGCTAA at 231, AGGTTTAA at 139, ACCATTAA at 32.
- CadCr7ci: 22, AGTTATAA at 4083, AAAGGTAA at 4071, ATGCCTAA at 3736, ATAAATAA at 3632, AACTATAA at 3628, AGATATAA at 3056, ACCAATAA at 2926, ATCCATAA at 2848, ATGGATAA at 2720, AGAGGTAA at 2627, AGTGGTAA at 2545, ATACTTAA at 2508, ATTCTTAA at 1949, ATTATTAA at 1654, AGCTTTAA at 1556, AAATATAA at 1413, ACCAGTAA at 1109, ACTCGTAA at 1055, ATGGGTAA at 641, AGCTCTAA at 291, ACCCCTAA at 186, ACTTCTAA at 80.
- CadCr8ci: 18, AGATATAA at 3982, ATGAATAA at 3888, ATCACTAA at 3838, AGCGCTAA at 3626, AAAGATAA at 3578, AGTTTTAA at 3175, AAAGTTAA at 2662, AATTGTAA at 2500, AGTGTTAA at 2267, ATGAGTAA at 1966, AGGTCTAA at 1519, ATACTTAA at 1496, AAAGCTAA at 1222, ACATTTAA at 1044, ACCGCTAA at 997, AATACTAA at 851, ACCCATAA at 781, AATCCTAA at 394.
- CadCr9ci: 18, ATCGTTAA at 4407, AATCTTAA at 4298, AACTTTAA at 3956, ATGACTAA at 3706, AAGAATAA at 2847, AAAGTTAA at 2496, AATGATAA at 2466, ACATGTAA at 2413, AAAACTAA at 2391, AAGAGTAA at 2381, ACATGTAA at 1761, AGTTTTAA at 1249, ATAATTAA at 1231, ACTTCTAA at 1202, ATTATTAA at 1161, ACTTCTAA at 1016, AGGATTAA at 241, AGCGCTAA at 201.
CadCr arbitrary (evens) (4560-2846) UTRs
- CadCr0: TTAGGGGT at 4395, TTATCGCT at 4009, TTAATTTT at 3695, TTACGTTT at 3685, TTATCTGT at 3620, TTACTGGT at 3610, TTACTGGT at 2972, TTATCCGT at 2898.
- CadCr2: TTAGGTGT at 4110, TTACCCAT at 4037, TTAAGGTT at 3933, TTACTGGT at 3490, TTAATCCT at 3232, TTATCCTT at 2856.
- CadCr4: TTACTTTT at 4524, TTACGCGT at 4441, TTAATGAT at 4378, TTATCCCT at 4305, TTAATATT at 4162, TTAAACCT at 4152, TTAATCTT at 3985, TTATGTTT at 3868, TTACCATT at 3757, TTATATTT at 3722, TTACTAAT at 3670, TTAAACCT at 3375, TTAGTACT at 3214, TTAGATCT at 3133.
- CadCr6: TTAAGTAT at 4543, TTATTCCT at 3766, TTATCATT at 3759, TTAGGATT at 3753, TTACCATT at 3688, TTATTCTT at 3473, TTAGCAAT at 3440, TTAACCTT at 3397, TTACCTTT at 3288.
- CadCr8: TTAGTTTT at 4398, TTAGGAGT at 4366, TTACTATT at 4308, TTAGAAGT at 3917, TTAAAAGT at 3684, TTAAAGAT at 3576, TTAGAATT at 3487, TTAAGCCT at 3179, TTAGTATT at 2965.
- CadCr0ci: AAACCTAA at 3803, AGCCATAA at 3797, AGCTTTAA at 3790.
- CadCr2ci: AAGGATAA at 4471, AAAGCTAA at 4370, AGCCGTAA at 4183, AAAAGTAA at 4157, AACTATAA at 3822, AATTGTAA at 3446, AGAATTAA at 3031, AGCTGTAA at 2946, AGCATTAA at 2911.
- CadCr4ci: AATATTAA at 4164, ACCGTTAA at 4050, ATATTTAA at 3724, ACCTATAA at 3684, AATCCTAA at 3634, ATCGGTAA at 3588, AGTACTAA at 3216, AGATCTAA at 3135.
- CadCr6ci: ACGAATAA at 4134, ATGCTTAA at 4095, AGTGGTAA at 3976, AACCATAA at 3150, ATGGTTAA at 3109.
- CadCr8ci: AGATATAA at 3982, ATGAATAA at 3888, ATCACTAA at 3838, AGCGCTAA at 3626, AAAGATAA at 3578, AGTTTTAA at 3175.
CadCr alternate (odds) (4560-2846) UTRs
- CadCr1: TTACCCGT at 4417, TTAAGGTT at 4383, TTAGGGCT at 3616, TTAGGATT at 3583, TTACTGAT at 3284, TTAAACGT at 3164, TTATCACT at 3149, TTATTTAT at 3097, TTACGTTT at 3029, TTACCCGT at 2966, TTAATTGT at 2903.
- CadCr3: TTAATTTT at 4350, TTAACGCT at 4329, TTAGTTCT at 4219, TTAAATTT at 4194, TTAATGCT at 4152, TTAACAAT at 4062, TTAACTTT at 3537, TTACTGCT at 3180, TTAAGGAT at 3037, TTAGTCCT at 3028, TTAGTTTT at 3022, TTACTCCT at 2978.
- CadCr5: TTAAGGTT at 4459, TTAAGATT at 4089, TTACACGT at 4067, TTAGGGAT at 4060, TTAACACT at 3935, TTAACTTT at 3619, TTACCGGT at 2982, TTAGAGGT at 2960.
- CadCr7: TTAGGGAT at 4529, TTATAATT at 4085, TTACGGCT at 4016, TTAAAAAT at 3919, TTAACTAT at 3626, TTAAAGGT at 2962.
- CadCr9: TTAAGAAT at 4411, TTAGACTT at 4060, TTATCGTT at 2897.
- CadCr1ci: AGCAGTAA at 4316, ACGATTAA at 4043, ACTGGTAA at 4026, AGGGATAA at 3936, ATTTCTAA at 3588, AATGATAA at 3445, ATGCCTAA at 3309, AAACGTAA at 3166, ATCACTAA at 3151, AAGAATAA at 3082, AAATGTAA at 3076, AACATTAA at 2912.
- CadCr3ci: AACGTTAA at 4451, ACCTATAA at 4443, ACTCTTAA at 4427, AAAATTAA at 4346, AGCACTAA at 4090, AGTTCTAA at 3990, ACTGATAA at 3901, AGGGATAA at 3865, AGGAATAA at 3672, AGGAGTAA at 3601, AGCAATAA at 3445, ATCTATAA at 3273.
- CadCr5ci: AGCGTTAA at 3615, AACTATAA at 3517, ACCCATAA at 3212.
- CadCr7ci: AGTTATAA at 4083, AAAGGTAA at 4071, ATGCCTAA at 3736, ATAAATAA at 3632, AACTATAA at 3628, AGATATAA at 3056, ACCAATAA at 2926, ATCCATAA at 2848.
- CadCr9ci: ATCGTTAA at 4407, AATCTTAA at 4298, AACTTTAA at 3956, ATGACTAA at 3706, AAGAATAA at 2847.
CadCr arbitrary negative direction (evens) (2846-2811) core promoters
- CadCr4ci: ATCACTAA at 2813.
CadCr arbitrary positive direction (odds) (4445-4265) core promoters
- CadCr1: TTACCCGT at 4417, TTAAGGTT at 4383.
- CadCr3: TTAATTTT at 4350, TTAACGCT at 4329.
- CadCr9: TTAAGAAT at 4411.
- CadCr1ci: AGCAGTAA at 4316.
- CadCr3ci: ACCTATAA at 4443, ACTCTTAA at 4427, AAAATTAA at 4346.
- CadCr9ci: ATCGTTAA at 4407, AATCTTAA at 4298.
CadCr alternate positive direction (evens) (4445-4265) core promoters
- CadCr0: TTAGGGGT at 4395.
- CadCr4: TTACGCGT at 4441, TTAATGAT at 4378, TTATCCCT at 4305.
- CadCr8: TTAGTTTT at 4398, TTAGGAGT at 4366, TTACTATT at 4308.
- CadCr2ci: AAAGCTAA at 4370.
CadCr arbitrary negative direction (evens) (2811-2596) proximal promoters
- CadCr0: TTAAAAAT at 2696.
- CadCr4: TTAGGAGT at 2775, TTAAGATT at 2721.
- CadCr6: TTAGTTAT at 2806, TTACCGGT at 2734.
- CadCr0ci: ACCAATAA at 2735, ATGGGTAA at 2607.
- CadCr6ci: AGTTTTAA at 2779.
- CadCr8ci: AAAGTTAA at 2662.
CadCr alternate negative direction (odds) (2811-2596) proximal promoters
- CadCr3: TTACCTGT at 2668.
- CadCr5: TTACCAAT at 2635.
- CadCr7ci:ATGGATAA at 2720, AGAGGTAA at 2627.
CadCr arbitrary positive direction (odds) (4265-4050) proximal promoters
- CadCr3: TTAGTTCT at 4219, TTAAATTT at 4194, TTAATGCT at 4152, TTAACAAT at 4062.
- CadCr5: TTAAGATT at 4089, TTACACGT at 4067, TTAGGGAT at 4060.
- CadCr7: TTATAATT at 4085.
- CadCr9: TTAGACTT at 4060.
- CadCr3ci: AGCACTAA at 4090.
- CadCr7ci: AGTTATAA at 4083, AAAGGTAA at 4071.
CadCr alternate positive direction (evens) (4265-4050) proximal promoters
- CadCr2: TTAGGTGT at 4110.
- CadCr4: TTAATATT at 4162, TTAAACCT at 4152.
- CadCr2ci: AGCCGTAA at 4183, AAAAGTAA at 4157.
- CadCr4ci: AATATTAA at 4164, ACCGTTAA at 4050.
- CadCr6ci: ACGAATAA at 4134, ATGCTTAA at 4095.
CadCr arbitrary negative direction (evens) (2596-1) distal promoters
- CadCr0: TTATATTT at 2144, TTATGTCT at 2115, TTAGTTGT at 1668, TTAGATAT at 1466, TTAAGAAT at 1190, TTAGGGAT at 761, TTACAGCT at 715, TTATAGTT at 562, TTAGTTAT at 558, TTACCACT at 551, TTATGGCT at 487, TTATAACT at 431, TTATACGT at 397, TTATGCCT at 342, TTAAAACT at 238.
- CadCr2: TTAGCCGT at 2525, TTAAGTGT at 2493, TTAAATTT at 1770, TTAGAAAT at 1332, TTAGGAGT at 383, TTATGGGT at 362, TTAGCTGT at 342, TTAGGATT at 134.
- CadCr4: TTAGAACT at 1747, TTAGACGT at 1452, TTACGAAT at 1305, TTAGGTCT at 980, TTACATAT at 932, TTAAACAT at 833, TTAGGGGT at 408, TTAAAACT at 174.
- CadCr6: TTAGGTAT at 2555, TTAAATAT at 2237, TTAATTTT at 1655, TTAAGGCT at 1536, TTATGGTT at 1418, TTATGTCT at 1291, TTACTAGT at 488, TTAGTACT at 453, TTAGCCGT at 334, TTACCATT at 328, TTACGGTT at 212, TTAGGTTT at 137, TTAAGATT at 93.
- CadCr8: TTACTTAT at 2566, TTATGGAT at 2088, TTATGAGT at 1964, TTAAAGGT at 1930, TTAGTTTT at 1851, TTACAAGT at 1727, TTACCGCT at 1625, TTATCGTT at 1079, TTAGTACT at 734, TTAACTTT at 611, TTAGGTAT at 360, TTATGGGT at 96, TTATCACT at 81.
- CadCr0ci: ATTCTTAA at 2359, AAAAATAA at 2261, AGCAGTAA at 2220, AAAGTTAA at 2082, ACAAATAA at 1790, ATAAATAA at 1497, AATAATAA at 1493, AGGAATAA at 1490, ATAACTAA at 433, ATACGTAA at 399, AAGGGTAA at 321, AATAATAA at 147, ATTTCTAA at 133.
- CadCr2ci: AGCCATAA at 2582, AATCCTAA at 2515, AAATTTAA at 2132, AGAGGTAA at 2066, AGAGCTAA at 1957, AAAACTAA at 1853, ATTCGTAA at 1078, ACACTTAA at 903, ATCTATAA at 517, AGGAGTAA at 385, AGCTGTAA at 344, AACGATAA at 307, ACATGTAA at 275, AAGTATAA at 239.
- CadCr4ci: AATCTTAA at 2449, ACGTCTAA at 2317, ATTAGTAA at 2301, ACCAGTAA at 2189, AAGAATAA at 1695, AGCTATAA at 1436, ATGTATAA at 774, ATGTATAA at 732, ATGTTTAA at 704, AGCAATAA at 493, AGACCTAA at 262.
- CadCr6ci: ATGGCTAA at 2561, AACAGTAA at 2381, AATACTAA at 2375, AAATATAA at 2239, AATTTTAA at 2233, AATTTTAA at 1651, AGCTATAA at 1131, ACATCTAA at 812, AGGTGTAA at 413, AACTCTAA at 238, AAAGCTAA at 231, AGGTTTAA at 139, ACCATTAA at 32.
- CadCr8ci: AATTGTAA at 2500, AGTGTTAA at 2267, ATGAGTAA at 1966, AGGTCTAA at 1519, ATACTTAA at 1496, AAAGCTAA at 1222, ACATTTAA at 1044, ACCGCTAA at 997, AATACTAA at 851, ACCCATAA at 781, AATCCTAA at 394.
CadCr alternate negative direction (odds) (2596-1) distal promoters
- CadCr1: TTAAACCT at 2538, TTACCCCT at 2414, TTAGCAGT at 1877, TTACGGCT at 1465, TTAGCTCT at 1089, TTAATTGT at 1009, TTAAACTT at 977, TTAGGCGT at 589, TTATGTGT at 535, TTACACTT at 529, TTAGGGTT at 198.
- CadCr3: TTAACCGT at 2156, TTAACGTT at 1934, TTAAAACT at 1471, TTAGATAT at 941, TTAGTAAT at 917, TTAAAAAT at 537, TTAGTGCT at 465, TTACAGCT at 458, TTAGCCGT at 364, TTAAGGTT at 327, TTAACTAT at 206.
- CadCr5: TTACCCAT at 2514, TTAAAGGT at 2294, TTAGAGCT at 2117, TTAGGCCT at 1799, TTACCATT at 1607, TTATAAAT at 1564, TTAACCAT at 1364, TTATCCCT at 1277, TTAAAGAT at 1141, TTATTCCT at 1115, TTACCGGT at 1070, TTATCGTT at 953, TTATTTAT at 857, TTAATTAT at 853, TTACAGGT at 696, TTATCCAT at 652, TTATGGCT at 244, TTAATGCT at 117.
- CadCr7: TTAAAAAT at 2512, TTATAATT at 2455, TTATTACT at 2336, TTAAGTAT at 1658, TTATAAGT at 1320, TTAACCCT at 598, TTAAGCGT at 335, TTATGGAT at 313, TTAATGTT at 247.
- CadCr9: TTACATGT at 2411, TTACTTGT at 2293, TTATATTT at 2125, TTACCTAT at 1948, TTAGACAT at 1631, TTATGAAT at 1401, TTAACCGT at 1235, TTAAAAAT at 1165, TTACACCT at 921, TTAGTGCT at 908, TTAAGGAT at 238, TTAAAAGT at 33.
- CadCr1ci: AAACCTAA at 2540, AAGGCTAA at 2524, ACAGTTAA at 2500, AGTACTAA at 2374, AGCGCTAA at 1930, AGCAGTAA at 1879, ACGACTAA at 1830, AAGCCTAA at 1240, AAGCCTAA at 1044, AAATTTAA at 1005, ATCCGTAA at 804, ATGTGTAA at 537.
- CadCr3ci: AACTGTAA at 2399, ACGCGTAA at 2213, AGAGGTAA at 2003, AACGTTAA at 1936, AAGGGTAA at 1812, AGCGATAA at 1288, ACTATTAA at 209.
- CadCr5ci: ACCCATAA at 2516, AACCTTAA at 2481, AGGTGTAA at 1960, AATTATAA at 1562, AGCCTTAA at 1252, AAGCGTAA at 1242, ATTTGTAA at 1207, AAAGATAA at 1143, ATGCGTAA at 1061, ATGGCTAA at 246.
- CadCr7ci: AGTGGTAA at 2545, ATACTTAA at 2508, ATTCTTAA at 1949, ATTATTAA at 1654, AGCTTTAA at 1556, AAATATAA at 1413, ACCAGTAA at 1109, ACTCGTAA at 1055, ATGGGTAA at 641, AGCTCTAA at 291, ACCCCTAA at 186, ACTTCTAA at 80.
- CadCr9ci: AAAGTTAA at 2496, AATGATAA at 2466, ACATGTAA at 2413, AAAACTAA at 2391, AAGAGTAA at 2381, ACATGTAA at 1761, AGTTTTAA at 1249, ATAATTAA at 1231, ACTTCTAA at 1202, ATTATTAA at 1161, ACTTCTAA at 1016, AGGATTAA at 241, AGCGCTAA at 201.
CadCr arbitrary positive direction (odds) (4050-1) distal promoters
- CadCr1: TTAGGGCT at 3616, TTAGGATT at 3583, TTACTGAT at 3284, TTAAACGT at 3164, TTATCACT at 3149, TTATTTAT at 3097, TTACGTTT at 3029, TTACCCGT at 2966, TTAATTGT at 2903, TTAAACCT at 2538, TTACCCCT at 2414, TTAGCAGT at 1877, TTACGGCT at 1465, TTAGCTCT at 1089, TTAATTGT at 1009, TTAAACTT at 977, TTAGGCGT at 589, TTATGTGT at 535, TTACACTT at 529, TTAGGGTT at 198.
- CadCr3: TTAACTTT at 3537, TTACTGCT at 3180, TTAAGGAT at 3037, TTAGTCCT at 3028, TTAGTTTT at 3022, TTACTCCT at 2978, TTACCTGT at 2668, TTAACCGT at 2156, TTAACGTT at 1934, TTAAAACT at 1471, TTAGATAT at 941, TTAGTAAT at 917, TTAAAAAT at 537, TTAGTGCT at 465, TTACAGCT at 458, TTAGCCGT at 364, TTAAGGTT at 327, TTAACTAT at 206.
- CadCr5: TTAACACT at 3935, TTAACTTT at 3619, TTACCGGT at 2982, TTAGAGGT at 2960, TTACCAAT at 2635, TTACCCAT at 2514, TTAAAGGT at 2294, TTAGAGCT at 2117, TTAGGCCT at 1799, TTACCATT at 1607, TTATAAAT at 1564, TTAACCAT at 1364, TTATCCCT at 1277, TTAAAGAT at 1141, TTATTCCT at 1115, TTACCGGT at 1070, TTATCGTT at 953, TTATTTAT at 857, TTAATTAT at 853, TTACAGGT at 696, TTATCCAT at 652, TTATGGCT at 244, TTAATGCT at 117.
- CadCr7: TTACGGCT at 4016, TTAAAAAT at 3919, TTAACTAT at 3626, TTAAAGGT at 2962, TTAAAAAT at 2512, TTATAATT at 2455, TTATTACT at 2336, TTAAGTAT at 1658, TTATAAGT at 1320, TTAACCCT at 598, TTAAGCGT at 335, TTATGGAT at 313, TTAATGTT at 247.
- CadCr9: TTATCGTT at 2897, TTACATGT at 2411, TTACTTGT at 2293, TTATATTT at 2125, TTACCTAT at 1948, TTAGACAT at 1631, TTATGAAT at 1401, TTAACCGT at 1235, TTAAAAAT at 1165, TTACACCT at 921, TTAGTGCT at 908, TTAAGGAT at 238, TTAAAAGT at 33.
- CadCr1ci: ACGATTAA at 4043, ACTGGTAA at 4026, AGGGATAA at 3936, ATTTCTAA at 3588, AATGATAA at 3445, ATGCCTAA at 3309, AAACGTAA at 3166, ATCACTAA at 3151, AAGAATAA at 3082, AAATGTAA at 3076, AACATTAA at 2912, AAACCTAA at 2540, AAGGCTAA at 2524, ACAGTTAA at 2500, AGTACTAA at 2374, AGCGCTAA at 1930, AGCAGTAA at 1879, ACGACTAA at 1830, AAGCCTAA at 1240, AAGCCTAA at 1044, AAATTTAA at 1005, ATCCGTAA at 804, ATGTGTAA at 537.
- CadCr3ci: AGTTCTAA at 3990, ACTGATAA at 3901, AGGGATAA at 3865, AGGAATAA at 3672, AGGAGTAA at 3601, AGCAATAA at 3445, ATCTATAA at 3273, AACTGTAA at 2399, ACGCGTAA at 2213, AGAGGTAA at 2003, AACGTTAA at 1936, AAGGGTAA at 1812, AGCGATAA at 1288, ACTATTAA at 209.
- CadCr5ci: AGCGTTAA at 3615, AACTATAA at 3517, ACCCATAA at 3212, ACCCATAA at 2516, AACCTTAA at 2481, AGGTGTAA at 1960, AATTATAA at 1562, AGCCTTAA at 1252, AAGCGTAA at 1242, ATTTGTAA at 1207, AAAGATAA at 1143, ATGCGTAA at 1061, ATGGCTAA at 246.
- CadCr7ci: ATGCCTAA at 3736, ATAAATAA at 3632, AACTATAA at 3628, AGATATAA at 3056, ACCAATAA at 2926, ATCCATAA at 2848, ATGGATAA at 2720, AGAGGTAA at 2627, AGTGGTAA at 2545, ATACTTAA at 2508, ATTCTTAA at 1949, ATTATTAA at 1654, AGCTTTAA at 1556, AAATATAA at 1413, ACCAGTAA at 1109, ACTCGTAA at 1055, ATGGGTAA at 641, AGCTCTAA at 291, ACCCCTAA at 186, ACTTCTAA at 80.
- CadCr9ci: AACTTTAA at 3956, ATGACTAA at 3706, AAGAATAA at 2847, AAAGTTAA at 2496, AATGATAA at 2466, ACATGTAA at 2413, AAAACTAA at 2391, AAGAGTAA at 2381, ACATGTAA at 1761, AGTTTTAA at 1249, ATAATTAA at 1231, ACTTCTAA at 1202, ATTATTAA at 1161, ACTTCTAA at 1016, AGGATTAA at 241, AGCGCTAA at 201.
CadCr alternate positive direction (evens) (4050-1) distal promoters
- CadCr0: TTATCGCT at 4009, TTAATTTT at 3695, TTACGTTT at 3685, TTATCTGT at 3620, TTACTGGT at 3610, TTACTGGT at 2972, TTATCCGT at 2898, TTAAAAAT at 2696, TTATATTT at 2144, TTATGTCT at 2115, TTAGTTGT at 1668, TTAGATAT at 1466, TTAAGAAT at 1190, TTAGGGAT at 761, TTACAGCT at 715, TTATAGTT at 562, TTAGTTAT at 558, TTACCACT at 551, TTATGGCT at 487, TTATAACT at 431, TTATACGT at 397, TTATGCCT at 342, TTAAAACT at 238.
- CadCr2: TTACCCAT at 4037, TTAAGGTT at 3933, TTACTGGT at 3490, TTAATCCT at 3232, TTATCCTT at 2856, TTAGCCGT at 2525, TTAAGTGT at 2493, TTAAATTT at 1770, TTAGAAAT at 1332, TTAGGAGT at 383, TTATGGGT at 362, TTAGCTGT at 342, TTAGGATT at 134.
- CadCr4: TTAATCTT at 3985, TTATGTTT at 3868, TTACCATT at 3757, TTATATTT at 3722, TTACTAAT at 3670, TTAAACCT at 3375, TTAGTACT at 3214, TTAGATCT at 3133, TTAGGAGT at 2775, TTAAGATT at 2721, TTAGAACT at 1747, TTAGACGT at 1452, TTACGAAT at 1305, TTAGGTCT at 980, TTACATAT at 932, TTAAACAT at 833, TTAGGGGT at 408, TTAAAACT at 174.
- CadCr6: TTATTCCT at 3766, TTATCATT at 3759, TTAGGATT at 3753, TTACCATT at 3688, TTATTCTT at 3473, TTAGCAAT at 3440, TTAACCTT at 3397, TTACCTTT at 3288, TTAGTTAT at 2806, TTACCGGT at 2734, TTAGGTAT at 2555, TTAAATAT at 2237, TTAATTTT at 1655, TTAAGGCT at 1536, TTATGGTT at 1418, TTATGTCT at 1291, TTACTAGT at 488, TTAGTACT at 453, TTAGCCGT at 334, TTACCATT at 328, TTACGGTT at 212, TTAGGTTT at 137, TTAAGATT at 93.
- CadCr8: TTAGAAGT at 3917, TTAAAAGT at 3684, TTAAAGAT at 3576, TTAGAATT at 3487, TTAAGCCT at 3179, TTAGTATT at 2965, TTACTTAT at 2566, TTATGGAT at 2088, TTATGAGT at 1964, TTAAAGGT at 1930, TTAGTTTT at 1851, TTACAAGT at 1727, TTACCGCT at 1625, TTATCGTT at 1079, TTAGTACT at 734, TTAACTTT at 611, TTAGGTAT at 360, TTATGGGT at 96, TTATCACT at 81.
- CadCr0ci: AAACCTAA at 3803, AGCCATAA at 3797, AGCTTTAA at 3790, ACCAATAA at 2735, ATGGGTAA at 2607, ATTCTTAA at 2359, AAAAATAA at 2261, AGCAGTAA at 2220, AAAGTTAA at 2082, ACAAATAA at 1790, ATAAATAA at 1497, AATAATAA at 1493, AGGAATAA at 1490, ATAACTAA at 433, ATACGTAA at 399, AAGGGTAA at 321, AATAATAA at 147, ATTTCTAA at 133.
- CadCr2ci: AACTATAA at 3822, AATTGTAA at 3446, AGAATTAA at 3031, AGCTGTAA at 2946, AGCATTAA at 2911, AGCCATAA at 2582, AATCCTAA at 2515, AAATTTAA at 2132, AGAGGTAA at 2066, AGAGCTAA at 1957, AAAACTAA at 1853, ATTCGTAA at 1078, ACACTTAA at 903, ATCTATAA at 517, AGGAGTAA at 385, AGCTGTAA at 344, AACGATAA at 307, ACATGTAA at 275, AAGTATAA at 239.
- CadCr4ci: ACCGTTAA at 4050, ATATTTAA at 3724, ACCTATAA at 3684, AATCCTAA at 3634, ATCGGTAA at 3588, AGTACTAA at 3216, AGATCTAA at 3135, ATCACTAA at 2813, AATCTTAA at 2449, ACGTCTAA at 2317, ATTAGTAA at 2301, ACCAGTAA at 2189, AAGAATAA at 1695, AGCTATAA at 1436, ATGTATAA at 774, ATGTATAA at 732, ATGTTTAA at 704, AGCAATAA at 493, AGACCTAA at 262.
- CadCr6ci: AGTGGTAA at 3976, AACCATAA at 3150, ATGGTTAA at 3109, AGTTTTAA at 2779, ATGGCTAA at 2561, AACAGTAA at 2381, AATACTAA at 2375, AAATATAA at 2239, AATTTTAA at 2233, AATTTTAA at 1651, AGCTATAA at 1131, ACATCTAA at 812, AGGTGTAA at 413, AACTCTAA at 238, AAAGCTAA at 231, AGGTTTAA at 139, ACCATTAA at 32.
- CadCr8ci: AGATATAA at 3982, ATGAATAA at 3888, ATCACTAA at 3838, AGCGCTAA at 3626, AAAGATAA at 3578, AGTTTTAA at 3175, AAAGTTAA at 2662, AATTGTAA at 2500, AGTGTTAA at 2267, ATGAGTAA at 1966, AGGTCTAA at 1519, ATACTTAA at 1496, AAAGCTAA at 1222, ACATTTAA at 1044, ACCGCTAA at 997, AATACTAA at 851, ACCCATAA at 781, AATCCTAA at 394.
CadC binding domain analysis and results
"Altogether, the specific contacts observed suggest a consensus binding motif of 5′-T-T-A-x-x-x-x-T-3′."[1]
Reals or randoms | Promoters | direction | Numbers | Strands | Occurrences | Averages (± 0.1) |
---|---|---|---|---|---|---|
Reals | UTR | negative | 36 | 2 | 18 | 18 |
Randoms | UTR | arbitrary negative | 77 | 10 | 7.7 | 7.85 |
Randoms | UTR | alternate negative | 80 | 10 | 8.0 | 7.85 |
Reals | Core | negative | 0 | 2 | 0 | 0 |
Randoms | Core | arbitrary negative | 1 | 10 | 0.1 | 0.05 |
Randoms | Core | alternate negative | 0 | 10 | 0 | 0 |
Reals | Core | positive | 0 | 2 | 0 | 0 |
Randoms | Core | arbitrary positive | 11 | 10 | 1.1 | 0.95 |
Randoms | Core | alternate positive | 8 | 10 | 0.8 | 0.95 |
Reals | Proximal | negative | 1 | 2 | 0.5 | 0.5 |
Randoms | Proximal | arbitrary negative | 9 | 10 | 0.9 | 0.65 |
Randoms | Proximal | alternate negative | 4 | 10 | 0.4 | 0.65 |
Reals | Proximal | positive | 10 | 2 | 5 | 5 (-+5,++5) |
Randoms | Proximal | arbitrary positive | 12 | 10 | 1.2 | 1.5 |
Randoms | Proximal | alternate positive | 9 | 10 | 0.9 | 1.05 |
Reals | Distal | negative | 31 | 2 | 15.5 | 15.5 ± 1.5 (--14,+-17) |
Randoms | Distal | arbitrary negative | 119 | 10 | 11.9 | 11.7 |
Randoms | Distal | alternate negative | 115 | 10 | 11.5 | 11.7 |
Reals | Distal | positive | 16 | 2 | 8 | 8 ± 2 (-+6,++10) |
Randoms | Distal | arbitrary positive | 173 | 10 | 17.3 | 18.0 |
Randoms | Distal | alternate positive | 187 | 10 | 18.7 | 18.0 |
Comparison:
The occurrences of real CadC UTRs, positive direction proximals are greater than the randoms, the negative direction proximal are within the randoms, and the distals are outside the randoms. This suggests that the real CadCs are likely active or activable.
Acknowledgements
The content on this page was first contributed by: Henry A. Hoff.
See also
References
- ↑ 1.0 1.1 1.2 1.3 Andreas Schlundt, Sophie Buchner, Robert Janowski, Thomas Heydenreich, Ralf Heermann, Jürgen Lassak, Arie Geerlof, Ralf Stehle, Dierk Niessing, Kirsten Jung & Michael Sattler (21 April 2017). "Structure-function analysis of the DNA-binding domain of a transmembrane transcriptional activator". Scientific Reports. 7: 1051. doi:10.1038/s41598-017-01031-9. PMID 28432336. Retrieved 28 August 2020.