KIAA1797 is a protein-coding gene in Homo sapiens. Alternate names for the gene are FLJ20375, OTTHUMP00000069845, and hypothetical protein LOC54914.[1] The gene lies on the plus strand of chromosome 9 at q21.3,[3] and the entire gene, including introns and exons, spans 375,010 base pairs. There are 19 alternative splice variants; the longest yields an mRNA of 6117 base pairs.
Expression
KIAA1797 is expressed ubiquitously, at varying levels, throughout the human body. Based on the UniGene EST profile, KIAA1797 expression has been observed in tissues ranging from reproductive to secretory.[4]
The sequences predicted to form secondary structures in the 5' UTR and the 3' UTR of the mRNA are “ugagaugaacucgguaucuca” and “uccuaagagaggag”, respectively. Other possible secondary structures are shown in the table below.
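As a rough illustration of why the 5' UTR sequence can fold back on itself, the sketch below pairs the two ends of a sequence inward and counts consecutive Watson-Crick pairs. The function name `terminal_stem` and the `min_loop` parameter are hypothetical, chosen for this example; the check ignores G-U wobble pairs, bulges, and folding energies, so it is not a substitute for a structure-prediction tool.

```python
# Canonical Watson-Crick RNA base pairs only (G-U wobble pairs are ignored).
WATSON_CRICK = {("a", "u"), ("u", "a"), ("g", "c"), ("c", "g")}


def terminal_stem(seq, min_loop=3):
    """Pair the 5' end against the 3' end, moving inward.

    Returns (stem_length, loop_length), where stem_length is the number of
    consecutive Watson-Crick pairs starting from the two ends and loop_length
    is the number of bases left unpaired in the middle.
    """
    seq = seq.lower()
    i, j = 0, len(seq) - 1
    pairs = 0
    # Stop when the bases do not pair or the loop would become too short.
    while j - i - 1 >= min_loop and (seq[i], seq[j]) in WATSON_CRICK:
        pairs += 1
        i += 1
        j -= 1
    return pairs, j - i + 1


# 5' UTR sequence quoted in the text above.
stem, loop = terminal_stem("ugagaugaacucgguaucuca")
print(stem, loop)  # 6 9
```

For the 5' UTR sequence, the first six bases (ugagau) are the reverse complement of the last six (aucuca), leaving a nine-base loop, which is consistent with a simple hairpin.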
Protein sequence
The main isoform of the human protein is 1801 amino acids long, with a molecular mass of 200,072 Da.[3]
Two distinct domains of unknown function (DUFs) are found in the sequence. DUF3730 (amino acids 465-682) appears two times in the sequence; this domain family is found in eukaryotes and is typically between 220 and 262 amino acids in length. DUF3028 spans amino acids 1213-1801; no additional information was provided regarding this DUF.
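The figures above can be cross-checked with simple arithmetic. The sketch below uses only the numbers quoted in this section; the variable and function names are illustrative, and the coordinates are assumed to be 1-based and inclusive.

```python
def span_length(start, end):
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


protein_length = 1801   # amino acids, main isoform
mass_da = 200_072       # molecular mass in daltons

# Domain coordinates as quoted in the text (assumed 1-based, inclusive).
domains = {"DUF3730": (465, 682), "DUF3028": (1213, 1801)}

lengths = {name: span_length(*coords) for name, coords in domains.items()}
mean_residue_mass = mass_da / protein_length

print(lengths)                      # {'DUF3730': 218, 'DUF3028': 589}
print(round(mean_residue_mass, 2))  # 111.09
```

The quoted DUF3730 span works out to 218 residues, just under the 220 to 262 range given as typical for the family, and the mean mass of about 111 Da per residue is close to the commonly used approximation of roughly 110 Da per amino acid.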
KIAA1797 is well conserved in mammals. It is also found in non-mammalian species, but with lower sequence identity.
Gene neighborhood
KIAA1797 is downstream of MLLT3 and upstream of PTPLAD2. MLLT3 is involved with myeloid/lymphoid or mixed-lineage leukemia. PTPLAD2 is a protein tyrosine phosphatase.
Maruyama K, Sugano S (1994). "Oligo-capping: a simple method to replace the cap structure of eukaryotic mRNAs with oligoribonucleotides". Gene. 138 (1–2): 171–4. doi:10.1016/0378-1119(94)90802-8. PMID 8125298.
Nagase T, Nakayama M, Nakajima D, et al. (2001). "Prediction of the coding sequences of unidentified human genes. XX. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro". DNA Res. 8 (2): 85–95. doi:10.1093/dnares/8.2.85. PMID 11347906.