C19orf18
Chromosome 19 open reading frame 18 (c19orf18) is a protein which in humans is encoded by the c19orf18 gene. The gene is exclusive to mammals and the protein is predicted to have a transmembrane domain and a coiled coil stretch.[1] This protein has a function that is not yet fully understood by the scientific community.
Gene
Aliases of this gene include MGC41906 and LOC147685.[1] The gene is located on chromosome 19 at 19q13.43.[2] The gene spans from 58,485,905 bp to 58,469,805 bp on the minus strand and contains 6 exons and 5 introns.[1] Transcription of this gene produces one spliced mRNA which codes for the protein c19orf18.
Expression
C19orf18 is ubiquitously expressed at moderate levels.[1] In humans, there is higher expression in the testis, prostate, lung, liver, pancreas, uterus, heart, and other connective tissues.[3][4]
Homology
Paralogs
There are no known paralogs of this gene in the human genome.[5]
Orthologs
The gene is exclusive to mammals.[1] The transmembrane domain is the most conserved region among close orthologs and distant homologs. The following table presents some of the orthologs found using searches in BLAST.[6] This list does not contain all of the orthologs for c19orf18. It is meant to display the diversity of species for which orthologs are found. They are sorted by date of divergence and then protein similarity.
Species | Date of Divergence (MYA) | Accession Number | Sequence length (aa) | Identity | Similarity |
---|---|---|---|---|---|
Homo sapiens (Humans) | 0 | NP_689687.1 | 215 | 100% | 100% |
Pongo abelii (Orangutan) | 15.2 | XP_002829939.1 | 216 | 92% | 94% |
Rhinopithecus roxellana (Golden snub-nosed Monkey) | 28.1 | XP_010385277.1 | 216 | 84% | 90% |
Carlito syrichta (Philippine tarsier) | 66.7 | XP_008066887.1 | 217 | 70% | 81% |
Otolemur garnettii (Galago) | 73 | XP_012663984.1 | 183 | 50% | 62% |
Mus musculus (Mouse) | 88 | XP_017167821.1 | 183 | 46% | 63% |
Oryctolagus cuniculus (European rabbit) | 88 | XP_008247222.1 | 242 | 49% | 62% |
Rhinolophus sinicus (Horseshoe bat) | 94 | XP_019567114.1 | 284 | 70% | 82% |
Vicugna pacos (Alpaca) | 94 | XP_015107013.1 | 214 | 65% | 80% |
Canis lupus familiaris (Dog) | 94 | XP_005616108.1 | 223 | 49% | 61% |
Bos taurus (Cow) | 94 | XP_015313970.1 | 250 | 44% | 53% |
Ornithorhynchus anatinus (Platypus) | 169 | XP_007664656.1 | 308 | 34% | 57% |
Protein
The coding sequence contains 215 amino acids. The molecular weight of c19orf18 is 24.151 kdal and the isoelectric point for the unphosphorylated state is 9.06.[7] The protein sequence is rich in leucine and is deficient in tryptophan, cysteine, and tyrosine. There is a negative charge cluster from amino acid 149 to 172.[8]
Structure
There is a cross-program consensus between GOR4, CFSSP, and PHYRE2 that the protein structure contains mostly coiled regions and alpha helices.[9][10][11]
Topology
The protein sequence is predicted to contain a signal peptide (1 aa to 24 aa), an extracellular domain (25 aa to 100 aa), a transmembrane domain (101 aa to 121 aa), and a cytoplasmic domain (122 aa to 215 aa).[12]
Subcellular localization
PSORTII and CELLO predicted that the human protein would localize to the plasma membrane and part of it would be in the extracellular region.[13][14] Immunofluorescent staining of human cell line U-2 OS shows localization to the Golgi apparatus.[15]
Function
Protein interactions
C19orf18 protein has been predicted to interact with several proteins listed in the table below. The interactions have been identified and verified through affinity capture-MS.[16]
Predicted interacting protein name | Score | Experimental verification |
---|---|---|
Nedd4 family interacting protein 1 | 0.9165 | Affinity capture-MS |
Activin A receptor, type IIA | 0.7829 | Affinity capture-MS |
Syntaxin 6 | 0.9679 | Affinity capture-MS |
Bone morphogenetic protein receptor type 1A | 0.8914 | Affinity capture-MS |
Fibroblast growth factor receptor 2 | 0.8789 | Affinity capture-MS |
Microfibrillar-associated protein 3 | 0.8756 | Affinity capture-MS |
C19orf18 protein interacts with Nedd4 family interacting protein 1 (NDFIP1) which promotes pancreatic beta cell death reduces insulin secretion.[17] Activin A receptor type 2A (ACVR2A) is a transmembrane receptor that is involved in ligand-binding and mediates the functions of activins.[18] Syntaxin 6 functions in trans-Golgi network vesicle trafficking, perhaps targeting to endosomes in mammalian cells.[19] Bone morphogenetic protein receptor type 1A(BMPR1A) is expressed almost exclusively in skeletal muscle and is a transcriptional regulator.[20] Fibroblast growth factor receptor 2 (FGFR2) plays an essential role in the regulation of osteoblast differentiation, proliferation and apoptosis, and is required for normal skeleton development.[21] Microfibrillar-associated protein 3 (MFAP3) has a function that is not fully understood but may be involved in nuclear signaling and may play a role in metastasis.[22]
Clinical Significance
Disease association
The c19orf18 protein is down-regulated in pancreatic cancer[23] and contains CpG sites found to be replicated for association with epithelial ovarian cancer risk.[24] The gene also decreases in expression in teratozoospermia[25] and increases in expression in polycystic ovary syndrome.[26] The gene may also be involved in prostate cancer and various tumors[3]
References
- ↑ 1.0 1.1 1.2 1.3 1.4 mieg@ncbi.nlm.nih.gov, Danielle Thierry-Mieg and Jean Thierry-Mieg, NCBI/NLM/NIH,. "AceView: Gene:C19orf18, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView". www.ncbi.nlm.nih.gov. Retrieved 2018-02-26.
- ↑ "C19orf18 chromosome 19 open reading frame 18 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2018-02-26.
- ↑ 3.0 3.1 "EST Profile - Hs.134209". EST Profiles.
- ↑ "GDS3113 - GEO Profiles - NCBI". GEO profiles.
- ↑ "C19orf18 Gene - GeneCards". GeneCards - Human Gene Database.
- ↑ "Protein BLAST: search protein databases using a protein query". NIH Basic Local Alignment Search Tool.
- ↑ "C19orf18 (human)". PhosphoSitePlus.
- ↑ "SAPS Results". EMBL-EBI.
- ↑ "GOR IV SECONDARY STRUCTURE PREDICTION". PRABI Rhone-Alpes Bioinformatics Center.
- ↑ Ashok Kumar, T. "CFSSP: Chou & Fasman Secondary Structure Prediction Server". www.biogem.org.
- ↑ Kelley, Lawrence. "PHYRE2 Protein Fold Recognition Server". www.sbg.bio.ic.ac.uk.
- ↑ "C19orf18 - Uncharacterized protein C19orf18 precursor - Homo sapiens (Human) - C19orf18 gene & protein". UniProt.
- ↑ "PSORT II Prediction". PSORTII.
- ↑ "CELLO:Subcellular Localization Predictive System". Molecular Bioinformatics Center. Archived from the original on 2016-03-04.
- ↑ "C19orf18". The Human Protein Atlas.
- ↑ Tyers, Mike. "C19orf18 Result Summary". BioGRID.
- ↑ "NDFIP1 - NEDD4 family-interacting protein 1 - Homo sapiens (Human) - NDFIP1 gene & protein". UniProt.
- ↑ "ACVR2A activin A receptor type 2A [Homo sapiens (human)] - Gene - NCBI". Gene.
- ↑ Bock, J B; Klumperman, J; Davanger, S; Scheller, R H (July 1997). "Syntaxin 6 functions in trans-Golgi network vesicle trafficking". Molecular Biology of the Cell. 8 (7): 1261–1271. ISSN 1059-1524. PMC 276151. PMID 9243506.
- ↑ "BMPR1A Gene - GeneCards | BMR1A Protein | BMR1A Antibody". GeneCards Human Gene.
- ↑ "FGFR2 - Fibroblast growth factor receptor 2 precursor - Homo sapiens (Human) - FGFR2 gene & protein". UniProt.
- ↑ "MFAP3L - Microfibrillar-associated protein 3-like precursor - Homo sapiens (Human) - MFAP3L gene & protein". UniProt.
- ↑ Makler, Amy; Narayanan, Ramaswamy (2017-05-01). "Mining Exosomal Genes for Pancreatic Cancer Targets". Cancer Genomics - Proteomics. 14 (3): 161–172. doi:10.21873/cgp.20028. ISSN 1109-6535. PMID 28446531.
- ↑ Fridley, Brooke L.; Armasu, Sebastian M.; Cicek, Mine S.; Larson, Melissa C.; Wang, Chen; Winham, Stacey J.; Kalli, Kimberly R.; Koestler, Devin C.; Rider, David N. (2014-04-28). "Methylation of leukocyte DNA and ovarian cancer: relationships with disease status and outcome". BMC Medical Genomics. 7: 21. doi:10.1186/1755-8794-7-21. ISSN 1755-8794.
- ↑ "GDS2697 - GEO Profiles - NCBI". GEO profiles.
- ↑ "GDS4399 - GEO Profiles - NCBI". GEO profiles.