C8orf58: Difference between revisions
←Created page with '{{Infobox_gene}} '''Chromosome 8 open reading frame 58''' is a protein that in humans is encoded by the C8orf58 gene. <ref name="entrez"> {{cite web | t...' |
Matt Pijoan (talk | contribs) m 1 revision imported |
||
(One intermediate revision by one other user not shown) | |||
Line 1: | Line 1: | ||
{{Infobox_gene}} | {{Infobox_gene}} | ||
'''Chromosome 8 open reading frame 58''' is | '''Chromosome 8 open reading frame 58''' is an uncharacterised [[protein]] that in humans is encoded by the ''C8orf58'' [[gene]].<ref name="entrez"> | ||
<ref name="entrez"> | |||
{{cite web | {{cite web | ||
| title = Entrez Gene: Chromosome 8 open reading frame 58 | | title = Entrez Gene: Chromosome 8 open reading frame 58 | ||
| url = | | url = https://www.ncbi.nlm.nih.gov/gene/541565 | ||
| accessdate = 2017-11-22 | | accessdate = 2017-11-22 | ||
}}</ref> | }}</ref> The protein is predicted to be localized in the [[Cell nucleus|nucleus]]. | ||
== Gene == | |||
The <i>C8orf58</i> gene is located on [[chromosome 8]] at position 8p21.3. It spans a total of 4,550 [[Base pair|base pairs]] and has seven [[Exon|exons]]. C8orf58 is flanked by the genes [[PDLIM2]] and CCAR2.<ref>''NCBI Nucleotide. Homo sapiens chromosome 8 open reading frame 58 (C8orf58), transcript variant 1, mRNA. [https://www.ncbi.nlm.nih.gov/nuccore/NM_001013842.2]''</ref> There are no aliases. It is defined as a protein coding gene.<ref>GeneCard. C8orf58 Gene(Protein Coding) Chromosome 8 Open Reading Frame 58. [https://www.genecards.org/cgi-bin/carddisp.pl?gene=C8orf58#publications]</ref> | |||
== mRNA == | |||
C8orf58 produces three transcript splice variants. The [[Transcription (biology)|transcript]] of variant 1 represents the longest transcript and encodes the largest protein. It is 2,062 base pairs and contains seven exons. There are two other splice variants, produced by [[Alternative splicing|alternative splice sites]].<ref>NCBI Gene. C8orf58 chromosome 8 open reading frame 58 [ Homo sapiens (human) ]. [https://www.ncbi.nlm.nih.gov/gene/541565]</ref> | |||
{| class="wikitable" | |||
<tbody><tr><th>Isoform</th><th>Exons</th><th>Length (base pairs)</th><th>Features</th></tr><tr><td>Transcript Variant 1</td><td>1, 2, 3, 4, 5, 6, 7</td><td>2062</td><td>One upstream in-frame stop codon.</td></tr><tr><td>Transcript Variant 2</td><td>1, 2, 3, 4, 5, 6, 7</td><td>2038</td><td>Alternate in-frame splice site in the 3' coding region.</td></tr><tr href="Category:All stub articles"><td>Transcript Variant 3</td><td>1, 2, 3, 4, 5, 6</td><td>1955</td><td>Lacks an alternate exon, results in a frameshift in the 3' coding region.</td></tr></tbody> | |||
|} | |||
C8orf58 has a relatively short 5’ region and a moderate 3’ region. Both the 5’ and 3’ regions contain [[Stem-loop|stem loops]].<ref>[http://unafold.rna.albany.edu/?q=mfold/RNA-Folding-Form RNA Folding Form]</ref> There is one predicted [[miRNA]] binding site that found in the 3’UTR of C8orf58.<ref>[http://www.targetscan.org/vert_71/ TargetScan Human]</ref> | |||
== | == Protein == | ||
C8orf58 protein Isoform 1 is 365 amino acids long. Isoform 2 and Isoform 3 are 357 and 300 amino acids respectively. There is a [[kozak consensus sequence]] present, which confirms it is a protein coding sequence.<ref>NCBI Protein. Uncharacterized protein C8orf58 isoform 1 [Homo sapiens].[https://www.ncbi.nlm.nih.gov/protein/NP_001013864.1]</ref> | |||
C8orf58 Isoform 1 has a molecular weight of 39.7 [[Dalton (unit)|kDa]] and an [[isoelectric point]] of 8.29. It is proline and arginine rich and isoleucine, asparagine, phenylalanine, and tyrosine poor.<ref name="auto">[http://workbench.sdsc.edu/ SDSC Biology Workbench]</ref> | |||
== | The predicted secondary structure of the C8orf58 protein include multiple [[Alpha helix|alpha helices]] and one [[Beta strand|beta strands]].<ref name="auto" /><ref>[http://www.biogem.org/tool/chou-fasman/index.php Chou-Fasman Secondary Structure Prediction Server]</ref> | ||
{| class="wikitable" | |||
!Isoform | |||
!From mRNA Variant | |||
!Length (amino acids) | |||
!Molecular Weight (kDa) | |||
!Isoelectric Point | |||
|- | |||
|1 | |||
|1 | |||
|365 | |||
|39.7 | |||
|8.30 | |||
|- | |||
|2 | |||
|2 | |||
|357 | |||
|38.6 | |||
|8.30 | |||
|- | |||
|3 | |||
|3 | |||
|300 | |||
|32.0 | |||
|5.82 | |||
|} | |||
== Evolutionary history == | |||
It is part of the DUF4657 family, a family of proteins found in eukaryotes. Proteins in this family are typically between 305 and 370 amino acids in length.<ref>[https://www.uniprot.org/uniprot/Q8NAV2 UniProtKB - Q8NAV2 (CH058_HUMAN). UniProt]</ref> The [[Domain of unknown function|Domain of Unknown Function]] (DUF) of C8orf58 is located between amino acids 73 to 364. | |||
== Expression == | |||
According to the NCBI GEO profiles, C8orf58 is a narrowly expressed protein found in spleen, lung, thymus, prostate, and spinal cord tissue. It is constitutively expressed in these tissues.<ref>[https://www.ncbi.nlm.nih.gov/geoprofiles/?term=c8orf58 NCBI GEO Profiles]</ref> | |||
== Post-translational modification == | |||
The bioinformatic tools on Expasy were used to determine potential post translational modification sites for the C8orf58 protein. There are two predicted [[Phosphorylation site|phosphorylation sites]] and one predicted [[Sumoylation|sumoylation site]].<ref>[https://www.expasy.org/proteomics Expasy Bioinformatics Resource Portal]</ref> | |||
== Subcellular localization == | |||
According to PSORT II, C8orf58 is located in the nucleus. This is supported by the presence of a sumoylation site, which is involved in [[Nuclear transport|nucleic cytoplasmic transport]]. | |||
== Interacting proteins == | |||
Two proteins have been found to interact with protein C8orf58, [[CENPH]] and metG1, which were found using [[Two-hybrid screening|two hybrid assay]] and the two hybrid pooling approach respectively.<ref>[http://www.ebi.ac.uk/intact/ IntAct Molecular Interaction Database]</ref> CENPH (Centromere Protein H) plays a critical role in centromere structure, kinetochore formation, and sister chromatid separation.<ref>[https://www.uniprot.org/uniprot/Q9H3R5 Centromere protein H]</ref> MetG1 (Methionine—tRNA ligase) is required for elongation of protein synthesis and the initiation of all mRNA translation through initiator tRNA(fMet) aminoacylation.<ref>[https://www.uniprot.org/uniprot/Q1H2F4 Methionine--tRNA ligase]</ref> | |||
== Homology == | |||
An important [[paralog]] of this gene is ENSG00000248235.<ref>GeneCard. 8orf58 Gene(Protein Coding) Chromosome 8 Open Reading Frame 58. [https://www.genecards.org/cgi-bin/carddisp.pl?gene=C8orf58#publications].</ref> [[Orthologs]] of the human gene C8orf58 are limited to [[vertebrates]] of the animal kingdom. | |||
{| class="wikitable" | |||
!Scientific Name | |||
!Common Name | |||
!NCBI Accession Number | |||
!Length (Amino Acids) | |||
!Date of Divergence (MYA) | |||
!Identity (%) | |||
!Similarity (%) | |||
|- | |||
|''[[Homo sapiens]]'' | |||
|Human | |||
|NP_001013864.1 | |||
|365 | |||
| - | |||
| - | |||
| - | |||
|- | |||
|''[[Gorilla gorilla]]'' | |||
|Gorilla | |||
|XP_004046807.1 | |||
|439 | |||
|9.06 | |||
|96 | |||
|79.50 | |||
|- | |||
|''[[Marmota marmota]]'' | |||
|Alpine Marmot | |||
|XP_015354979.1 | |||
|369 | |||
|90 | |||
|68 | |||
|75.7 | |||
|- | |||
|''[[Oryctolagus cuniculus]]'' | |||
|European Rabbit | |||
|XP_008248092.1 | |||
|371 | |||
|90 | |||
|66 | |||
|72 | |||
|- | |||
|''[[Spalax|Nannospalax galili]]'' | |||
|Spalax | |||
|XP_008848689.1 | |||
|362 | |||
|90 | |||
|65 | |||
|74.7 | |||
|- | |||
|''[[Ceratotherium simum simum]]'' | |||
|White Rhinoceros | |||
|XP_014652157.1 | |||
|381 | |||
|96 | |||
|66 | |||
|72.7 | |||
|- | |||
|''[[Odobenus rosmarus divergens]]'' | |||
|Pacific walrus | |||
|XP_012418498.1 | |||
|388 | |||
|96 | |||
|65 | |||
|74.7 | |||
|- | |||
|''[[Sus scrofa]]'' | |||
|Wild Boar | |||
|XP_005670472.1 | |||
|382 | |||
|96 | |||
|65 | |||
|73.3 | |||
|- | |||
|''[[Hipposideros armiger]]'' | |||
|Great Roundleaf Bat | |||
|XP_019487131.1 | |||
|387 | |||
|96 | |||
|62 | |||
|71 | |||
|- | |||
|''[[Eptesicus fuscus]]'' | |||
|Big Brown Bat | |||
|XP_008149784.1 | |||
|377 | |||
|96 | |||
|62 | |||
|70.1 | |||
|- | |||
|''[[Loxodonta africana]]'' | |||
|African Bush Elephant | |||
|XP_003412428.1 | |||
|372 | |||
|105 | |||
|71 | |||
|77.2 | |||
|- | |||
|''[[Aardvark|Orycteropus afer afer]]'' | |||
|Aardvark | |||
|XP_007949039.1 | |||
|370 | |||
|105 | |||
|65 | |||
|71.7 | |||
|- | |||
|''[[Parus major]]'' | |||
|Great Tit | |||
|XP_015504136.1 | |||
|320 | |||
|312 | |||
|32 | |||
|35.6 | |||
|- | |||
|''[[Anolis carolinensis]]'' | |||
|Carolina Anole | |||
|XP_008118367.1 | |||
|453 | |||
|312 | |||
|28 | |||
|38.9 | |||
|} | |||
==References== | |||
<references /> | |||
{{gene-8-stub}} | {{gene-8-stub}} |
Latest revision as of 08:50, 10 January 2019
VALUE_ERROR (nil) | |||||||
---|---|---|---|---|---|---|---|
Identifiers | |||||||
Aliases | |||||||
External IDs | GeneCards: [6] | ||||||
Orthologs | |||||||
Species | Human | Mouse | |||||
Entrez |
|
| |||||
Ensembl |
|
| |||||
UniProt |
|
| |||||
RefSeq (mRNA) |
|
| |||||
RefSeq (protein) |
|
| |||||
Location (UCSC) | n/a | n/a | |||||
PubMed search | n/a | n/a | |||||
Wikidata | |||||||
|
Chromosome 8 open reading frame 58 is an uncharacterised protein that in humans is encoded by the C8orf58 gene.[1] The protein is predicted to be localized in the nucleus.
Gene
The C8orf58 gene is located on chromosome 8 at position 8p21.3. It spans a total of 4,550 base pairs and has seven exons. C8orf58 is flanked by the genes PDLIM2 and CCAR2.[2] There are no aliases. It is defined as a protein coding gene.[3]
mRNA
C8orf58 produces three transcript splice variants. The transcript of variant 1 represents the longest transcript and encodes the largest protein. It is 2,062 base pairs and contains seven exons. There are two other splice variants, produced by alternative splice sites.[4]
<tbody></tbody>Isoform | Exons | Length (base pairs) | Features |
---|---|---|---|
Transcript Variant 1 | 1, 2, 3, 4, 5, 6, 7 | 2062 | One upstream in-frame stop codon. |
Transcript Variant 2 | 1, 2, 3, 4, 5, 6, 7 | 2038 | Alternate in-frame splice site in the 3' coding region. |
Transcript Variant 3 | 1, 2, 3, 4, 5, 6 | 1955 | Lacks an alternate exon, results in a frameshift in the 3' coding region. |
C8orf58 has a relatively short 5’ region and a moderate 3’ region. Both the 5’ and 3’ regions contain stem loops.[5] There is one predicted miRNA binding site that found in the 3’UTR of C8orf58.[6]
Protein
C8orf58 protein Isoform 1 is 365 amino acids long. Isoform 2 and Isoform 3 are 357 and 300 amino acids respectively. There is a kozak consensus sequence present, which confirms it is a protein coding sequence.[7]
C8orf58 Isoform 1 has a molecular weight of 39.7 kDa and an isoelectric point of 8.29. It is proline and arginine rich and isoleucine, asparagine, phenylalanine, and tyrosine poor.[8]
The predicted secondary structure of the C8orf58 protein include multiple alpha helices and one beta strands.[8][9]
Isoform | From mRNA Variant | Length (amino acids) | Molecular Weight (kDa) | Isoelectric Point |
---|---|---|---|---|
1 | 1 | 365 | 39.7 | 8.30 |
2 | 2 | 357 | 38.6 | 8.30 |
3 | 3 | 300 | 32.0 | 5.82 |
Evolutionary history
It is part of the DUF4657 family, a family of proteins found in eukaryotes. Proteins in this family are typically between 305 and 370 amino acids in length.[10] The Domain of Unknown Function (DUF) of C8orf58 is located between amino acids 73 to 364.
Expression
According to the NCBI GEO profiles, C8orf58 is a narrowly expressed protein found in spleen, lung, thymus, prostate, and spinal cord tissue. It is constitutively expressed in these tissues.[11]
Post-translational modification
The bioinformatic tools on Expasy were used to determine potential post translational modification sites for the C8orf58 protein. There are two predicted phosphorylation sites and one predicted sumoylation site.[12]
Subcellular localization
According to PSORT II, C8orf58 is located in the nucleus. This is supported by the presence of a sumoylation site, which is involved in nucleic cytoplasmic transport.
Interacting proteins
Two proteins have been found to interact with protein C8orf58, CENPH and metG1, which were found using two hybrid assay and the two hybrid pooling approach respectively.[13] CENPH (Centromere Protein H) plays a critical role in centromere structure, kinetochore formation, and sister chromatid separation.[14] MetG1 (Methionine—tRNA ligase) is required for elongation of protein synthesis and the initiation of all mRNA translation through initiator tRNA(fMet) aminoacylation.[15]
Homology
An important paralog of this gene is ENSG00000248235.[16] Orthologs of the human gene C8orf58 are limited to vertebrates of the animal kingdom.
Scientific Name | Common Name | NCBI Accession Number | Length (Amino Acids) | Date of Divergence (MYA) | Identity (%) | Similarity (%) |
---|---|---|---|---|---|---|
Homo sapiens | Human | NP_001013864.1 | 365 | - | - | - |
Gorilla gorilla | Gorilla | XP_004046807.1 | 439 | 9.06 | 96 | 79.50 |
Marmota marmota | Alpine Marmot | XP_015354979.1 | 369 | 90 | 68 | 75.7 |
Oryctolagus cuniculus | European Rabbit | XP_008248092.1 | 371 | 90 | 66 | 72 |
Nannospalax galili | Spalax | XP_008848689.1 | 362 | 90 | 65 | 74.7 |
Ceratotherium simum simum | White Rhinoceros | XP_014652157.1 | 381 | 96 | 66 | 72.7 |
Odobenus rosmarus divergens | Pacific walrus | XP_012418498.1 | 388 | 96 | 65 | 74.7 |
Sus scrofa | Wild Boar | XP_005670472.1 | 382 | 96 | 65 | 73.3 |
Hipposideros armiger | Great Roundleaf Bat | XP_019487131.1 | 387 | 96 | 62 | 71 |
Eptesicus fuscus | Big Brown Bat | XP_008149784.1 | 377 | 96 | 62 | 70.1 |
Loxodonta africana | African Bush Elephant | XP_003412428.1 | 372 | 105 | 71 | 77.2 |
Orycteropus afer afer | Aardvark | XP_007949039.1 | 370 | 105 | 65 | 71.7 |
Parus major | Great Tit | XP_015504136.1 | 320 | 312 | 32 | 35.6 |
Anolis carolinensis | Carolina Anole | XP_008118367.1 | 453 | 312 | 28 | 38.9 |
References
- ↑ "Entrez Gene: Chromosome 8 open reading frame 58". Retrieved 2017-11-22.
- ↑ NCBI Nucleotide. Homo sapiens chromosome 8 open reading frame 58 (C8orf58), transcript variant 1, mRNA. [1]
- ↑ GeneCard. C8orf58 Gene(Protein Coding) Chromosome 8 Open Reading Frame 58. [2]
- ↑ NCBI Gene. C8orf58 chromosome 8 open reading frame 58 [ Homo sapiens (human) ]. [3]
- ↑ RNA Folding Form
- ↑ TargetScan Human
- ↑ NCBI Protein. Uncharacterized protein C8orf58 isoform 1 [Homo sapiens].[4]
- ↑ 8.0 8.1 SDSC Biology Workbench
- ↑ Chou-Fasman Secondary Structure Prediction Server
- ↑ UniProtKB - Q8NAV2 (CH058_HUMAN). UniProt
- ↑ NCBI GEO Profiles
- ↑ Expasy Bioinformatics Resource Portal
- ↑ IntAct Molecular Interaction Database
- ↑ Centromere protein H
- ↑ Methionine--tRNA ligase
- ↑ GeneCard. 8orf58 Gene(Protein Coding) Chromosome 8 Open Reading Frame 58. [5].
![]() | This article on a gene on human chromosome 8 is a stub. You can help Wikipedia by expanding it. |