HomoloGene
WikiDoc Resources for HomoloGene |
Articles |
---|
Most recent articles on HomoloGene |
Media |
Evidence Based Medicine |
Clinical Trials |
Ongoing Trials on HomoloGene at Clinical Trials.gov Clinical Trials on HomoloGene at Google
|
Guidelines / Policies / Govt |
US National Guidelines Clearinghouse on HomoloGene
|
Books |
News |
Commentary |
Definitions |
Patient Resources / Community |
Patient resources on HomoloGene Discussion groups on HomoloGene Patient Handouts on HomoloGene Directions to Hospitals Treating HomoloGene Risk calculators and risk factors for HomoloGene
|
Healthcare Provider Resources |
Causes & Risk Factors for HomoloGene |
Continuing Medical Education (CME) |
International |
|
Business |
Experimental / Informatics |
Please Take Over This Page and Apply to be Editor-In-Chief for this topic: There can be one or more than one Editor-In-Chief. You may also apply to be an Associate Editor-In-Chief of one of the subtopics below. Please mail us [1] to indicate your interest in serving either as an Editor-In-Chief of the entire topic or as an Associate Editor-In-Chief for a subtopic. Please be sure to attach your CV and or biographical sketch.
Overview
HomoloGene[2], a tool of the National Center for Biotechnology Information (NCBI)[3], is a system for automated detection of homologs (similarity attributable to descent from a common ancestor) among the annotated genes of several completely sequenced eukaryotic genomes.
The HomoloGene processing consists of the protein analysis from the input organisms. Sequences are compared using blastp[4], then matched up and put into groups, using a taxonomic tree built from sequence similarity, where closer related organisms are matched up first, and then further organisms are added to the tree. The protein alignments are mapped back to their corresponding DNA sequences, and then distance metrics as molecular distances Jukes and Cantor (1969), Ka/Ks ratio can be calculated.
The sequences are matched up by using a heuristic algorithm for maximizing the score globally, rather than locally, in a bipartite matching (see complete bipartite graph). And then it calculates the statistical significance of each match. Cutoffs are made per position and Ks values are set to prevent false "orthologs" from being grouped together. “Paralogs” are identified by finding sequences that are closer within species than other species.
Input Organisms
Homo sapiens, Mus musculus, Danio rerio, Rattus norvegicus, Pan troglodytes, Canis lupus familiaris, Arabidopsis thaliana, Gallus gallus, Oryza sativa, Anopheles gambiae, Drosophila melanogaster, Magnaporthe grisea, Neurospora crassa, Caenorhabditis elegans, Saccharomyces cerevisiae, Kluyveromyces lactis, Eremothecium gossypii, Schizosaccharomyces pombe and Plasmodium falciparum.
Interface
The HomoloGene is linked to all Entrez databases and based on homology and phenotype information of these links:
- Mouse Genome Informatics (MGI),
- Zebrafish Information Network (ZFIN),
- Saccharomyces Genome Database (SGD),
- Clusters of Orthologous Groups (COG),
- FlyBase,
- Online Mendelian Inheritance in Man (OMIM)
As a result HomoloGene displays information about Genes, Proteins, Phenotypes, and Conserved Domains.
External links
- Bioinformatic Harvester is a meta search engine that uses, among others, Homologene: [5]
- NCBI
- OMIM
- ZFIN
- SGD
- COG
- FlyBase
- MGI
- Rat Genome Database