Click on a chromosome for a closer view
This site provides a data set based on the March
2006 Pan_troglodytes-2.1 6x shotgun assembly from the Chimpanzee Sequencing
Consortium headed by the GSC (St. Louis) and The Broad Institute (MIT).
The chimpanzee 2.1 assembly is a merge of the initial 4X made in collaboration with the Broad Institute at MIT and Harvard and an additional (2X) whole genome coverage from the WUGSC (St. Louis) utilizing a combination of whole genome plasmid reads as well as fosmid and BAC end sequences.
This release of the assembly has the following properties:
As of Release 35 we have changed the chimpanzee chromosome numbering to match the new primate standard proposed by E.H. McConkey (Cytogenetics and Genome Research, 105:157-158) and endorsed by the International Chimpanzee Genome Consortium.
The genome was aligned to human NCBI36 by UCSC using BLASTz. These alignments were used to transfer human ensembl gene structures (Human Build 36b) to chimpanzee. 94% of the genes were mapped by direct projection while only 0,1% of the human ensembl genes were not mapped, the remaining 5.9% were aligned directly into the chimpanzee genome by using Exonerate (G. Slater et al., BMC Bioinformatics. 2005 6:31).
A new Ensembl genebuild has been completed on the latest chimpanzee assembly from the Broad Institute (PanTro 2.1).
Read more...
Ensembl 41 includes the first release of variation data for chimp, to accompany the new assembly and genebuild.
Read more...
Whole Genome Alignments
Ensembl RNA data has been updated as follows:
| Assembly: | PanTro 2.1, Mar 2006 |
| Genebuild: | Ensembl, July 2006 |
| Database version: | 41.21 |
| Known genes: | 929 |
| Projected genes: | 19,035 |
| Novel genes: | 110 |
| Pseudogenes: | 1,028 |
| RNA genes: | 3,500 |
| Genscan gene predictions: | 126,539 |
| Gene exons: | 237,797 |
| Gene transcripts: | 33,880 |
| Base Pairs*: | 2,928,563,828 |
| Golden Path Length**: | 3,350,417,645 |
| Most common InterPro domains: | Top 40 Top 500 |
* Total number of base pairs = sum of lengths of DNA table
** Reference assembly (Golden path) length = sum of non-redundant top level seq regions
© 2008 WTSI / EBI. Ensembl is available to download for public use - please see the code licence for details.