This site provides the first gene annotation set based on the initial
release [July 2004] from the Dog Genome Sequencing consortium.
The assembly is constructed from WGS reads at 7.6x coverage (Q20 bases, assuming a genome of 2.4 Gb). It has an N50 contig length of 123 kb and an N50 supercontig length of 41.6 Mb.
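For readers unfamiliar with these statistics, the following is a minimal sketch of how an N50 value and the read volume implied by a coverage figure are conventionally computed. It is not the consortium's own tooling; the function names `n50` and `bases_for_coverage` are hypothetical, and the toy lengths in the usage lines are made up for illustration.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    account for at least half of the total assembled bases.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


def bases_for_coverage(coverage, genome_size_bp):
    """Return the number of read bases implied by a given fold coverage."""
    return coverage * genome_size_bp


# Toy contig lengths (illustrative only, not dog assembly data).
toy_contigs = [80, 70, 50, 40, 30, 20]
print(n50(toy_contigs))

# 7.6x coverage of an assumed 2.4 Gb genome is roughly 18.2 Gb of Q20 bases.
print(bases_for_coverage(7.6, 2.4e9))
```

Under this definition, an N50 contig length of 123 kb means that half of the assembled bases lie in contigs of at least 123 kb.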
The gene build was run via a reasonably standard Ensembl mammalian pipeline, modified to make optimal choices of source proteins for each gene. This initial analysis gives 20,604 genes with 32,559 transcripts, but lacks a number (around 700) of human orthologs that appear placeable on the dog genome. We are investigating these in more detail and may release a patched dataset in the future.
Ensembl RNA data has been updated as follows:
The API has of course been updated to reflect these changes.
| Assembly: | CanFam 1.0, July 2004 |
| Genebuild: | Ensembl, Nov 2004 |
| Database version: | 41.1j |
| Known genes: | 3,325 |
| Projected genes: | 13,786 |
| Novel genes: | 1,103 |
| Pseudogenes: | 2,238 |
| RNA genes: | 2,348 |
| Genscan gene predictions: | 77,477 |
| Gene exons: | 224,068 |
| Gene transcripts: | 32,559 |
| Base Pairs*: | 2,359,845,093 |
| Golden Path Length**: | 2,519,795,863 |
| Most common InterPro domains: | Top 40, Top 500 |
* Total number of base pairs = sum of the sequence lengths in the DNA table
** Reference assembly (Golden path) length = sum of non-redundant top level seq regions
© 2008 WTSI / EBI. Ensembl is available to download for public use - please see the code licence for details.