(Below N is a link to NCBI taxonomic web page and E link to ESTHER at designed phylum.) > cellular organisms: NE > Eukaryota: NE > Opisthokonta: NE > Metazoa: NE > Eumetazoa: NE > Bilateria: NE > Deuterostomia: NE > Chordata: NE > Craniata: NE > Vertebrata: NE > Gnathostomata: NE > Teleostomi: NE > Euteleostomi: NE > Sarcopterygii: NE > Dipnotetrapodomorpha: NE > Tetrapoda: NE > Amniota: NE > Mammalia: NE > Theria: NE > Eutheria: NE > Boreoeutheria: NE > Laurasiatheria: NE > Cetartiodactyla: NE > Ruminantia: NE > Pecora: NE > Cervidae: NE > Cervinae: NE > Cervus: NE > Cervus elaphus: NE > Cervus elaphus hippelaphus: NE
LegendThis sequence has been compared to family alignement (MSA) red => minority aminoacid blue => majority aminoacid color intensity => conservation rate title => sequence position(MSA position)aminoacid rate Catalytic site Catalytic site in the MSA MSGEWVHSGQTLIWAVWVLAAAIKGPATADAPTRHTNLGWVRGTQASVLG NDMLVNVFLGVPYAAPPVGPLRFANPEPLLPWNGFLNATSYPKLCFQNLE WLFTDQHILKVRYPKFRVSEDCLYLNIYAPAHAETGSRLPVMVWLPGGAF ETGSASIFDGSALASYENVLVVTIQYRLGIFGFFNTGDKHALGNWAFMDQ VAALMWVQENIESFGGDPGRVTIFGESAGAISVSSLILSPMTEGLFHRAI MESGVAIIPYLKAPDYERNDDLQTIASICDCSASNSVALLQCLRAKSSKE LLSISQKTKSFTRVVDGLFFPNEPLDLLAQKSFHLVPSIIGVNNHECGFL LPMKEFPEIIGGSNKSLALQLINSILHIPVQYLYLVANEYFYNMHSLVDI RNRFLDLLGDVFFVIPGLVTAQYHTDAGALVYFYEFQHQPQCLKDRKPPY VKADHTDEIRFVFGGAFLKGNIVMFEEATEEEKGLSRKMMRYWANFARTG NPNGKGLPLWPAYRQSEEYLQLDLNISVGQRLKEVELKFWTETLPLMMTS SGALLASLSSLTFLFLLLPFIFSFAP
We present here the de novo genome assembly CerEla1.0 for the red deer, Cervus elaphus, an emblematic member of the natural megafauna of the Northern Hemisphere. Humans spread the species in the South. Today, the red deer is also a farm-bred animal and is becoming a model animal in biomedical and population studies. Stag DNA was sequenced at 74x coverage by Illumina technology. The ALLPATHS-LG assembly of the reads resulted in 34.7 x 10(3) scaffolds, 26.1 x 10(3) of which were utilized in Cer.Ela1.0. The assembly spans 3.4 Gbp. For building the red deer pseudochromosomes, a pre-established genetic map was used for main anchor points. A nearly complete co-linearity was found between the mapmarker sequences of the deer genetic map and the order and orientation of the orthologous sequences in the syntenic bovine regions. Syntenies were also conserved at the in-scaffold level. The cM distances corresponded to 1.34 Mbp uniformly along the deer genome. Chromosomal rearrangements between deer and cattle were demonstrated. 2.8 x 10(6) SNPs, 365 x 10(3) indels and 19368 protein-coding genes were identified in CerEla1.0, along with positions for centromerons. CerEla1.0 demonstrates the utilization of dual references, i.e., when a target genome (here C. elaphus) already has a pre-established genetic map, and is combined with the well-established whole genome sequence of a closely related species (here Bos taurus). Genome-wide association studies (GWAS) that CerEla1.0 (NCBI, MKHE00000000) could serve for are discussed.