Genome sequencing of 2000 canids by the Dog10K consortium advances the understanding of demography, genome function and architecture.

Meadows, Jennifer R S; Kidd, Jeffrey M; Wang, Guo-Dong; Parker, Heidi G; Schall, Peter Z; Bianchi, Matteo; Christmas, Matthew J; Bougiouri, Katia; Buckley, Reuben M; Hitte, Christophe; Nguyen, Anthony K; Wang, Chao; Jagannathan, Vidhya; Niskanen, Julia E; Frantz, Laurent A F; Arumilli, Meharji; Hundi, Sruthi; Lindblad-Toh, Kerstin; Ginja, Catarina; Agustina, Kadek Karang; ... (2023). Genome sequencing of 2000 canids by the Dog10K consortium advances the understanding of demography, genome function and architecture. Genome biology, 24(1), p. 187. BioMed Central Ltd. 10.1186/s13059-023-03023-7

[img]
Preview
Text
s13059-023-03023-7.pdf - Published Version
Available under License Creative Commons: Attribution (CC-BY).

Download (4MB) | Preview

BACKGROUND

The international Dog10K project aims to sequence and analyze several thousand canine genomes. Incorporating 20 × data from 1987 individuals, including 1611 dogs (321 breeds), 309 village dogs, 63 wolves, and four coyotes, we identify genomic variation across the canid family, setting the stage for detailed studies of domestication, behavior, morphology, disease susceptibility, and genome architecture and function.

RESULTS

We report the analysis of > 48 M single-nucleotide, indel, and structural variants spanning the autosomes, X chromosome, and mitochondria. We discover more than 75% of variation for 239 sampled breeds. Allele sharing analysis indicates that 94.9% of breeds form monophyletic clusters and 25 major clades. German Shepherd Dogs and related breeds show the highest allele sharing with independent breeds from multiple clades. On average, each breed dog differs from the UU_Cfam_GSD_1.0 reference at 26,960 deletions and 14,034 insertions greater than 50 bp, with wolves having 14% more variants. Discovered variants include retrogene insertions from 926 parent genes. To aid functional prioritization, single-nucleotide variants were annotated with SnpEff and Zoonomia phyloP constraint scores. Constrained positions were negatively correlated with allele frequency. Finally, the utility of the Dog10K data as an imputation reference panel is assessed, generating high-confidence calls across varied genotyping platform densities including for breeds not included in the Dog10K collection.

CONCLUSIONS

We have developed a dense dataset of 1987 sequenced canids that reveals patterns of allele sharing, identifies likely functional variants, informs breed structure, and enables accurate imputation. Dog10K data are publicly available.

Item Type:

Journal Article (Original Article)

Division/Institute:

09 Interdisciplinary Units > Next Generation Sequencing (NGS) Platform
05 Veterinary Medicine > Department of Clinical Research and Veterinary Public Health (DCR-VPH) > Institute of Genetics
05 Veterinary Medicine > Department of Clinical Research and Veterinary Public Health (DCR-VPH)

UniBE Contributor:

Jagannathan, Vidya, Drögemüller, Michaela, Leeb, Tosso

Subjects:

500 Science > 570 Life sciences; biology
500 Science > 590 Animals (Zoology)
600 Technology > 610 Medicine & health

ISSN:

1465-6906

Publisher:

BioMed Central Ltd.

Language:

English

Submitter:

Pubmed Import

Date Deposited:

16 Aug 2023 15:35

Last Modified:

08 Nov 2023 16:08

Publisher DOI:

10.1186/s13059-023-03023-7

Related URLs:

PubMed ID:

37582787

Uncontrolled Keywords:

Canine Demographic history Dog Genetic diversity Genomics Mitochondrial DNA Variation

BORIS DOI:

10.48350/185494

URI:

https://boris.unibe.ch/id/eprint/185494

Actions (login required)

Edit item Edit item
Provide Feedback