| Literature DB >> 30988347 |
Sabine Felkel1,2, Claus Vogl1, Doris Rigler1, Viktoria Dobretsberger1, Bhanu P Chowdhary3, Ottmar Distl4, Ruedi Fries5, Vidhya Jagannathan6, Jan E Janečka7, Tosso Leeb6, Gabriella Lindgren8,9, Molly McCue10, Julia Metzger4, Markus Neuditschko11, Thomas Rattei12, Terje Raudsepp13, Stefan Rieder11, Carl-Johan Rubin14, Robert Schaefer11, Christian Schlötterer15, Georg Thaller16, Jens Tetens16,17, Brandon Velie8,18, Gottfried Brem1, Barbara Wallner19.
Abstract
Analysis of the Y chromosome is the best-established way to reconstruct paternal family history in humans. Here, we applied fine-scaled Y-chromosomal haplotyping in horses with biallelic markers and demonstrate the potential of our approach to address the ancestry of sire lines. We de novo assembled a draft reference of the male-specific region of the Y chromosome from Illumina short reads and then screened 5.8 million basepairs for variants in 130 specimens from intensively selected and rural breeds and nine Przewalski's horses. Among domestic horses we confirmed the predominance of a young'crown haplogroup' in Central European and North American breeds. Within the crown, we distinguished 58 haplotypes based on 211 variants, forming three major haplogroups. In addition to two previously characterised haplogroups, one observed in Arabian/Coldblooded and the other in Turkoman/Thoroughbred horses, we uncovered a third haplogroup containing Iberian lines and a North African Barb Horse. In a genealogical showcase, we distinguished the patrilines of the three English Thoroughbred founder stallions and resolved a historic controversy over the parentage of the horse 'Galopin', born in 1872. We observed two nearly instantaneous radiations in the history of Central and Northern European Y-chromosomal lineages that both occurred after domestication 5,500 years ago.Entities:
Mesh:
Year: 2019 PMID: 30988347 PMCID: PMC6465346 DOI: 10.1038/s41598-019-42640-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1LipY764 Assembly. (a) Assembly step: whole genome NGS reads from males (blue) are mapped to Y-specific bait sequences (black). Mapped reads (dark blue) are then extracted and assembled (grey). Classification step: the assembly is shown in grey with hatchmarks representing 50 bp windows. Male (blue) and female (red) reads are mapped to the assembly and mapping coverages normalised to autosomal coverages per window were estimated. The probability of Y- or nonMSY-specificity per window is obtained by comparing normalised coverages in males and females. class nonMSY: XX/XY/AUT ≈ 1 in males and ≈ 1 in females; class Y: scY ≤ 1 in males and ≈ 0 in females, mcY > 1 in males and ≈ 0 in females. (b) Frequency distribution of normalised mapping coverage in classY windows. The cut-off scY to seperate mcY is set to 1 (red dashed line). (c) Resulting statistics for the assembly and classification approach. (d) Position of LipY764 contigs on eMSYv3[46] (data in Supplementary Table S5). Contigs having a single unique position on eMSYv3 are shown in grey, contigs with multiple hits in black/bold.
Figure 2Horse MSY tree. A maximum parsimony tree showing the horse MSY phylogeny based on 2,192 scY variants detected in 139 males. The tree is rooted with the donkey and bootstrap values of 90% or higher are shown. The Przewalski’s horses are shown in brown. Blueish clades correspond to early splitting Asian samples (O), Northern European breeds (N and I) and other autochthonous Asian samples (M, Y and J). The three clearly separated crown group clades are represented in pink (H), green (A) and orange (T) shades. Assigned haplogroups are shown on the right. A detailed haplotype network with variants is shown in Supplementary Fig. S4, variant details are given in Supplementary Table S8.
Figure 3Detailed view on haplogroup Tb. (a) Haplotype network of group Tb. Circle sizes correspond to the number of samples. Nomenclature of HTs is based on Wallner et al.[23] and subbranches according to human guidelines[62]. Determining variants are given on branches (details in Supplementary Table S8). HTs derived from Darley Arabian are shown in red, Godolphin Arabian in orange and Byerley Turk in yellow. (b) Pedigree reconstruction of English Thoroughbred descendants and the respective HTs. Dotted lines connect relatives where at least one ancestor is omitted. For each HT the number of samples in the NGS dataset is given and the number of genotyped individuals, if available (Table S9), shown in parentheses.
Figure 4Divergence time estimates. (a) Maximum parsimony tree rooted with the donkey. Coloured circles represent contemporary haplogroups; grey rings indicate basal HTs of the crowns’ sublines. The number of mutations on a branch is given on its left, unless it is one. In the lower panel the full range of mutations observed after respective coalescence points (mut) and years back to the MRCA (y) under the assumption of a mutation rate of 1.69 × 10−8 site/generation and assuming a generation interval from eight to twelve years (b) is shown. 95% highest posterior density intervals are given in brackets. The position of variant fBOI (indicative for Y-HT1 in Wutke et al.[77]) is marked by an arrow. Details on variants are given in Supplementary Table S8. (b) Mean generation intervals calculated from deep pedigrees from males genotyped for the respective haplogroup. The number of father-son pairs is given with genotyped individuals in parentheses (data in Supplementary Table S10).