Literature DB >> 20926431

The cosmopolitan maternal heritage of the Thoroughbred racehorse breed shows a significant contribution from British and Irish native mares.

M A Bower1, M G Campana, M Whitten, C J Edwards, H Jones, E Barrett, R Cassidy, R E R Nisbet, E W Hill, C J Howe, M Binns.   

Abstract

The paternal origins of Thoroughbred racehorses trace back to a handful of Middle Eastern stallions, imported to the British Isles during the seventeenth century. Yet, few details of the foundation mares were recorded, in many cases not even their names (several different maternal lineages trace back to 'A Royal Mare'). This has fuelled intense speculation over their origins. We examined mitochondrial DNA from 1929 horses to determine the origin of Thoroughbred foundation mares. There is no evidence to support exclusive Arab maternal origins as some historical records have suggested, or a significant importation of Oriental mares (the term used in historic records to refer to Middle East and western Asian breeds including Arab, Akhal-Teke, Barb and Caspian). Instead, we show that Thoroughbred foundation mares had a cosmopolitan European heritage with a far greater contribution from British and Irish Native mares than previously recognized.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20926431      PMCID: PMC3061175          DOI: 10.1098/rsbl.2010.0800

Source DB:  PubMed          Journal:  Biol Lett        ISSN: 1744-9561            Impact factor:   3.703


Introduction

The English Thoroughbred is the best known breed of horse in the western world. Thoroughbreds were developed during the seventeenth and eighteenth centuries in England, largely owing to the enthusiasm of English aristocracy for horse racing and betting [1]. The paternal origins of the breed are well documented as being derived from a handful of Middle Eastern stallions, the most influential of which are Godolphin Arabian, Darley Arabian and Byerley Turk [2]. Yet, the origins of Thoroughbred mares are less well known. The General Studbook (GSB), the breed registry for Thoroughbred horses, first published in 1791 [3], documents Thoroughbred pedigrees back to seventeenth century foundation bloodstock, identifying 74 foundation mares. Present day membership of the GSB requires comprehensive records, including genetic verification of parentage. However, in the early history of the breed, only minimal details of founding mares were recorded, as females were not regarded as important [4]. Since then, the contribution of mares to race performance has been acknowledged [5,6], but the origins of female Thoroughbred lineages are contentious, with a history of intense speculation [7-9]. This speculation is primarily focused on the contribution of Arab and/or ‘Oriental’ mares (the term used in historic records to refer to Middle East and western Asian breeds including Arab, Akhal-Teke, Barb and Caspian [9]). Maternally inherited mitochondrial DNA (mtDNA) has been used for tracing maternal bloodlines in Thoroughbreds [6,10] and to study geographical origins of domestic horses [11-13]. Using mtDNA, we tested four hypotheses for the origins of Thoroughbred maternal lineages: Thoroughbred foundation mares were (i) imported Arabs [7], (ii) Oriental [9], i.e. imported from the Middle East and western Asia; (iii) native to the British Isles [8], and (iv) mares from a variety of origins depending on availability at the time and place.

Material and methods

Whole-genomic DNA was extracted from horse hair roots according to standard protocols. Polymerase chain reactions were set up as previously published [12]. We obtained 247 base pairs of mitochondrial D-loop from 196 Thoroughbred horses and 83 British Native horses (Fell, n = 16; Highland, n = 24; Shetland, n = 43). Sequences were deposited in GenBank (www.ncbi.nlm.nih.gov): Thoroughbred: EU580148–EU580172; Fell: GU563629–GU563645; Highland: GU563646–GU563668; Shetland: GU563669–GU563712. Our data were compared with 1550 horse D-loop sequences available from GenBank (www.ncbi.nlm.nih.gov). Breeds represented by fewer than 10 individuals were not included in analyses. Together, the data represented 30 major Thoroughbred maternal lineages ([10]; 296 Thoroughbreds), 201 Oriental horses (Arab, Akhal-Teke, Barb and Caspian) and 255 British Native and Irish horses (Connemara, Exmoor, Fell, Irish Draught, Kerry Bog and Shire) and horse breeds from across Eurasia (table 1; for details of breed and sample number see electronic supplementary material, table S1). Horses were grouped by geographical population: British Isles (n = 255), Central Asia (n = 38), China and the Far East (n = 339), Eastern Europe (n = 39), Lowlands and Central Europe (n = 153), Mediterranean (n = 435), Middle East and western Asia (n = 201), the North and Russia (n = 72), Scandinavia (n = 25) and Siberia and Mongolia (n = 76) (for details see electronic supplementary material, tables S2 and S3).
Table 1.

The proportion of clades within populations of domestic horses. (Haplotype definitions are after Jansen et al. [11]. Clade C is partitioned into two, named C1 and C2, for consistency with published literature since there is no phylogenetic basis for their amalgamation into a single clade as previously reported [1].)

proportion of clades per populationnA (%)B (%)C1 (%)C2 (%)D (%)E (%)F (%)G (%)H (%)
Thoroughbred horses2961113614552000
British and Irish Native horses2551781783114400
Arab horses914320501801400
Oriental horses (excluding Arabs)1103611512631710
European horses670327583301040
Central and East Asian horses507376621932321
total horses1929298763141320
The proportion of clades within populations of domestic horses. (Haplotype definitions are after Jansen et al. [11]. Clade C is partitioned into two, named C1 and C2, for consistency with published literature since there is no phylogenetic basis for their amalgamation into a single clade as previously reported [1].) Genetic groups (haplotypes) were defined using median-joining networks drawn according to Lei et al. [13]. These handle large datasets effectively, allow for multi-state data [14] and are commonly used for within-species comparisons where sequence variation is limited [15,16]. Population statistics were calculated and AMOVA [17] performed using Arlequin v. 3.11 [18]. Correspondence analyses (CA) were conducted using Adegenet [19]. Neighbour-joining trees were constructed in MEGA v. 4.1 [20]. Mixed stock analysis was performed using SPAM v. 3.7 [21]. SPAM v. 3.7 implements a conditional maximum-likelihood approach to estimate contributions of donor populations (Arab, Oriental and British Natives) to mixed populations (Thoroughbreds). The nomenclature of Jansen et al. [11] was used to define haplotypes within networks (figure 1a). Jansen et al. define haplotypes C1 and C2 as a single clade, however, there is no phylogenetic basis for this. For consistency with published literature, we retained Clade C nomenclature, but present Clades C1 and C2 separately. Associations among haplotype frequencies within and between populations were investigated using correspondence analysis and Fisher's exact tests [22].
Figure 1.

(a) Median-joining network of 1929 mitochondrial D-loop sequences from domestic horses, and (b) neighbour-joining trees based on mean pairwise differences between breeds (scale bar, 0.002) and (c) geographical regions (scale bar, 0.05), including Thoroughbred (purple circles in (a)), British and Irish Native, Arab, Oriental breeds and published sequences from domestic horses from European, Middle Eastern, Asian and Far Eastern populations. Nodes in the network are proportional to the frequency of haplotypes. Haplotypes are defined following Jansen et al. [11].

(a) Median-joining network of 1929 mitochondrial D-loop sequences from domestic horses, and (b) neighbour-joining trees based on mean pairwise differences between breeds (scale bar, 0.002) and (c) geographical regions (scale bar, 0.05), including Thoroughbred (purple circles in (a)), British and Irish Native, Arab, Oriental breeds and published sequences from domestic horses from European, Middle Eastern, Asian and Far Eastern populations. Nodes in the network are proportional to the frequency of haplotypes. Haplotypes are defined following Jansen et al. [11].

Results

Thoroughbreds showed extensive haplotype sharing with Eurasian domestic horses (figure 1a), with the exclusion of Clades F, G and H and the ancestral Clade A6 ([11]; table 1). AMOVA partitioned 90 per cent of total genetic variation among individuals within Thoroughbreds. Therefore, Thoroughbred mares encompass the majority of genetic variation within Eurasian horse populations. These data are consistent with a history of genetic amalgamation, rather than an origin from a single distinct population. Using CA of allele frequencies (figure 2a) and haplotype frequencies (figure 2b), we compared Thoroughbreds with British and Irish Native and Oriental horse breeds (including Arabs) to determine the origins of Thoroughbred foundation mares. CA confirmed the separation of Thoroughbred horses from Arab horses (figure 2a,b), with χ2 distances between Arabs and the average population being greater than that between Thoroughbreds and the average population. Pairwise genetic distances (table 2) showed that Thoroughbreds had closest affinity to Connemara (FST = 0.004) and Irish Draft horses (FST = 0.016) and were distantly related to Arab horses (FST = 0.177) compared with other breeds. This indicates that Thoroughbreds had a cosmopolitan rather than pure-Arabian origin. Furthermore, CA showed that Thoroughbreds had greater affinity to British and Irish Native breeds than Oriental horses, with the exception of Barbs. Pairwise genetic distances (table 2) showed that Thoroughbreds were distantly related to Exmoor horses (FST = 0.267), indicating that British Native breeds were not used indiscriminately.
Figure 2.

Correspondence analyses by (a,c) allele and (b,d) haplogroup frequencies. Associations within and between breeds (a,b) show that Thoroughbred horses have closer affinity to British Native than to Oriental horses (Arab, Barb, Turkmen, Akhal-Teke and Caspian). Associations within and between geographical groupings (c,d) show that Thoroughbred horses have closer affinity to British Native and European horses than Middle East and western Asian (including Arab and Oriental) horses. D-values denote scale of grid; scree plots indicate relative importance of plotted components.

Table 2.

Pairwise genetic distances (FST) between Thoroughbred, Arab, Oriental and British Native horses. (Negative values result from the inaccuracy of the Arlequin v. 3.11 algorithm's estimates when FST values are near zero, especially when combined with the small sample sizes of the Anatolian, Caspian, Connemara and Shire breeds.)

sample sizeArabBarbConnemaraExmoorFellHighlandIrish DraughtKerry BogShetlandShireAnatolianAkhal-TekeCaspianThoroughbred
Arab910
Barb390.145630
Connemara120.08677−0.003800
Exmoor200.090550.300300.198950
Fell170.051120.142970.047700.054440
Highland310.146760.03789−0.008590.212830.076920
Irish Draught590.109990.01385−0.031440.195690.07238−0.003290
Kerry Bog390.093300.187270.095370.071110.006000.106110.113350
Shetland670.087710.107680.061220.131530.037220.075340.087290.028660
Shire100.213920.027570.030680.361800.210610.036760.025520.248630.169640
Anatolian150.035070.08765−0.024860.12612−0.004060.043490.018300.051650.059680.145030
Akhal-Teke430.013650.141280.059450.069900.004300.098870.079400.030260.043440.195410.003050
Caspian130.074850.07342−0.029680.160900.030590.00446−0.011360.071790.078540.11638−0.011690.033220
Thoroughbred2960.177480.031920.004850.266960.140490.024940.016450.181030.14579−0.018550.071620.147160.056630
Pairwise genetic distances (FST) between Thoroughbred, Arab, Oriental and British Native horses. (Negative values result from the inaccuracy of the Arlequin v. 3.11 algorithm's estimates when FST values are near zero, especially when combined with the small sample sizes of the Anatolian, Caspian, Connemara and Shire breeds.) Correspondence analyses by (a,c) allele and (b,d) haplogroup frequencies. Associations within and between breeds (a,b) show that Thoroughbred horses have closer affinity to British Native than to Oriental horses (Arab, Barb, Turkmen, Akhal-Teke and Caspian). Associations within and between geographical groupings (c,d) show that Thoroughbred horses have closer affinity to British Native and European horses than Middle East and western Asian (including Arab and Oriental) horses. D-values denote scale of grid; scree plots indicate relative importance of plotted components. CA of geographical populations (figure 2c,d) show that Thoroughbreds have greater affinity to British Isles and European horses than Middle East and western Asian populations, i.e. Oriental horses. Multiple iterations of neighbour-joining trees based on mean pairwise differences between breeds or geographical regions (figure 1b,c) consistently placed Thoroughbreds with British and Irish Native horses. Based on our data, Thoroughbred foundation mares were not exclusively Arab or Oriental. Rather, Thoroughbred mares were of cosmopolitan European origin, with contribution from Barbs and with British and Irish Native horses playing a greater part in the founding of the Thoroughbred breed than previously recognized. This is supported by the analysis of haplotype sharing. For example, Clade F is strongly associated with Middle East, west Asian and Far Eastern horses, including Oriental breeds (Fisher's exact test: p < 0.00001), yet no Thoroughbred horse sequence lies within Clade F (figure 1). If horses of an Oriental origin made a major contribution to the Thoroughbred, we would expect to find Clade F among the Thoroughbred sequences, if only at low frequency. To estimate the proportion of contribution of Arab, Oriental and British Native horses to Thoroughbred horses, we performed mixed-stock analysis of allele frequencies, using SPAM v. 3.7 [21]. The estimated contribution of British and Irish Native horses was 61 per cent, whereas that of Arabs was 8 per cent. Oriental horses (without Arabs) contributed 31 per cent.

Discussion

Our data demonstrate that Thoroughbred foundation mares were of cosmopolitan European heritage, with contributions from British and Irish Native and Oriental horses. The contribution from British and Irish Native horses is close to twice that of Oriental horses. This British Native maternal influence, is apparent in the current Thoroughbred population, e.g. 2009 Kentucky Derby winner, Mine That Bird, probably has British Native maternal origins, since his founding matriarch, Piping Peg's Dam, foaled in 1690, is Clade C1 based on the haplotype of her direct female descendents (Clade C1 is strongly associated with British Native breeds: Fisher's exact test p < 0.000001). Additional foundation mares came from European horse populations, although we cannot determine precisely which. Our data show a contribution from Barb mares. However, Barb horses have undergone extensive crossbreeding with European horses, including Iberian breeds [23]. Thoroughbred affinity to Barbs may, therefore, reflect this crossbreeding rather than an original contribution. The majority of Thoroughbreds belong to Clade D (55%), previously reported as being associated with Iberian horses [24]. Yet, Clade D is frequent among European horse populations (31%) and thus, we cannot delineate a contribution to Thoroughbred foundation mares from Iberian breeds as opposed to one from European horse breeds as a whole. By contrast, Oriental mares made a limited contribution to Thoroughbred maternal lineages with a minimal contribution from Arabs. Thoroughbred foundation mares, therefore, most likely represent a cross-section of female bloodstock available at each stud participating in the foundation of the breed. While influential Thoroughbred breeders may still claim Thoroughbreds as purely Oriental (specifically Arab), our results argue strongly against this claim.
  14 in total

1.  Median-joining networks for inferring intraspecific phylogenies.

Authors:  H J Bandelt; P Forster; A Röhl
Journal:  Mol Biol Evol       Date:  1999-01       Impact factor: 16.240

2.  Evidence for biogeographic patterning of mitochondrial DNA sequences in Eastern horse populations.

Authors:  A McGahern; M A M Bower; C J Edwards; P O Brophy; G Sulimova; I Zakharov; M Vizuete-Forster; M Levine; S Li; D E MacHugh; E W Hill
Journal:  Anim Genet       Date:  2006-10       Impact factor: 3.169

3.  adegenet: a R package for the multivariate analysis of genetic markers.

Authors:  Thibaut Jombart
Journal:  Bioinformatics       Date:  2008-04-08       Impact factor: 6.937

4.  Worldwide phylogeography of wild boar reveals multiple centers of pig domestication.

Authors:  Greger Larson; Keith Dobney; Umberto Albarella; Meiying Fang; Elizabeth Matisoo-Smith; Judith Robins; Stewart Lowden; Heather Finlayson; Tina Brand; Eske Willerslev; Peter Rowley-Conwy; Leif Andersson; Alan Cooper
Journal:  Science       Date:  2005-03-11       Impact factor: 47.728

5.  Mitochondrial DNA and the origins of the domestic horse.

Authors:  Thomas Jansen; Peter Forster; Marsha A Levine; Hardy Oelke; Matthew Hurles; Colin Renfrew; Jurgen Weber; Klaus Olek
Journal:  Proc Natl Acad Sci U S A       Date:  2002-07-18       Impact factor: 11.205

6.  History and integrity of thoroughbred dam lines revealed in equine mtDNA variation.

Authors:  E W Hill; D G Bradley; M Al-Barody; O Ertugrul; R K Splan; I Zakharov; E P Cunningham
Journal:  Anim Genet       Date:  2002-08       Impact factor: 3.169

7.  Iberian origins of New World horse breeds.

Authors:  Cristina Luís; Cristiane Bastos-Silveira; E Gus Cothran; Maria do Mar Oom
Journal:  J Hered       Date:  2006-02-17       Impact factor: 2.645

8.  Mitochondrial DNA: an important female contribution to thoroughbred racehorse performance.

Authors:  Stephen Paul Harrison; Juan Luis Turrion-Gomez
Journal:  Mitochondrion       Date:  2006-03-03       Impact factor: 4.160

9.  Multiple maternal origins of native modern and ancient horse populations in China.

Authors:  C Z Lei; R Su; M A Bower; C J Edwards; X B Wang; S Weining; L Liu; W M Xie; F Li; R Y Liu; Y S Zhang; C M Zhang; H Chen
Journal:  Anim Genet       Date:  2009-09-10       Impact factor: 3.169

10.  Arlequin (version 3.0): an integrated software package for population genetics data analysis.

Authors:  Laurent Excoffier; Guillaume Laval; Stefan Schneider
Journal:  Evol Bioinform Online       Date:  2007-02-23       Impact factor: 1.625

View more
  18 in total

1.  Mitochondrial genomes from modern horses reveal the major haplogroups that underwent domestication.

Authors:  Alessandro Achilli; Anna Olivieri; Pedro Soares; Hovirag Lancioni; Baharak Hooshiar Kashani; Ugo A Perego; Solomon G Nergadze; Valeria Carossa; Marco Santagostino; Stefano Capomaccio; Michela Felicetti; Walid Al-Achkar; M Cecilia T Penedo; Andrea Verini-Supplizi; Massoud Houshmand; Scott R Woodward; Ornella Semino; Maurizio Silvestrelli; Elena Giulotto; Luísa Pereira; Hans-Jürgen Bandelt; Antonio Torroni
Journal:  Proc Natl Acad Sci U S A       Date:  2012-01-30       Impact factor: 11.205

2.  The genetic origin and history of speed in the Thoroughbred racehorse.

Authors:  Mim A Bower; Beatrice A McGivney; Michael G Campana; Jingjing Gu; Lisa S Andersson; Elizabeth Barrett; Catherine R Davis; Sofia Mikko; Frauke Stock; Valery Voronkova; Daniel G Bradley; Alan G Fahey; Gabriella Lindgren; David E MacHugh; Galina Sulimova; Emmeline W Hill
Journal:  Nat Commun       Date:  2012-01-24       Impact factor: 14.919

3.  Haplotype diversity in the equine myostatin gene with focus on variants associated with race distance propensity and muscle fiber type proportions.

Authors:  Jessica L Petersen; Stephanie J Valberg; James R Mickelson; Molly E McCue
Journal:  Anim Genet       Date:  2014-08-26       Impact factor: 3.169

4.  Molecular phylogeny of Indian horse breeds with special reference to Manipuri pony based on mitochondrial D-loop.

Authors:  Kshetrimayum Miranda Devi; Sankar Kumar Ghosh
Journal:  Mol Biol Rep       Date:  2013-09-26       Impact factor: 2.316

5.  Effect of Myostatin SNP on muscle fiber properties in male Thoroughbred horses during training period.

Authors:  Hirofumi Miyata; Rika Itoh; Fumio Sato; Naoya Takebe; Tetsuro Hada; Teruaki Tozaki
Journal:  J Physiol Sci       Date:  2017-10-20       Impact factor: 2.781

6.  Sequence variants of BIEC2-808543 near LCORL are associated with body composition in Thoroughbreds under training.

Authors:  Teruaki Tozaki; Fumio Sato; Mutsuki Ishimaru; Mio Kikuchi; Hironaga Kakoi; Kei-Ichi Hirota; Shun-Ichi Nagata
Journal:  J Equine Sci       Date:  2016-09-30

7.  Profiling of exercise-induced transcripts in the peripheral blood cells of Thoroughbred horses.

Authors:  Teruaki Tozaki; Mio Kikuchi; Hironaga Kakoi; Kei-Ichi Hirota; Kazutaka Mukai; Hiroko Aida; Seiji Nakamura; Shun-Ichi Nagata
Journal:  J Equine Sci       Date:  2016-12-15

8.  Genetic diversity in the modern horse illustrated from genome-wide SNP data.

Authors:  Jessica L Petersen; James R Mickelson; E Gus Cothran; Lisa S Andersson; Jeanette Axelsson; Ernie Bailey; Danika Bannasch; Matthew M Binns; Alexandre S Borges; Pieter Brama; Artur da Câmara Machado; Ottmar Distl; Michela Felicetti; Laura Fox-Clipsham; Kathryn T Graves; Gérard Guérin; Bianca Haase; Telhisa Hasegawa; Karin Hemmann; Emmeline W Hill; Tosso Leeb; Gabriella Lindgren; Hannes Lohi; Maria Susana Lopes; Beatrice A McGivney; Sofia Mikko; Nicholas Orr; M Cecilia T Penedo; Richard J Piercy; Marja Raekallio; Stefan Rieder; Knut H Røed; Maurizio Silvestrelli; June Swinburne; Teruaki Tozaki; Mark Vaudin; Claire M Wade; Molly E McCue
Journal:  PLoS One       Date:  2013-01-30       Impact factor: 3.240

9.  Identification of genetic variation on the horse y chromosome and the tracing of male founder lineages in modern breeds.

Authors:  Barbara Wallner; Claus Vogl; Priyank Shukla; Joerg P Burgstaller; Thomas Druml; Gottfried Brem
Journal:  PLoS One       Date:  2013-04-03       Impact factor: 3.240

10.  Genomic analysis establishes correlation between growth and laryngeal neuropathy in Thoroughbreds.

Authors:  Adam R Boyko; Samantha A Brooks; Ashley Behan-Braman; Marta Castelhano; Elizabeth Corey; Kyle C Oliveira; June E Swinburne; Rory J Todhunter; Zhiwu Zhang; Dorothy M Ainsworth; Norman Edward Robinson
Journal:  BMC Genomics       Date:  2014-04-03       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.