Y-Chromosome Analysis in Retuertas Horses
Claudia Brandariz-Fontes1,2, Jennifer A. Leonard3*, José Luis Vega-Pla4, Niclas Backström5,
Gabriella Lindgren6, Sebastian Lippold7, Ciro Rico1
1 Estación Biológica de Doñana (EBD-CSIC), Seville, Spain, 2 Facultad de Medicina Veterinaria, Universidad de Panamá, Panama City, Panama, 3 Conservation and
Evolutionary Genetics Group, Estación Biológica de Doñana (EBD-CSIC), Seville, Spain, 4 Laboratorio de Investigación Aplicada, Crı́a Caballar de las Fuerzas Armadas,
Córdoba, Spain, 5 Department of Evolutionary Biology, Uppsala University, Uppsala, Sweden, 6 Department of Animal Breeding and Genetics, Swedish University of
Agricultural Sciences, Uppsala, Sweden, 7 Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
Abstract
Several studies based on a variety of genetic markers have attempted to establish the origins of horse domestication. Thus
far a discrepancy between the results of mitochondrial DNA analysis, which show high levels of diversity, and results from
the Y-chromosome, with almost no genetic variability, has been identified. Most previous work on the horse Y-chromosome
has focused on widespread, popular breeds or local Asian breeds. It is possible that these breeds represent a reduced set of
the genetic variation present in the species. Additional genetic variation may be present in local breeds and ancient feral
populations, such as the Retuertas horse in Spain. In this study we analyzed the Y-chromosome of the Retuertas horse, a
feral horse population on the Iberian Peninsula that is at least several hundred years old, and whose genetic diversity and
morphology suggests that it has been reproductively isolated for a long time. Data from the Retuertas horse was compared
to another 11 breeds from the region (Portugal, Spain and France) or likely of Iberian origin, and then to data from 15 more
breeds from around the globe. We sequenced 31 introns, Zinc finger Y-chromosomal protein (ZFY) and anonymous Y-linked
fragments and genotyped 6 microsatellite loci found on the Y-chromosome. We found no sequence variation among all
individuals and all breeds studied. However, fifteen differences were discovered between our data set and reference
sequences in GenBank. We show that these likely represent errors within the deposited sequences, and suggest that they
should not be used as comparative data for future projects.
Citation: Brandariz-Fontes C, Leonard JA, Vega-Pla JL, Backström N, Lindgren G, et al. (2013) Y-Chromosome Analysis in Retuertas Horses. PLoS ONE 8(5): e64985.
doi:10.1371/journal.pone.0064985
Editor: Michael Hofreiter, University of York, United Kingdom
Received October 19, 2012; Accepted April 19, 2013; Published May 31, 2013
Copyright: ß 2013 Brandariz-Fontes et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits
unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This project had no direct funding, but was supported with technical and logistical assistance by Fernando Hiraldo, Juan Calderón and Juan José Negro
from EBD; the technical assistance from the Doñana Natural Park staff for sample collection; Laboratorio Ecologı́a Molecular, Estación Biológica de Doñana-CSIC;
Laboratorio de Investigación Aplicada, Crı́a Caballar de las Fuerzas Armadas for sample collection and Hans Niclas Backstrom for his assistance optimizing
laboratory procedures. CB-F was supported by the University of Panama and Fundación Carolina. The funders had no role in study design, data collection and
analysis, decision to publish, or preparation of the manuscript.
Competing Interests: The authors have declared that no competing interests exist.
* E-mail:
[email protected]
different alleles of polymorphisms that were present in the
progenitor population should be preserved or fixed in different
isolated populations. The only observations of genetic variation
reported in the modern domestic horse Y-chromosome is in a
study of microsatellite allelic diversity of several local breeds from
China [15] which suggests that Y-chromosome polymorphisms
may be more likely to be found in local breeds or in ancient feral
populations that have not contributed to the formation of the
major popular breeds around the world. In support of that view,
Iberian horse breeds show high levels of overall genetic variation
as compared to other European horse populations [16]. For these
reasons, we hypothesize that the Retuertas horse, a feral
population living in the Doñana National Park in southern Spain,
may harbor Y-chromosome polymorphisms not previously
observed in other horse breeds. This feral population has been
reproductively isolated from other breeds for substantial time and
does not cluster with either of the two major clades of European
and North African breeds based on microsatellites and allozyme
data [17]. Here we scanned for Y-linked genetic polymorphisms in
Retuertas horse using 31 previously reported Y-chromosome
specific loci and we also genotyped six microsatellites located on
the horse Y-chromosome.
Introduction
In stark contrast to the high levels of maternally inherited
mitochondrial DNA variation observed in domestic horses,
previous studies of the paternally inherited Y-chromosome have
reported a deficiency of genetic variation among a wide range of
domestic horses including both warm- and coldblooded breeds [1–
8]. This observed lack of diversity is unexpected given that most
mammals, including other domestic species such as cattle, sheep
and dog [9–13] have at least some variation on the Y-chromosome
and research using homologous Y-chromosome markers has found
them to be variable in other equids [5,6]. Further, a recent study
revealed extensive genetic variability in Y-chromosome sequences
from pre- and early domesticated horses, demonstrating that Ylinked genetic diversity was present in the wild ancestor and in past
populations of domesticated horses [14].
Even if variation was present in early domesticated horses, it
could have been lost quickly through drift within breeds due to low
effective population size as a consequence of breed formation,
especially in males where inter-individual variance in reproductive
output is immense. However, drift should have acted independently within each isolated breed and it is therefore expected that
PLOS ONE | www.plosone.org
1
May 2013 | Volume 8 | Issue 5 | e64985
Y-Chromosome Analysis in Retuertas Horses
Materials and Methods
Table 1. Breeds analyzed in this study.
Study System
The Retuertas horse is a feral population that lives in the
Guadalquivir marshes in southern Spain, including Doñana
National Park, and has been shown to be genetically differentiated
from 10 other Iberian, North African and European breeds using
22 microsatellite and 5 allozyme loci [17]. Their morphological
and physiological features are distinct from Spanish Purebred,
Arabian and other Iberian horses of Celtic origin (e.g. the average
height is only 1.42 m and its adaptable to a hostile marsh
environment). Furthermore, a private allele of a major gene
involved in lipid metabolism (Carboxilesterase) found in a
systematic protein electrophoresis analysis, suggested a long-term
genetic isolation of the Retuertas horse population [17]. This
suggests that this population has remained largely reproductively
isolated from other domestic breeds and likely represents a
population of horses that has survived in the Guadalquivir marshes
in southern Spain for several hundred years [18]. Extensive field
observations and information on management of the herd,
especially males, was used to construct a likely pedigree of all
individuals in the park.
Y-chrom loci*Microsatellites
Geographic region
Retuertas
6
6
Doñana National Park,
Spain
Spanish
Purebred
5
4
Córdoba-Andalusia, Spain
Asturcon
4
4
Breton
3
Losino
4
3
Burgos, Spain
Mallorquin
4
3
Mallorca
Menorquin
4
3
Menorca
Pottok
4
3
Uruguayan
Creole
2
Lusitano
3
Marismeño
5
Lipizzan
2
Asturias, Spain
France
Basque Country, Spain
Uruguay
Portugal
4
Huelva, Spain
Slovenia
Breed name, sample size per analysis, and geographic region of the stallions
from the 12 horse breeds from which new data has been generated here.
Breeds from which no data was previously available are indicated in italics.
*Y-chrom loci are the six fragments with putative SNPs: Amely6, SRY, EcaY2B17,Eca-Y3B1, Eca-Y3B12, Eca-Y3B19.
doi:10.1371/journal.pone.0064985.t001
Ethics Statement
Animal Care and Use Committee approval was not obtained for
this study because no animals were handled specifically for this
experiment. Blood samples were collected by qualified veterinarians through their routine practice, in the framework of official
programs aimed at the identification, health control and parentage
confirmation of the breeds and populations included in our study.
Therefore, the legal restrictions defined in ‘‘Spanish Law 32/2007
of November 7, on the care of animals in their husbandry,
transportation, testing and sacrifice’’ do not apply, as they are
waved in the case of non-experimental procedures and routine
veterinary practices with livestock species, in Article 3d of the
above-mentioned Law.
[9] and SRY [20], and the four anonymous Y-linked fragments
Eca-Y2B17, Eca-Y3B1, Eca-Y3B12 and Eca-Y3B19 [5].
In the second stage, we amplified and Sanger sequenced these
six fragments covering 2.5 kb from an additional 40 stallions
representing 11 additional breeds in order to determine if the
apparently new Y-chromosome variant was also present in other
regional or related breeds. Ten of these breeds have no published
Y chromosome sequences (Table 1). We included 5 stallions from
two previously sequenced breeds (Spanish Purebred and Lippizan)
to verify our findings with those reported in GenBank. Two female
samples were again included as negative PCR controls.
The PCR amplification of Y-chromosome fragments was
performed in 15 ml reactions containing 25–50 ng of template
DNA, 2.5 U BioTaq DNA Polymerase (Bioline), 0.2 mM each of
the four deoxynucleotide triphosphates (dNTPs), 30 ng BSA
(Sigma-Aldrich), 50 mM KCl, 2 mM MgCl2, 10 mM TrisHCl
(pH 8.3), and 15–25 pmoles of each primer (Table 2). Amplification reactions were performed in a MJ Research thermocycler,
Model PTC-100. The PCR thermal cycling conditions consisted
of an initial denaturation at 95uC for 3 minutes (min), followed by
35 cycles of 95uC denaturation for 30 seconds (sec), annealing for
30 sec at 56–60uC (annealing temperatures for each primer
detailed in Table 2) and extension at 72uC for 60 sec, followed by
a final extension at 72uC for 3 min. Alternatively, a touchdown
PCR profile was used, where the annealing temperature ranged
from 55–45uC with a decrease of 0.5uC/cycle for 20 cycles,
followed by a constant annealing temperature of 45uC for 10
cycles. All reactions, including the female controls and reaction
negative controls, were checked on 1.5% agarose gels stained with
CyberSafe dye (Invitrogen).
All primers were confirmed to amplify Y-chromosome-specific
products by using female horse DNA samples as controls.
Successful amplifications were purified using the enzymes exonuclease I and shrimp alkaline phosphatase (USB Corporation,) and
Sanger sequenced with BigDye chemistry (Applied Biosystems) in
both directions using the primers used for amplification. Nucle-
Samples and DNA Extraction
Total genomic DNA was extracted from frozen blood samples
collected for reasons other than this project using QIAmp DNA
Blood Mini Kit (Qiagen), according to the manufacturer’s
instructions.
We investigated 46 male horses from 12 different breeds
currently present in the region (Portugal, Spain and France) or
thought to have Iberian ancestry: Retuertas, Spanish Purebred
(also called Andalusian Horse), Asturcon, Breton, Losino, Mallorquin, Menorquin, Pottok, Uruguayan Creole, Lusitano, Marismeño and Lipizzan (Table 1). DNA from two Retuertas females
was used as a control. Given the pedigree, the six Retuertas males
were selected to represent all paternal lineages present in the
population. In the second step of the analyses the dataset was
expanded with data from previously published reports [7,14].
PCR Amplification and Sequencing of Y-specific
Fragments
Initially we sequenced 31 Y-chromosome introns, ZFY and
anonymous Y-linked fragments reported in [7] from 6 Retuertas
stallions (Table 2). These 31 loci cover more than 12.3 kb in total
and were amplified through polymerase chain reaction (PCR) and
then directly sequenced. DNA from two females was included as a
negative control in the PCRs to verify Y-chromosome specific
amplifications. Six of the 31 fragments yielded an apparently
different sequence in the Retuertas horse from the reference
sequences in GenBank: the two Y-chromosome introns AMELY 6
PLOS ONE | www.plosone.org
Breed
2
May 2013 | Volume 8 | Issue 5 | e64985
Y-Chromosome Analysis in Retuertas Horses
Table 2. Description of horse Y-chromosome nuclear markers.
Locus
L
Tm
Forward Primer (59–39)
Reverse Primer (59–39)
Acc. No.
AMELY1
394
TD
ACATGTTTTTCATTCAGAAATAT
GTTATTGAGGTACTTAAAGTGT
AB091794
AMELY2
358
TD
TTTACTACTTTGAAAAACACTTT
ATTGGATTTTAGGGGTTCTT
AB091794
AMELY3
481
TD
CCCTAAAATCCAATAGGGTT
CATGTATGTAATTAGTCCTT
AB091794
AMELY4
488
TD
CTATTTCACAAGCTTGAATGC
TCACCACATACAAGTCATAAG
AB091794
AMELY6
470
TD
CTTCACGTTCAAATGTGTGAC
TCATTACAGATCACAACATGG
AB091794
AMELY7
215
TD
CATGTTGTGATCTGTAATGAA
AATTAAATGACTTTCTCAGGG
AB091794
AMELY8
478
58
CAAGATGTTTTTCCATTCCTC
TTTTGAAGTGTGGGCATTAAT
AB091794
AMELY9
203
56
ATCTGTAGAAGGGAATTAATG
GAAACTGTGAAAGAGGAATAG
AB091794
AMELY11
245
58
CTCTGAAGTGGTACCAGAACA
ATTGGCTCCATTGACTCTCTG
AB091794
SMCY3
848
56
ATTTACCCTTATGAAATRTTT
TCAAATGGGTGWGTGTACAT
AY532887
SMCY7
341
60
TGGAGGTGCCCRAARTGT
AACTCTGCAAASTRTACTCCT
AY532888
SRY
452
TD
CGGACTTTCTCACGGTGATT
CAAGACTGGTTTCTCACAGC
AB004572
ZFYG
539
60
CCGAAATTGCTGATGAGGTT
TATGTGCAAGAGGGCACTTG
AY532846
ZFYH
579
56
TCTGAACCGGCGAAATGT
TCATCCTACCCAAAGCCAAC
AY532847
ZFY27A
323
58
CTAACTAAAGTTTTCAGTTTTG
AAGATGGAGATATTGCTCTA
AY532848
ZFY43A
435
60
GAAATAAACCCACACATACTCT
TATATGCGTGATGCTTGC
AY532849
ZFY44A
391
56
TGGTTTTGGTGTATTACATC
ATGAAAGGATAAACAAAATG
AY532850
ZFY46A
341
60
TGCACATTTCCTTTAATCT
GCACATTAAAGAGAAACCTT
AY532851
ZFY50A
252
60
TTAAAAAAGACCTTCTCCTA
CCTTTAGCTTTTGTTTTCT
AY532852
ZFY50B
314
60
AAAAGTTATTGTCAGCTTCAGC
TTCTGCCCTTTTCCTCTTC
AY532853
ZFY51A
353
60
CCAGGGAGACAGTGAAAGTAGG
GGTAGGGCACCTTGACTACACT
AY532854
ZFY52A
381
56
ATCCTTTCTTTTATTCCTTT
CATGCAAACTTAACCACTT
AY532855
ZFY53A
414
56
TTTCCTTTCAGTTACCTTTCAT
CTACCTGTTGATGGGATTGA
AY532856
ZFY53B
358
60
AAGGTAAAGTGTCTGTTCCA
AGACTCTCTCAGGAAAACTTAT
AY532857
ZFY55A
342
60
CTGTTTAGCAAATAATTGTT
TAGGCTATATCATGCAGAT
AY532858
ZFY55B
426
58
ATTAGGTGATTGCCTGAT
CAAAACTAGGGAGACAGTAA
AY532859
Eca-Y2B17
438
TD
TTCAGTCCTGCTTTCTCCTCA
CAGGATGTGCCATGTGATTG
G72335
Eca-Y3B1
468
TD
TGGGTTAATGGGATTTGGTG
CAAGCACAGCTCTGTATCAA
G72336
Eca-Y3B8
445
TD
CCCAAGTTCCTTGCCATC
AAATTGAAGAGGCCCCAAAG
G72337
Eca-Y3B12
392
TD
GGGAGGCACTGGAAAGTACA
GGTGGAGGAATCAGCTGGAG
G72338
Eca-Y3B19
215
TD
AAGCCTTTCATGGAAATTGG
TTACGCAGACATCCTGGACA
G72339
These loci all come from Lindgren et al. [7]. The first 12 loci are Y chromosome introns anf have the prefixes AMELY, SMC and SRY. The next 14 loci are based on
subclones from a ZFY-positive BAC clone and have the prefix ZFY. The last five loci are anonymous Y-lined fragments and they have the prefix Eca-Y. Primers for each
locus, along with annealing temperature (Tm), length of amplicon (L) and GenBank accession numbers (Acc. No.) are organized by locus type. The introns, ZFY and
anonymous Y-linked fragments were sequenced with amplification primers.
doi:10.1371/journal.pone.0064985.t002
were available were also obtained (Table S1). In addition to the
reference sequences, two other domestic horse SRY sequences not
associated with any publication are in GenBank (accession
numbers AC215855 and HM103387). Sequences from an ancient
domestic horse were available for all six loci [14]. Additionally, the
original sequences from Lindgren et al. [7] and alignment files
from Lippold et al. [14] including sequences from Wallner et al.
[22] that had not been deposited into GenBank were obtained and
all sequences for each locus were aligned and compared in BioEdit
(Ibis Biosciences, http://www.mbio.ncsu.edu/BioEdit/bioedit)
[23].
otide sequences were determined on an ABI PRISM 313061
automated sequencer (Applied Biosystems).
To verify the results approximately 50% of the samples were
replicated in a second PCR, with the same conditions. These
replications were also sequenced in both directions using the same
primers used for amplification by the commercial service offered
by Macrogen (Macrogen, Korea).
Sequence Analysis
Sequences were edited, assembled and aligned using the
program Sequencher (Gene Codes Corporation). Sequences
generated here were compared to all sequences in GenBank with
the Basic Local Alignment Search Tool (BLAST) (http://blast.
ncbi.nlm.nih.gov) [21]. Reference sequences from the six loci with
putative SNPs were downloaded from GenBank (Table 3).
Homologous sequences from all equid species for which they
PLOS ONE | www.plosone.org
PCR Amplification and Genotyping of Y-chromosome
Specific Microsatellite Markers
We also screened for length variation in six Y-chromosomespecific microsatellite loci [6] in 30 male horses from eight
3
May 2013 | Volume 8 | Issue 5 | e64985
Y-Chromosome Analysis in Retuertas Horses
performed using GeneMapper V3.7 software (Applied Biosystems).
Table 3. Putative polymorphic positions identified in the Ychromosome.
Results
Locus
Acc. No.
Position Polymorphism GenBank This study
AMELY 6
AB091794
2757
Substitution
G
A
2854
Substitution
C
T
2881
Indel
A
–
SRY
Eca-Y2B17
Eca-Y3B1
Eca-Y3B12
Eca-Y3B19
AB004572
G72335
G72336
G72338
G72339
825
Substitution
T
G
826
Substitution
T
G
1033
Indel
–
G
173
Substitution
T
C
460
Substitution
A
G
201
Substitution
G
A
265
Substitution
C
A
421
Substitution
G
A
49
Substitution
G
A
281
Substitution
C
T
115
Substitution
T
A
190
Substitution
G
A
Y-chromosome Fragments
PCR was successful for all six male Retuertas horses at all 31 Ychromosome non-microsatellite loci and no amplicons were found
in female controls. These sequences were completely monomorphic in the six male individuals sequenced. Twenty-five Ychromosome fragments, totaling 9.9 kb, were identical to previously published sequences [5,19,20], available in GenBank
(Table 2). A total 15 single nucleotide differences were identified
when we compared our other six sequences to the sequences from
GenBank (Table 3, Table S1). These were initially considered
putative single nucleotide polymorphisms (SNPs) within the
Retuertas breed. To confirm this unexpectedly high level of
divergence, we sequenced these six putatively polymorphic
fragments, in total 2.5 kb, in 40 additional male horses originating
from 11 breeds: Spanish Purebred, Asturcon, Breton, Losino,
Mallorquin, Menorquin, Pottok, Uruguayan Creole, Lusitano,
Marismeño and Lipizzan. This included two breeds that had
already been sequenced at these loci in another study, the Spanish
Purebred and Lippizan [5]. All PCRs yielded a single amplicon
from the male samples and none from the female controls verifying
Y-linkage. All of the additional 40 male horses sequenced at these
six fragments were identical, both to each other and to the six male
Retuertas horses, and thus yielded the same 15 differences in
comparison to the GenBank sequences (Table S1). These
sequences have previously been reported in the literature from
many individuals from a wide variety of breeds (Table 6) and are
now deposited in GenBank (JX888707–JX888713).
Differences observed when comparing the sequences available in GenBank to
the sequences we generated here and the ones generated by Lindgren et al. [7]
(in total 150 stallions from 25 different horse breeds). The first column lists the
name of the locus, followed by the GenBank accession number (Acc. No.) and
the position in the GenBank reference serquence of the mismatched base pairs.
Polymorphism refers to the nature of the difference- both single nucleotide
sequence differences (substitution) and length differences (indel) were
identified, and that is followed by the state of that base pair in the reference
sequence (GenBank) and that identified here (this study).
doi:10.1371/journal.pone.0064985.t003
Y-chromosome Microsatellite Markers
breeds (Tables 1,4,5), including the Retuertas horse. The loci
screened are: Eca.YA16, Eca.YH12, Eca.YM2, Eca.YP9, Eca.YE1,
and Eca.YJ10 [6]. DNA of two female horses was used as a
control in all PCR reactions. Typing of microsatellite loci was
carried out with tailed primers that were used to add a
phosphorescent dye in a second stage of amplification as
previously described [24,25].
PCR amplifications were performed in 15 ml reactions each
containing 20–30 ng DNA, 5 pmol of specific forward primer with
its M13 tail, 20 pmol of the FAM labeled universal M13 primer,
20 pmol of specific reverse primer, 16 magnesium-free PCR
buffer with KCl, 2.5 mM MgCl2, 1.5 mg BSA, 2 mM dNTPs, and
1.25 U BioTaq DNA Polymerase (Bioline). Amplifications were
carried out using a MJ Research thermocycler, Model PTC-100.
Cycling conditions were done in two stages without opening the
tubes in between. As the M13 primer requires a 53uC annealing
temperature [25], we added eight cycles at the end of the PCR
cycles to incorporate the M13 fluorescently labeled primer with
the previously formed amplicons. Amplification reactions were:
94uC for 1 min, then 25 cycles of 94uC for 30 sec, 54–60uC
(Table 4) for 30 sec and 72uC for 30 sec. This was followed by 8
cycles to incorporate the dye-labeled M13 consisting of: 94uC for
30 sec, 53uC for 30 sec and 72uC for 30 sec, and a final extension
at 72uC for 10 min. To confirm amplification, 5 mL of each
product was electrophoresed on a 2% agarose gel at 100v for
45 min and visualized with CyberSafe dye (Invitrogen). Size of
each PCR product was determined using an ABI 3130 DNA
Sequencer (Applied Biosystems) with GeneScan LIZ 500 internal
size standards (Applied Biosystems). Fragment size analysis was
PLOS ONE | www.plosone.org
All the samples were visually inspected on an agarose gel before
being run on the sequencer to verify the correct amplification of
the six equine Y-chromosome specific microsatellites. Some loci
were sensitive to PCR conditions. Locus Eca.YH12 yielded
multiple amplicons of unexpected size and locus Eca.YJ10
amplified in the female samples at some annealing temperatures
(56–57uC). For this reason locus Eca.YJ10 was discarded from
further analyses. The five microsatellites which passed the
verification process yielded no variation across all 32 male samples
from 8 breeds. This set of markers includes Eca.YA16, a locus at
which variation was reported in a recent study of microsatellite
allelic diversity in Chinese domestic horse breeds [15].
Discussion
Sequence Variation
Consistent with results from previous studies, complete monomorphism of Y-chromosome sequences generated for this study
was identified. All individuals sequenced here matched 100% to
the sequences generated in another lab for a separate study [7].
The only report of Y-linked sequence variation within a domestic
horse is from an ancient individual [14]. However, both standing
genetic variation and fixed differences have been reported within
these loci when other equid species were analyzed [5,7,14], despite
sample sizes of other species being generally very low. Since the
regions were already available in GenBank, some previous studies
did not deposit additional domestic horse sequences, perhaps
because no polymorphisms were found and by assuming that the
already deposited sequences were identical to the additional
sequences generated.
4
May 2013 | Volume 8 | Issue 5 | e64985
Y-Chromosome Analysis in Retuertas Horses
Table 4. Description of horse Y-chromosome specific microsatellite markers.
Locus
AS
Tm Dye
Forward Primer (59–39)
Reverse Primer (59–39)
Acc. No.
Eca.YH12
102
58
FAM
CGAACAGGTGACGAAGCATC
GCAGACATGCACACCAACC
BV005747
Eca.YM2
117
54
PET
TGGTTCAGATGGTGTATTTTGTT
TTTGCAGCCAGTACCTACCTT
BV005725
Eca.YP9
213
54
PET
AAGCACTGCCTTTTGGAATC
AACCCTGGACTTTCTTTTGAA
BV005726
Eca.YE1
196
60
NED
CTTCACTCCCGACCAAGAGA
GTGTGTCGTGCCGTGTTTAC
BV005726
Eca.YJ10
213
60
FAM
AGTTCCCCTGCACACCT
TGCCTCCCACAGCCATAC
BV005728
Eca.YA16
156
60
VZC
TGACTGGAAATTGAAGATG
TTGTAGCAACAAAGTAACAC
BV005729
These loci come from Wallner et al. [6]. Primers for each locus, along with allele size (AS), annealing temperature (Tm), dye label (Dye) and GenBank accession numbers
(Acc. No.).
doi:10.1371/journal.pone.0064985.t004
However, the sequences identified in all of our stallions in this
study and in [7] were identical to one another but differed at 15
unique positions from the previously deposited sequences in
GenBank. The disagreements included 13 single nucleotide
differences and two indels. This level of divergence is highly
unexpected, and is on the order of that observed between
divergent equid species [5,7,14].
A couple of observations lead us to suspect that the sequence
variation observed between studies springs from technical issues
rather than true allele calls, i) sampling of the same breeds between
studies but surprisingly different allele calls, and ii) differences
between studies in how sequence data were obtained (see below).
Two of the breeds sequenced for this study, the Spanish Purebred
and the Lipizzan, were also included in previous studies [5]. Both
previous larger studies using these markers included the Thoroughbred [5,7] and the Thoroughbred sequenced by [7] had the
same sequence observed in the Spanish Purebred, Lipizzan and all
other breeds in [5]. It seems, therefore, highly unlikely that all
individuals from these three breeds processed in one lab would
actually have a different sequence from all stallions of the same
breeds processed in a different lab. This is especially unexpected
for the Thoroughbred, which has a very well recorded breeding
history involving a limited number of sires.
In the course of analyzing our data we were able to compare
our data to an alignment of the sequences generated by Wallner
[22] which included four of the six loci containing putative
differences, Eca-Y2B17, Eca-Y3B1, Eca-Y3B12 and Eca-Y3B19.
These four fragments include nine of the putative mutations.
These alignments contained both vector cloned sequences and
sequences directly obtained from amplification of horse DNA
without a cloning step. Interestingly, only the cloned sequences
match the sequences in GenBank. Because sequences from clones
have a considerably higher error rate due to mis-incorporation of
nucleotides by the Taq polymerase used (10–100 times higher) [26]
than sequences generated directly from a PCR from genomic
DNA, it is common practice to accept sequences from clones only
after they have been encountered from multiple clones. The
sequences in the alignment labeled horse (as opposed to clone) did
not match the clone sequences, and did match the sequence we
obtained at these four fragments. Therefore, it seems likely that the
nine unverified sequence variants were based on single clones
rather than the sequences generated by direct sequencing of
genomic DNA, and it was the clone sequences which were
submitted to GenBank (Table 3).
The reference sequences for locus AMELY6 comes from a
comparatives study in which this fragment of the Y chromosome
was sequenced for several mammalian species [19]. There were
three differences between our sequence and the reference in this
470 base pair fragment: two base pair changes and one indel
(Table 3, Table S1). This study generated sequences for this
fragment from several species, some from direct sequencing of
PCR products, and some through the sequencing of clones from
the PCR products. It looks like all templates were sequenced twice
Table 5. Y chromosome microsatellite loci.
Locus
Repeat motif
Eca.YH12
(GT)131
Ni
Nb
1057 77
Ec
Ep
1,2
Ea
1
96
100
1001
1021,3
Acc. No.
3
BV005747
1101
BV005725
106
1021,3,4
Eca.YM2
(CA)121
1057 77
1161,4 1161
3
117
117
3
213
3
1963
214
1
1971
1192
Eca.YP9
(CA)10 TAT(CA)61
1057 77
3
213
1
(CA)10 TA(CA)65
214
111
3
1121
BV005726
2152
2184
Eca.YE1
(CA)101
1057 77
Eca.YJ106
(CA)3 CG (CA)61
1961,3 1961,3 1911
1992,4
(CA)10CT(CA)65
1027 69
2124
BV005726
1923
2131,3 2133
BV005728
1583
BV005729
1,3
213
Eca.YA16
(TG)3 TA (TG)183 1057 77
(TG)3 TA
(TG)163
(GT)3 TAT (GT)191
(TG)3 TA
1523
2,3
1563
1
156
159
1571
1611
(TG)195
1
Reference [6].
This study.
3
Reference [15].
4
Reference [8].
5
From GenBank reference (Acc. No) sequence listed in last column.
6
Found to amplify in female horses under some conditions.
Nomenclature of loci follows Wallner et al. [6]. The motif was sequenced only in
horse, and then genotyped in horse (Ec), Przwalski’s horse (Ep) and donkey (Ea)
samples independently in a subset of the studies as indicated. The number of
individuals typed (Ni) and number of breeds types (Nb) is a compilation of data
only for horse only from [6,8,15] and this study. Allele sizes reported in the
different studies for the different species are listed. Importantly, the same allele
may be scored differently by different individuals, when run on different
machines, or between different runs on the same machine. Hence, evidence for
polymorphism can only be deduced in cases when multiple alleles have been
reported in the same study.
doi:10.1371/journal.pone.0064985.t005
2
PLOS ONE | www.plosone.org
5
May 2013 | Volume 8 | Issue 5 | e64985
Y-Chromosome Analysis in Retuertas Horses
Table 6. Breeds that have been studied for the six putatively
polymorphic Y chromosome loci.
Breed
Amely6SRY
Y2B17 Y3B1
41
41
41
Akhal Teké
12+21 12+21
12+21 12+21 12+21 12+21
3
Andalusian
5+12
5+12
6
Appaloosa
1
41
5+12
41
5+12
2
5+12
1
2
1
12+41 12+41
12+41 12+41 12+41 12+41
5
Asturcon
4
4
4
4
4
4
4
12
12
12
12
12
1
12
Barb
Breton
12
1
3
3
3
3
3
3
3
Caspian Pony
31
31
31
31
31
31
3
Connemara
41
41
12+41 41
41
12+41
5
1
1
Exmoor
4
Fjord
41
Gotland
41
2
4
4
41
2
1 +4
4
41
41
1
1
2
1 +4
4
41
41
1
1
2
1 +4
41
1
2
1 +4
1
4
41
4
41
4
12+41
5
4
41
41
1
1
1
Icelandic Horse
1 +4
Khuzestan Arab
31
31
31
31
31
31
3
Kladruber
12
12
12
12
12
12
1
Lipizzaner
2+82
2+82
2+82
2+82
2+82
2+82
10
Losino
4
4
4
4
4
4
4
Lusitano
3
3
3
3
3
3
3
Mallorquin
4
4
4
4
4
4
4
Malwari
21
21
21
21
21
21
2
12
1
5
12
Mangalarga
Marchador
Marismeño
5
5
5
5
5
5
Menorquin
4
4
4
4
4
4
4
12
1
12
1
12
Miniature
Mongolian native
horse
12
12
2
2
12
12
12
12
1
2
2
12
1
41
41
12
New Forest Pony
Noric
1
North-Swedish
41
1
41
1
2
41
1
41
1
Amely6SRY
Trotter
4
Arabian
Austrian Warmblood 12
Breed
Y3B12 Y3B19 Total
Ardennais
5+12
41
Table 6. Cont.
Y2B17 Y3B1
1
Uruguayan Creole
2
Other/unidentified
13
Total
119
2
Y3B12 Y3B19 Total
2
2
12
2
2
2
1
2
1
120
134
119
119
134
1
Reference [7].
Reference [5].
3
Reference [19].
4
Reference [20].
List of domestic horse breeds represented in the literature and sequenced here
for the six Y chromosome loci showing differences in sequence between
studies. The original sequences from Lindgren et al. [7] are 100% identical to
the sequences presented in this study.
doi:10.1371/journal.pone.0064985.t006
2
in both directions to ensure there were not sequencing errors.
However, polymerase errors are very much more frequent and
thus problematic than sequencing errors [26,27]. It is possible that
the horse sequence was based on the clone sequences, and this is
how the three differences entered the dataset.
The final locus in which differences were identified between our
sequence and the reference sequence is the 452 base pair fragment
of the SRY gene (Table 3, Table S1). Three differences were
identified in this fragment, two base pair differences and one indel.
In the original publication [20], cDNA was amplified from
testicular RNA and then cloned for sequencing. It is not
mentioned in the publication if more than one clone was
sequenced, and so Taq amplification errors exposed through
sequencing of an insufficient number of clones could also explain
these differences. In addition to this sequence, the same fragment
from horse has also been deposited into GenBank two other times,
although not associated with publications (AC215855 and
HM103387). These sequences matched each other and our
sequences, which suggests that these sequences more accurately
reflect the genomic sequence of this fragment in the domestic
horse. Taken altogether, these observations call into question the
veracity of all 15 putative SNPs and indels.
4
12
12
Microsatellite Variation
Norwegian Fjord
1
Old Wuerttemberger
12
12
1
Oldenburger
12
12
1
Paint
1
2
12
1
Pinto
12
12
1
4
4
The six Y-chromosome microsatellite loci utilized here have
also been employed in other studies involving over 1000 stallions
(Table 5). Allele calling for microsatellite loci is not directly
comparable between studies because different individuals could
call the same allele differently, and the same allele can migrate in
the polymer used to measure amplicons differently on different
machines. For this reason the different sizes reported for the same
locus in the different studies does not necessarily indicate allelic
variation within the species. One study [6] reports three alleles for
locus Eca.YH12 (Table 5), however those three peaks form a
consistent genotype that was found in the same form in all
individuals, and thus was not variable in their sample [6]. Only
one study found two variants for domestic horses within its dataset
[15]. These two alleles at locus Eca.YA16 identified in some local
Chinese breeds is the only indication of standing Y-chromosome
variation in the domestic horse.
Pottok
4
Quarter Horse
1
Retuertas
6
4
2
1
4
2
6
Saddlebred
Shagya Arabian Shire
1
4
2
6
1
4
2
1
6
1
2
1
6
6
12
12
1
2
12
1
1
6
2
Shetland
41
41
41
41
41
41
4
Shetland Pony
12
12
12
12
12
12
1
Tarpan-like horse
12
12
12
12
12
12
1
Thai Pony
21
21
21
21
21
21
2
Thoroughbred
12+41 12+41+14 12+41 12+41 12+41 12+41
Tinker
Trakehner
1
12
12
PLOS ONE | www.plosone.org
2
12
1
12
1
1
12
12
6
2
Domestication and Husbandry of Horses
Genetic and archaeological evidence suggests that horse
domestication occurred more recently than the other primary
Eurasian domesticates, perhaps around 6000 years ago [1,28–31].
6
May 2013 | Volume 8 | Issue 5 | e64985
Y-Chromosome Analysis in Retuertas Horses
However, this is not known with certainty because domesticated
and wild horses are very difficult to differentiate in the
archaeological record [32] and the maternal and paternal genetic
stories are apparently different, with high levels of haplotype
diversity in the maternally-inherited mitochondrial DNA and
almost no variation in the paternally inherited Y-chromosome [1–
8,14,28–30,33–36]. This could suggest recurrent backcrossing
between limited numbers of stallions and numerous captured wild
female horses around the time of domestication.
Our results support previous observations of near complete
monomorphism on the Y-chromosome in extant domestic horses.
This could be explained in various ways: 1) the genetic diversity on
the Y-chromosome was already limited in the wild ancestral
species before the domestication; or 2) genetic variation was lost in
the bottleneck events associated with domestication; or 3) genetic
variation was lost after domestication through genetic drift,
selective breeding (i.e. breed formation and the ‘‘popular sire’’
effect); or 4) strong natural selection (selective sweep) after
domestication. Hypothesis 1 could be supported by several factors
that can explain the low levels of diversity of the Y-chromosome
found in other wild mammals, such as sexual selection, mating
system or sex-biased migration patterns, or other mechanisms that
promote a small male effective population size [37,38]. However,
hypothesis 1 is undermined by the recent finding of abundant
genetic diversity in the Y-chromosome sequences of ancient horses
[14]. Hypothesis 2 is supported by some theories regarding
domestication in which a strongly limited number of individuals
are removed from the wild and closely controlled by humans.
Hypothesis 3 could be supported by the selective breeding system
that is still used today in horse breeds, in which ‘‘popular sires’’
reduce the effective male population size and thus decrease Ychromosome diversity through genetic drift. For example, it is well
documented that all modern Thoroughbred horses can trace back
their pedigrees to three stallions imported to England from the
Middle East in the late 17th and early 18th centuries. Domestication results in strong artificial selection for specific traits. It is
possible that the severely reduced variation on Y-linked loci is a
result of a combination of multiple factors, and further studies on
the genetic variation of early domestic horses are necessary to
critically evaluate their relative importance.
Conclusion
In conclusion, the 15 SNPs in six of the 31 widely studied Ychromosome specific fragments which we identified by comparing
sequences generated in this study from 46 stallions originating
from 12 breeds to sequences from other studies available in
GenBank, likely all represent errors in the database. Our data
support the previously observed absence of sequence variation and
near-absence of microsatellite variation in extant domestic horse
Y-chromosomes. We suggest that the unconfirmed sequences,
GenBank accession AB091794 for AMELY6, AB004572 for SRY,
G72335 for Eca-Y2B17, G72336 for Eca-Y3B1, G72338 for EcaY3B12, and G72339 for Eca-Y3B19, should no longer be used as
reference sequences.
Supporting Information
Table S1 Alignment of the six Y-chromosome specific loci.
(PDF)
Acknowledgments
The authors gratefully acknowledge the support and technical assistance
provided by J. Calderón and D. Lopez as well as all park guards of the
Doñana Biological Reserve. We are also grateful to the technical personnel
of Laboratorio de Investigación Aplicada de Crı́a Caballar de las Fuerzas
Armadas for sample collection. Logistical support was provided by
Laboratorio de Ecologı́a Molecular, Estación Biológica de Doñana, CSIC
(LEM-EBD). We also thank the members of the Conservation and
Evolutionary Genetics Group at EBD for constructive comments.
Author Contributions
Conceived and designed the experiments: CB-F JLV-P CR. Performed the
experiments: CB-F. Analyzed the data: CB-F JAL NB GL SL. Contributed
reagents/materials/analysis tools: JLV-P NB GL SL CR. Wrote the paper:
CB-F JAL. Edited and approved submission of manuscript: CB-F JAL
JLV-P NB GL SL CR.
References
1. Vilà C, Leonard JA, Götherström A, Marklund S, Sandberg K, et al. (2001)
Widespread Origins of Domestic Horse Lineages. Science 291: 474–477.
DOI:10.1126/science.291.5503.474.
2. Cieslak M, Pruvost M, Benecke N, Hofreiter M, Morales A, et al. (2010) Origin
and History of Mitochondrial DNA Lineages in Domestic Horses. PLoS ONE
5(12): e15311. DOI:10.1371/journal.pone.0015311.
3. Lippold S, Matzke N, Reissmann M, Hofreiter M (2011) Whole mitochondrial
genome sequencing of domestic horses reveals incorporation of extensive wild
horse diversity during domestication. BMC Evol Biol 11: 328. DOI:10.1186/
1471–2148–11–328.
4. Santani AB (2004) Genomic analysis of the horse Y-chromosome. PhD thesis,
Texas A & M University, USA.
5. Wallner B, Brem G, Müller M, Achmann R (2003) Fixed nucleotide differences
on the Y-chromosome indicate clear divergence between Equus przewalskii and
Equus caballus. Anim Genet 34: 453–456. DOI:10.1046/j.0268–
9146.2003.01044.x.
6. Wallner B, Piumi F, Brem G, Muller M, Achmann R (2004) Isolation of Ychromosome-specific Microsatellites in the Horse and Cross-species Amplification in the Genus Equus. J Hered 95: 158–164. DOI:10.1093/jhered/esh020.
7. Lindgren G, Backström N, Swinburne J, Hellborg L, Einarsson A, et al. (2004)
Limited number of patrilines in horse domestication. Nat Genet 36: 335–336.
DOI:10.1038/ng1326.
8. Rafeie F, Amirinia C, Nejati Javaremi A, Ziaeddin Mirhoseini S, Amirmozafari
N (2011) A study of patrilineal genetic diversity in Iranian indigenous horse
breeds. Afr J Biotechnol 10(75): 17347–17352. DOI:10.5897/AJB11.1430.
9. Liu W-S, Beattie CW, Ponce de León FA (2003) Bovine Y-chromosome
microsatellite polymorphisms. Cytogenet Genome Res 102: 53–58.
DOI:10.1159/000075725.
10. Meadows JR, Hawken RJ, Kijas JW (2004) Nucleotide diversity on the ovine Y
chromosome. Anim Genet 35(5): 379–385. DOI:10.1111/j.13652052.2004.01180.x.
PLOS ONE | www.plosone.org
11. Bannasch DL, Bannasch MJ, Ryun JR, Famula TR, Pedersen NC (2005) Ychromosome haplotype analysis in purebred dogs. Mamm Genome 16(4): 273–
280. DOI:10.1007/s00335–004–2435–8.
12. Sundqvist A-K, Björnerfeldt S, Leonard JA, Hailer F, Hedhammar Å, et al.
(2006) Unequal contribution of sexes in the origin of dog breeds. Genetics 172:
1121–1128. DOI:10.1534/genetics.105.042358.
13. Ginja C, Gama LT, Penedo MCT (2009) Y-chromosome haplotype analysis in
Portuguese cattle breeds using SNPs and STRs. J Hered 100(2): 148–157.
DOI:10.1093/jhered/esn080.
14. Lippold S, Knapp M, Kuznetsova T, Leonard JA, Benecke N, et al. (2011)
Discovery of lost diversity of paternal horse lineages using ancient DNA. Nat
Commun 2: 450. DOI:10.1038/ncomms1447.
15. Ling Y, Ma Y, Guan W, Cheng Y, Wang Y, et al. (2010) Identification of Ychromosome genetic variations in Chinese indigenous horse breeds. J Hered
101: 639–643. DOI:10.1093/jhered/esq047.
16. Warmuth V, Eriksson A, Bower MA, Cañon J, Cothran G, et al. (2011)
European Domestic Horses Originated in Two Holocene Refugia. PLoS ONE
6(3): e18194. DOI:10.1371/journal.pone.0018194.
17. Vega-Pla JL, Calderón J, Rodrı́guez-Gallardo PP, Martı́nez AM, Rico C (2006)
Saving feral horse populations: does it really matter? A case study of wild horses
from Doñana National Park in southern Spain. Anim Genet 37: 571-578.
DOI:10.1111/j.1365-2052.2006.01533.x.
18. Vega Pla JL, Calderón J, Rodrı́guez Gallardo PP, Alcaide B, Sereno FTPS, et al.
(2005) The Retuertas horse: the ‘‘missing link’’ in the Iberoamerican horse bred
origin? Conservation genetics of endangered horse breeds (EAAP Publication).
Wageningen Academic Press: 167–176.
19. Iwase M, Satta Y, Hirai Y, Hirai H, Imai H (2003) The amelogenin loci span an
ancient pseudoautosomal boundary in diverse mammalian species. Proc Natl
Acad Sci USA 100:5258–5263. DOI:10.1073/pnas.0635848100.
7
May 2013 | Volume 8 | Issue 5 | e64985
Y-Chromosome Analysis in Retuertas Horses
20. Hasegawa T, Ishida M, Harigaya T, Sato F, Ishida N (1999) Linear SRY
transcript in equine testis. J Vet Med Sci 61(1): 97–100. DOI:10.1292/
jvms.61.97.
21. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic Local
Alignment Search Tool. J Mol Biol 215: 403–410.
22. Wallner B (2001) Selektive Klonierung von Y-chromosomalen DNA-Sequenzen
beim Pferd mittels Representational Difference Analysis. Dissertation (Phd
thesis),Vet.- Med. Univ. Wien.
23. Hall TA (1999) BioEdit: a user-friendly biological sequence alignment editor and
analysis program for Windows 95/98/NT. Nucl. Acids. Symp. Ser. 41: 95–98.
24. Schuelke M (2000) An economic method for the fluorescent labelling of PCR
fragments. Nat Biotechnol 18: 233–234. DOI :10.1038/72708.
25. Boutin-Ganache I, Raposo M, Raymond M, Deschepper CF (2001) M13-tailed
primers improve the readability and usability of microsatellite analyses
performed with two different allele-sizing methods. Biotechniques 31: 25–28.
26. Loewen PC, Switala J (1995) Template secondary structure can increase the
error frequency of the dna-polymerase from thermus-aquaticus. Gene 164: 59–
63.
27. Keohavong P, Ling L, Dias C, Thilly WG (1993) Predominant mutations
induced by the Thermococcus litoralis, vent DNA polymerase during DNA
amplification in vitro. PCR Methods Appl 2: 288–292.
28. Lister AM, Kadwell M, Kaagan LM, Jordan WC, Richards MB, et al. (1998)
Ancient and modern DNA in a study of horse domestication. Anc Biomol 2: 267.
29. Jansen T, Forster P, Levine MA, Oelke H, Hurles M, et al. (2002) Mitochondrial
DNA and the origins of the domestic horse. Proc Natl Acad Sci USA 99: 10905–
10910. DOI:10.1073/pnas.152330099.
30. Vilà C, Leonard JA, Beja-Pereira A (2006) Genetic Documentation of Horse and
Donkey Domestication. In Zeder MA, Bradley DG, Emshwiller E, Smith BD.
PLOS ONE | www.plosone.org
31.
32.
33.
34.
35.
36.
37.
38.
8
Documenting Domestication: New Genetic and Archaeological Paradigms.
Berkeley: University of California Press: 245–269.
Outram AK, Stear NA, Bendrey R, Olsen S, Kasparov A, et al. (2009) The
earliest horse harnessing and milking. Science 323: 1332–1335. DOI:10.1126/
science.1168594.
Olsen SL (2006) Early horse domestication on the Eurasian steppe. In Zeder
MA, Bradley DG, Emshwiller E, Smith BD. Documenting Domestication: New
Genetic and Archaeological Paradigms. Berkeley: University of California Press:
245–269.
Ishida N, Oyunsuren T, Mashima S, Mukoyama H, Saitou N (1995)
Mitochondrial-DNA sequences of various species of the genus equus with
special reference to the phylogenetic relationship between Przewalskiis wild
horse and domestic horse. J Mol Evol 41: 180–188. DOI:10.1007/BF00170671.
Kavar T, Dovc P (2008) Domestication of the horse: Genetic relationships
between domestic and wild horses. Livest Sci 116: 1–14. DOI:10.1016/
j.livsci.2008.03.002.
Lau AN, Peng L, Goto H, Chemnick L, Ryder OA, et al. (2009) Horse
Domestication and Conservation Genetics of Przewalski’s Horse Inferred from
Sex Chromosomal and Autosomal Sequences. Mol Biol Evol 26: 199–208.
DOI:10.1093/molbev/msn239.
Achilli A, Olivieri A, Soares P, Lancionia H, Kashani BH, et al. (2012)
Mitochondrial genomes from modern horses reveal the major haplogroups that
underwent domestication. Proc Natl Acad Sci USA 109: 2449–2454.
DOI:10.1073/pnas.1111637109.
Hellborg L, Ellegren H (2004) Low Levels of Nucleotide Diversity in
Mammalian Y-chromosomes. Mol Biol Evol 21: 158–163. DOI:10.1093/
molbev/msh008.
Ellegren H (2009) The different levels of genetic diversity in sex chromosomes
and autosomes. Trends Genet 25: 278–284. DOI:10.1016/j.tig.2009.04.005.
May 2013 | Volume 8 | Issue 5 | e64985