5
Genetic Mapping in Conifers
Kermit Ritland,1,* Konstantin V. Krutovsky,2 Yoshihiko Tsumura,3
Betty Pelgas,4,a Nathalie Isabel4,b and Jean Bousquet5
ABSTRACT
This chapter summarizes the history and current status of genetic
mapping in conifers. We review the development of molecular markers,
methods to construct genetic maps, and the resulting conifer genetic
maps. Genetic maps are subdivided into (1) linkage maps of genetic
markers, (2) quantitative trait loci (QTL) maps, and (3) comparative
maps. Comparative maps involve alignment of marker genes and even
QTLs between species. Physical mapping is also briefly discussed.
Emphasis is placed up problems and approaches unique to conifers,
and the involvement of new genomics technologies.
Keywords: genetic markers, genetic mapping, quantitative trait loci
mapping, comparative mapping
5.1 Introduction
Genetic mapping is the ordering of specific genes or DNA fragments
(genetic markers) along a chromosome, based up observed frequencies of
recombination in pedigrees. It provides the approximate locations of these
1
Department of Forest Sciences, University of British Columbia, Vancouver, British Columbia
V6T 1Z4, Canada; e-mail:
[email protected]
2
Department of Ecosystem Science and Management, Texas A&M University, College Station,
Texas 77843-2138, USA; e-mail:
[email protected]
3
Forestry and Forest Products Research Institute, Tsukuba, Ibaraki 305-8687, Japan;
e-mail:
[email protected]
4
Natural Resources Canada, Canadian Forest Service, Laurentian Forestry Centre, 1055 du
P.E.P.S., P.O. Box 10380, Stn Sainte-Foy, Québec, Québec G1V 4C7, Canada;
a
e-mail:
[email protected]
b
e-mail:
[email protected]
5
Canada Research Chair in Forest and Environmental Genomics, Centre d’étude de la forêt,
Université Laval, Québec, Québec G1V 0A6, Canada; e-mail:
[email protected]
*Corresponding author
Copyright reserved © 2011
Genetic Mapping in Conifers 197
entities, which can serve as DNA “landmarks” for further studies (Ott
1999). Physical mapping, in contrast, uses various molecular techniques to
reassemble the actual DNA into contiguous stretches, such that numbers
of bases separating genes are approximately known, as in for example
the chloroplast genome (Tsumura et al. 1993) and the nuclear genome
(Amarasinghe and Carlson 1998). Quantitative trait loci (QTL) mapping
places the locations of putative genes underlying a quantitative trait
onto a genetic map (Lander and Botstein 1989). Conifers have enormous
genomes, on the order of tens of billions of nucleotides (Murray 1998). This
prohibits physical mapping, and suggests that marker/QTL mapping may
continue to dominate conifer genetics research (Neale et al. 1994; White
et al. 2007). In addition, the conserved nature of conifer evolution places
greater importance on comparing genetic and QTL maps (comparative
mapping) in conifers (Krutovsky et al. 2004) and transferring information
among these species.
Conifers provide unique opportunities but also problems for genetic
mapping. Most notably, the gametophyte allows direct observation of the
haploid product of maternal meiosis (Cairney and Pullman 2007). Secondly,
conifers are outbred, and issues in data analysis arise from the fact that
parents and grandparents are heterozygous for markers and QTL, requiring
more complex approaches for data analysis (Liu 1998). Thirdly, the large
genome size of conifers, a consequence of repeated DNA elements (Morse
et al. 2009), make protocols for marker screening more complex, and the
development of markers more difficult compared to most angiosperms
(Kinlaw and Neale 1997). Finally, the enormous evolutionary distance
between conifers and angiosperms, separated by 300 hundred million years
of evolution (Savard et al. 1994), makes gene identification and annotation in
conifers very difficult (Kirst et al. 2003; Ralph et al. 2008). Here, we review
the current state of conifer genome mapping, with reference to current
advances in genomics studies of conifers.
5.2 Types and Properties of Genetic Markers for Conifers
Over the past 20 years, the increasing availability of molecular genetic
markers such as restriction fragment length polymorphisms (RFLPs),
amplified fragment length polymorphisms (AFLPs), microsatellites or
simple sequence repeats (SSRs), single nucleotide polymorphisms (SNPs),
and conserved orthologous sets (COS), has resulted in the development of
numerous genetic linkage maps in conifers. The important conifer species—
many pines and spruces, Sugi and Douglas-fir—have been mapped, though
marker density is still low in relation to genome size. For a typical conifer, a
map with 1,000 markers would have, on average, 10–40 million nucleotide
sites separating adjacent markers.
Copyright reserved © 2011
198
Genetics, Genomics and Breeding of Conifers
5.2.1 First Generation Markers
Before molecular markers became popular in the 1980s, isozymes or
allozymes were used for molecular population genetic investigations in
conifers. Isozymes are enzymes that differ in amino acid sequence but
catalyze the same chemical reaction. Thus, they are representative of
differences at the DNA level. Isozymes gave the first revelation about
DNA variation in conifers, and for a period centering about the 1980s,
many conifers were the subject of isozyme investigations. The dawn of the
isozyme era was heralded by a seminal 1979 symposia on “Isozymes in
Forest Genetics and Forest Insects” (Conkle 1981). The dusk of the isozyme
era was after the 1990 IUFRO symposium, published in the journal New
Forests, Volume 6, and in book form by Adams (1992), on the more general
topic of “Isozymes in Forest Trees”. These two symposia bookmark this era.
Isozymes have been placed in genetic maps, but they are not numerous
enough to show much linkage. Typically, 20–30 loci are the maximum
number of loci that can be assayed, so that in a typical genome of 2,000–3,000
centiMorgans, few loci will be linked.
Another first generation marker used is the RFLP, a co-dominant
polymorphism for the presence/absence of restriction sites. Bands
were visualized via Southern blots, which require a probe or sequence
complementary to the region about the polymorphism. The RFLP technique
is relatively laborious to develop and implement compared to the more
recent polymerase chain reaction (PCR) based methods. Neverthess, a
number of conifer linkage maps were constructed using RFLP markers
during 1985–1995. This marker is regarded as “first generation”, as their
numbers were still quite limited.
5.2.2 Second Generation Markers
RAPD markers consist of fragments generated via the PCR using a randomly
selected ten base primer (Williams et al. 1990). A number of RAPD maps
were constructed during the period 1992–98. “Random” refers to the fact
that primers are chosen at random, without prior knowledge of any specific
primer sites in the genome. Hence the step of cloning and identification of
specific sequences is skipped. However RAPDs exhibit dominance, wherein
heterozygotes cannot be distinguished from dominant homozygotes (which
are the band phenotype; band-less phenotypes are recessive). Also, the
RAPD gel band patterns often lack reproducibility, making this class of
marker only reliable for studies involving controlled crosses such as genetic
mapping, where segregation ratios can verify proper inheritance. A variant
of RAPDs is “inter simple sequence repeats” (ISSRs), which are randomly
amplified markers produced by PCR amplification with short primers that
Copyright reserved © 2011
Genetic Mapping in Conifers 199
contain both a microsatellite motif and a random sequence (Bornet and
Branchard 2001). These have seen some applications for mapping.
AFLPs are a new class of dominant markers that avoid many of the
pitfalls of RAPDs. Assaying for this marker involves restriction digestion
of genomic DNA, then PCR amplification of a subset of these fragments
(Vos et al. 1995). These markers share many of the characteristics of RAPDs,
including dominance and the appearance of many loci on one gel. But the
fragment patterns are more reliable, and many more fragments per gel are
scoreable. AFLPs have made dense linkage maps possible. However, DNA
fragments generated by this technique differ by as little as a single base,
requiring use of vertical acrylamide gels or automatic fragment analyzers
for clear separation.
Due to the large genome size of conifers, modifications of the
AFLP technique for conifers are needed, as the standard +3/+3 primer
combinations used for AFLP result in too many bands. With large genomes,
one might think that one can limit the pool of selectively amplified DNA
by merely increasing the number of selective nucleotides. Vos et al. (1995)
found that primers with 4 or more added nucleotides actually suffered a
loss of selectivity. For conifers, with their huge genome size, as a means to
select subsets of fragments beyond this limit (and to also increase template
concentrations), Remington et al. (1999) introduced an additional step prior
to the main amplification, termed the “preamplification”. It corresponds
to a normal amplification, but with shorter primer combinations, usually
+1/+1 or +2/+2. Numerous conifer genetic maps have been constructed
with AFLPs since the end of the 1990s.
5.2.3 Third Generation Markers
The last class of markers requires a-priori knowledge of the DNA sequence
at, or around, the marker of interest. A hybrid between second and third
generation marker is characterized-sequence amplified region (SCAR),
which is developed by cloning RAPD or AFLP markers, and finding
the nucleotide sequence about these markers. SCARs are not suited for
mapping, as the procedure is laborious; they are useful for finding candidate
genes closely linked to anonymous RAPD and AFLP markers found linked
to a trait of interest, but they do find themselves occasionally included in
conifer genetic maps.
True third generation markers include SSRs (simple sequence repeats
or microsatellites), expressed sequence tag (EST)-SSRs (microsatellites in
expressed DNA regions), ESTPs (expressed sequence tag polymorphisms,
a marker found in ESTs) and finally, the gold standard, the SNP (single
nucleotide polymorphism, which directly reflects nucleotide polymorphism
at a specific nucleotide site). For further information about markers in plants,
see Ritland and Ritland (2000) and Weising et al. (2005).
Copyright reserved © 2011
200
Genetics, Genomics and Breeding of Conifers
The completion of the genome sequences in model species, and the
accumulation of numerous EST and genomic sequences in many other
species, will provide rich resources for the development of these third
generation markers.
5.2.3.1 Simple Sequence Repeats
SSRs are markers that are polymorphic for the numbers of repeats of a
simple motif (usually 2 to 4 bases long, for example dinucleotide repeats
ATATAT….). SSRs are usually co-dominant, highly variable, and somatically
stable (Morgante and Olivieri 1993). The locus is amplified by primers that
flank the locus. The flanking primers are usually highly species specific.
The cost of finding and designing the primers, which must be done for
every species, does limit the use of this technique. Sometimes SSRs can be
“transferred” to closely related species but at the risk of high null-allele
frequency (when the allele does not amplify due to primer mismatches).
One disadvantage of SSRs is that they cannot be multiplexed very well,
making high density maps impractical.
A special class of SSRs is “EST-SSRs”. These are SSRs found in EST
sequences and because they are in or near coding regions, the primer
regions are more conserved and better able to amplify across species, but
the markers are also less polymorphic (Rungis et al. 2004; Ellis and Burke
2007).
5.2.3.2 Single Nucleotide Polymorphisms
SNPs are clearly becoming the marker of choice for species that have been
subject to genomic work (either for ESTs, or for genome sequencing). This is
because high-throughput genotyping is possible for SNPs, and the number
of SNP loci is virtually unlimited. Common high-throughput genotyping
methods include the Illumina GoldenGate and Infinium assays (www.
illumina.com) (Pavy et al. 2008). However while the genotyping costs are
much lower (typically 5–10 cent per genotype, compared to 50–100 cents
for other methods), the scale of assay (96 SNPs, 480 sample minimum)
makes each experiment a big budget item, on the order of many thousands
of dollars.
The large number of SNPs uncovered by high-throughput technologies
also presents itself with opportunities for marker transfer between species.
Even if few in percentage, valuable anchor loci are provided. In the Arborea
genome project, for a subset of 1,964 SNPs successfully genotyped in eastern
P. glauca, 1,565 (80%) were found polymorphic among 11 western P. glauca
individuals, 728 (37%) among nine P. sitchensis, 386 (20%) among 10 P. abies,
and 321 (16%) among ten P. mariana (J. Bousquet et al. unpubl. data). Also,
Copyright reserved © 2011
Genetic Mapping in Conifers 201
about 10% of SNPs identified in loblolly pine (Pinus taeda L., subgenus Pinus,
section Pinus, subsection Australes) amplified in white spruce (D Neale et al.
unpubl. data); these are currently being utilized in the Treenomix II project
for map synteny comparisons.
5.2.3.3 Conserved Orthologous Sets (COS) and Orthology of Maps
Comparative mapping relies on orthologous markers. The concepts of
orthology and paralogy are essential to construct comparative maps.
Orthologous gene pairs are directly descended from a common ancestor.
Paralogous genes are separated by gene duplication events and may reside in
different locations, but also be very closely linked necessitating sequencing
to ascertain orthology. These concepts are essential for comparing maps
between species (Gogarten and Olendzenski 1999; Koonin 2005; Theissen
2005; Pelgas et al. 2006).
COS markers are genes of low copy number within a genome, and
also have low rate of evolution among species. COS markers are identified
by self-BLASTing ESTs within a species, to identify genes of low copy
number, then cross-BLASTing these sequences among taxa; genes of low
copy number within taxa, and low divergence between taxa, are identified
as COS markers (Fulton et al. 2002).
Krutovsky et al. (2006) identified COS markers for conifers using
sequence comparisons between Arabidopsis, rice, black cottonwood, loblolly
pine, white spruce, Douglas-fir, and sugi. Interestingly, almost half of the
single-copy genes in the non-tree species Arabidopsis and rice had additional
copies and homologs in poplar and conifers. However, laboratory assays
indicate that the high level of evolutionary conservation of COS markers
also results in lower gene diversity within populations, and less available
polymorphism for mapping purposes (Liewlaksaneeyanawin et al. 2009).
In lieu of these difficulties of COS markers, with the larger number of
genes available through automated genomic investigations, there is the
hope that orthologous markers from the huge library of SNPs for conifers
can be identified, to anchor genetic maps (Le Dantec et al. 2004; Pavy et al.
2006; Pavy et al. 2008).
5.2.4 Public Databases for Third Generation Markers
Public databases contain a wealth of in silico data for marker development.
Expressed sequence tags (ESTs) are segments of genes expressed as
messenger RNA. Hence they are most useful for identifying SNPs of
putative function. For ESTs, the most intensively surveyed conifer species
are pine and spruce. NCBI’s Entrez Taxonomy Browser (ncbi.nlm.nih.gov),
Copyright reserved © 2011
202
Genetics, Genomics and Breeding of Conifers
as of September 2010, contained 629,815 ESTs for Pinus and 542,939 ESTs
for Picea. Within Pinus, the numbers of ESTs are (in parenthesis) are: P. taeda
(328,756), P. contorta (40,483), P. banksiana (36,379), P. pinaster (34,044), and P.
radiata (7,538). Within Picea, the numbers are P. glauca (313,110), P. sitchensis
(186,637), P. engelmannii x P. glauca (28,174) and P. abies (14,224). Smaller
EST collections exist for other conifers including the family Cupressacea
(cedars), which has 72,146 ESTs deposited, mainly for Crytomeria japonica.
For in silico SNP development, a large number of ESTs are required, unless
the deposited ESTs are used to design primers to amplify a small panel of
individuals to find SNPs. For pure in silico marker development, a given
gene must have at least four overlapping ESTs, in which case a SNP will
be detected if two of four nucleotide sites differ in base composition (this
mostly rules out sequencing error).
5.3 Mapping Strategies in Conifers
5.3.1 Detecting Recombination
The techniques of marker mapping date from Mendel’s crosses. The pioneer
of genetic mapping, Thomas Hunt Morgan, showed that recombination
frequency can estimate distance separating genes; the distance over which
1% crossover frequency occurs was named by JBS Haldane as the “Morgan”,
and map distances are generally labeled in centiMorgans (cM) (Ott 1999).
In the context of conifer genetics, issues arise about determining
linkage phase. Because conifers are heterozygous, linkage phase cannot be
directly ascertained. For example, in the simplest cross, the “backcross”,
where for two loci A and B, with alleles A1, A2 and B1, B2, respectively, a
cross of grandparent genotype A1A1B1B1 with A2A2B2B2 results in double
heterozygote parent A1A2B1B2. Progeny from a backcross of this genotype
with either grandparent genotypes may reveal recombinants A1A2B1B1 or
A1A1B1B2. In the “intercross”, the double heterozygote parents A1A2B1B2
can be crossed with another double heterozygote. As recombination can be
detected in both parents in the “intercross”, the data are more informative,
up to twice as informative when linkage is tight (Ott 1999). However, this
assumes linkage phase is known, and grandparents are homozygous.
If grandparents are not homozygous, and/or the grandparents are not
genotyped, either single-heterozygote progeny or double-heterozygote
progeny (but not both) can be recombinant. This is analogous to the
inference of haplotypes from diploid population samples as originally
investigated by Clark (1990), in that the phase is indirectly determined by
reference to a population sample. Wu et al. (2002) describe how linkage
phase can be inferred for outcrossing species with unknown heterozygosity
of grandparents, commonly found in conifers. Margarido et al. (2007)
implement this procedure in “OneMap”.
Copyright reserved © 2011
Genetic Mapping in Conifers 203
5.3.2 Assembling Linkage Maps
Lander et al. (1987) popularized genetic mapping with their widely used
software, MAPMAKER followed by MAPMAKER/EXP and its close
descent, MAPMAKER/QTL. Since then, dozens of programs for both
linkage and quantitative trait loci (QTL) mapping have been made freely
available. A comprehensive list of linkage and QTL mapping software
can be found at http://linkage.rockefeller.edu/soft. MAPMAKER starts with a
two-point linkage analysis (recombination estimated between all pairs of
loci). It then uses a “greedy” algorithm, which builds up linkage groups
by sequentially adding markers. This does not guarantee correct orders, so
various permutations of maps are done by “rippling”. The most commonly
used mapping program is JoinMap (Stam 1993), discussed below.
Multipoint linkage analysis takes into consideration the segregation
of many linked markers simultaneously. With this approach, it becomes
possible to identify individual chromosomal breakpoints and establish
order with great certainty (Lathrop et al. 1985). This will become of
increasing importance with the advent of high-resolution mapping of
conifer genomes.
5.3.3 The Pseudo-testcross
With dominant markers, if a locus is heterozygous in one parent and null
(double recessive) in the other, this mimics a testcross with 1:1 segregation
ratios. This was termed a “two-way pseudo-testcross” by Grattapaglia and
Sederoff (1994), and this was meant to resolve the problem with dominance
of RAPD and AFLP markers. It was named “pseudo-testcross” because
while it is a testcross mapping configuration, the mating configuration of
the markers is not known a priori. However, in genetic mapping, one ends
up with a map for the female and a second map for the male. The maps
must be joined in some way.
5.3.4 Joining Maps
The integration of two or more marker genetic maps into a single unified
map, named a “composite” or “consensus” map, requires common markers
that segregate in two or more of the mapping populations. The “pseudotestcross strategy” is a simple case of multiple maps (two). With dominant
markers, one can infer maps for the male and female parent separately as
in Eucalyptus (Grattapaglia and Sederoff 1994). These workers recognized
that multiallelic co-dominant markers with alleles heterozygous in parents
are needed as “locus bridges”. Joining of maps is now a common activity in
Copyright reserved © 2011
204
Genetics, Genomics and Breeding of Conifers
conifers, as much of the pedigree material resides within breeding programs,
which include many small pedigrees in progeny tests or diallel crosses.
Stam (1993) developed a computer program “JoinMap” that joins
pairs of LGs that share the same marker(s) using either raw genetic data or
recombination frequencies. The “JoinMap” algorithm estimates information
about recombination in a given cross from LOD values and then combines
estimates among crosses assuming a binomial sampling distribution. With
more than two pedigrees, joining maps is more complicated. Hu et al.
(2004) presented a likelihood approach for joining genetic maps that uses
a joint likelihood function that combines information across all crosses. The
main advantage of this method is substantially improved accuracy when
dominant or a mixture of dominant and co-dominant markers are used.
A new approach to build verified multilocus consensus genetic maps
in which shared markers are integrated into stable consensus orders
was recently developed by Mester and his colleagues (Mester et al. 2003,
2004, 2006) and implemented into software (http://www.multiqtl.com/). The
approach is based on (1) combined analysis of initial mapping data rather
than manipulating with previously constructed maps, and (2) “synchronized
ordering”, facilitated by cycles of resampling.
However, several pitfalls exist in joining genetic maps, the most
important being differences in recombination rates between pedigrees.
Recombination rates can differ between crosses and individuals due to
environment particularly in stressful conditions where recombination
increases (Agrawal et al. 2005). It can also differ in relation to sex or age,
where recombination is lower in males and in older individuals (Rose and
Baillie 1979). In several pine species, significantly less recombination was
observed for the female gametes than for the male gametes in radiata pine
(Moran et al. 1983), loblolly pine (Groover et al. 1995) and maritime pine
(Plomion and O’Malley 1996). However, Pelgas et al. (2005) observed no
difference in map length between males and females in white spruce, as
did Pelgas et al. (2006) and Pavy et al. (2008) for white and black spruce
pedigrees. This suggests that sex-specific recombination rates may differ
between conifer species. Further investigation is needed on this topic.
Another pitfall in joining maps is that markers can vary in abundance and
distribution. In Norway spruce, low- and high-copy-number markers tend to
occupy separate genome regions (Scotti et al. 2005). Also, microsatellites may
be preferentially associated with nonrepetitive DNA (more coding DNA) in
plant genomes (Morgante et al. 2002). Both of these situations indicate that
joining maps with different classes of markers might be difficult, as common
polymorphic markers between these marker-type classes may not be present
in many parts of the genome.
Copyright reserved © 2011
Genetic Mapping in Conifers 205
5.3.5 Improving the Resolution of Maps
To get beyond the resolution of traditional marker mapping, which is of
5–10 cM resolution for mapping populations of size ca. 100, one can use
larger mapping populations, or else physical mapping. Physical mapping
involves the cloning and mapping (by fingerprinting) of large plasmid
inserts, such as bacterial artificial chromosomes (BACs), normally 150 KB
in length. In conifers, which harbor a 10–40 gigabase genome, this would
require 200,000 BAC clones for a 1× coverage; ideally 2 million BACs
would be needed for a 10 × coverage, as this is the typical required for a
BAC tiling path (Soderlund et al. 2000). At least, the repetitive nature of
the conifer genome would suggest that assembly of BAC fingerprints into
a tiling path is difficult. However, suggestive data indicate that the major
period of repetitive DNA activity (transposition) occured over 100 million
years ago (Mya) (M Morgante et al. unpubl. data). Such a feature would
actually increase the feasibility of genome assembly, since members of the
same repeat class have diverged since transposition. This is a current area
of research in conifer genomics—the nature of low complexity DNA in
conifers and its implication for genome assembly (Nelson et al. 2008).
With high-resolution meiotic maps, a problem is that low frequency of
genotyping error (1.5% or less) can influence mapping outcomes. Such an
error was observed to reduce power to discriminate orders, dramatically
inflate map length, and provide significant support for incorrect over
correct orders (Buetow 1991). Occasional genotype errors skew estimates
of recombination between closely linked loci; a similar situation occurs in
paternity analysis, where just one missscored locus can invalidate the correct
parent. Various workers have since dealt with this issue (Sobel et al. 2002)
and new SNP genotyping methods have shown to be highly accurate, with
error rate below 1% (Pavy et al. 2008).
To increase the rate that meiotic events can be detected, Gasbarra
and Sillanpaa (2006), proposed pooling haploid tissue, such as conifer
megametophytes, to estimate recombination rates between closely linked
loci (< 1 cM). Pools of several hundred were simulated but they found that
several pools were better than a single pool.
Selective mapping approach can facilitate the production of highquality, high-density genome-wide linkage maps (Vision et al. 2000). It
was demonstrated that, to construct a map with high genome-wide marker
density, it is neither necessary nor desirable to genotype all markers in every
individual of a large mapping population. Instead, a reduced sample of
individuals bearing complementary recombinational or radiation-induced
breakpoints may be selected for genotyping subsequent markers from a
large, but sparsely genotyped, mapping population.
Copyright reserved © 2011
206
Genetics, Genomics and Breeding of Conifers
5.4 Conifer Linkage Maps
5.4.1 Overview
Genetic mapping in conifers, and in all other species for that matter, has
progressed through three generations of development, corresponding to
the marker categories described above. The first generation maps involved
allozyme and RFLP markers, which rarely revealed genetic linkage because
of their sparsity. The second generation maps involved anonymous genetic
markers such as RAPDs and AFLPs; “anonymous” in the sense that we
have no idea of their gene function. Nevertheless, complete genetic maps
were inferred, as these markers were so much more numerous. The third
generation maps involved markers of known gene function, mainly SNPs
derived from genome projects. This last wave now allows incredibly detailed
maps of genomes, both with numerous markers, and with markers linked
to genes putatively related to adaptation and other desired traits.
As isozymes are limited in number, they did not play a significant
role in linkage mapping; occasionally a few isozyme markers were added
to other markers in a complete map. Significant effort into developing
RFLP markers for linkage mapping has been done only for Cryptomeria
japonica, Pinus elliottii, P. taeda, P. radiata, and Pseudotsuga menziesii (Table
5-1). Usually RFLPs were analyzed in conjunction with other markers. The
most significant early generation molecular marker map was developed
in loblolly pine. Devey et al. (1994a) reported an RFLP linkage map for
loblolly pine based on a three-generation outbred pedigree. Seventy three
of 90 loci (including two isozymes) clustered into 20 linkage groups (LGs).
Other studies are summarized in Table 5-1.
The first complete linkage maps in conifers, where the number of large
LGs equalled to the haploid number of chromosomes, were made possible
by the advent of RAPD and AFLP markers. In the first application of RAPD
markers for conifer mapping, Tulsieram et al. (1992) mapped 47 of 61 RAPD
markers into 12 LGs in white spruce. Subsequent studies are summarized in
Table 5-1. Genetic maps have been constructed for ca. 12 pine species (Table
5-1: Pinus brutia, caribaea, contorta, densiflora, edulis, elliottii, palustris, pinaster,
radiata, strobus, sylvestris, and taeda). Maps have been constructed for four
spruce species (Table 5-1: Picea abies, glauca, mariana and rubens). Cryptomeria
and Pseudotsuga are two other conifers that have received much attention,
while isolated work has been done with Abies, Cunninghamia, Larix and
Taxus. It should be noted that many of these studies used open-pollinated
seed progeny of an individual tree assayed for haploid megagametophytes
(this also avoids the dominance of RAPD markers). We now discuss more
detailed maps made in the four most important conifer genera.
Copyright reserved © 2011
Copyright reserved © 2011
Table 5-1 Genetic linkage maps in conifers. Linkage groups include at least 3 markers. Expected coverage is the ratio between observed and predicted map sizes estimated following Hulbert et al. (1988) and Chakravarti et al. (1991); NA = not available. The haploid number of chromosomes
in all species is 12 except Cryptomeria japonica (n = 11), and Pseudotsuga menziesii (n = 13). Numbers separated by “/” refer to the maternal/paternal
parents of the mapping population, respectively. Marker types are defined in Section 5.2.
Species
Marker type
P. mariana × P. rubens
AFLP, ESTP, RAPD, SSR
Linkage
groups
Map length,
cM
Expected
coverage, %
cM per
marker
Reference
556
101/94
91
84/119
91/132
438
19
11
13
14/21
19/23
11
1977
2283/2566
887
1112/1756
1266/1992
1372
80
NA
NA
40/62
50/80
96
NA
23/27
10
13/15
16/18
3
Hudson (2005)
Tong and Shi (2004)
Mukai et al. (1995)
Kuramoto et al. (2000)
Nikaido et al. (2000)
Tani et al. (2003)
117
125
165
82
413
755
203/152
17
21
17
13
22
12
27/23
1152
1206
3584
1385
2198
2035
2316/1669
80
81
NA
NA
77
NA
66/79
14
14
22
24
9
3
13/ 13
Arcade et al. (2000)
Arcade et al. (2000)
Binelli and Bucci (1994)
Skov & Wellendorf (1998)
Paglia et al. (1998)
Acheré et al. (2004)
Scotti et al.(2005)
61
144/165
802
821
1301
505
12
19
12
12
12
12
873
2008/2059
1934
2304
2087
1835
NA
73/87
89
98
NA
NA
14
9/15
2.4
2.8
1.6
3.5
1124
12
1846
92
1.6
Tulsieram et al. (1992)
Gosselin et al. (2002)
Pelgas et al. (2006)
Pavy et al. (2008)
Pelgas et al. (2011)
Liewlaksaneeyanawin et al.
(unpubl. data)
Pelgas et al. (2005)
Table 5-1 contd....
Genetic Mapping in Conifers 207
Abies nordmanniana
AFLP, RAPD
Cunninghamia lanceolata AFLP
Cryptomeria japonica
RFLP, RAPD, Isozymes
RAPD
AFLP, CAPS
CAPS, Isozymes, SNP,
RAPD, RFLP, SSR
Larix decidua
AFLP, ISSR, RAPD
L. kaempferi
AFLP, ISSR, RAPD
Picea abies
RAPD
RAPD
AFLP, SSR
AFLP, ESTP, rDNA, SSR
AFLP, IRAP, S-SAP,
ESTP, SSR
P. glauca
RAPD
ESTP, RAPD, SCAR
AFLP, ESTP, SSR
AFLP, ESTP, SNP, SSR
AFLP, ESTP, SNP, SSR
AFLP, ESTP, SSR, COS
Markers
208
Species
Marker type
P. mariana
AFLP, ESTP, SNP, RAPD,
SSR
RAPD
AFLP, SAMPL, ESTP,
SSR
AFLP, SSR
RAPD
AFLP
AFLP
RAPD
RAPD
ESTP, Isozymes, RAPD,
RFLP
AFLP, SSR
RAPD
RAPD
SNP
RAPD
Isozymes, RAPD
AFLP, Isozymes, RAPD
AFLP, ESTP, SSR
AFLP
AFLP, ESTP
RAPD, RFLP, SSR
RFLP, SSR
RAPD, SSR
AFLP, RAPD, SSR
RAPD, SSR, STS
Pinus brutia
P. caribaea
P. contorta
P. densiflora
P. edulis
P. elliottii
P. palustris
P. lambertiana
P. pinaster
P. radiata
P. strobus
Markers
Linkage
groups
Map length,
cM
Expected
coverage, %
cM per
marker
835
12
1850
98
2.2
Pavy et al. (2008)
13
1111
6
12
164
1770
NA
97
NA
1.6
Kaya and Neale (1995)
Kang et al. (2010)
109
225
152
338
73
91
154
27
16
19
22
13
13
15
1658
2287
2341
2012
782
953
1115
88
95
82
85
64
62
NA
16
15
18
9
11
16
7
Shepherd et al. (2003a)
Li and Yeh (2001)
Kim et al. (2005)
Travis et al. (1998)
Nelson et al. (1993)
Kubisiak et al. (1995)
Brown et al. (2001)
78
133
122
399
263
463
23
16
18
19
13
12
12
12
12
14
14
19
21
12
82
85
81
NA
90
93
93
NA
NA
NA
NA
75
56
77
58
15
15
13
3.1
9-10
8.3
1182
620
326
195
173
172
194
101
1170
1635
1368
1231
1380
1860
1873
1994
1441
1639
1382
1223
1117
1144
745
Shepherd et al. (2003a)
Nelson et al. (1994)
Kubisiak et al. (1995)
Jermstad et al. (2010)
Plomion et al. (1995b)
Plomion et al. (1995a)
Costa et al. (2000)
Ritter et al. (2002)
Chagné et al. (2002)
Chagné et al. (2003)
Devey et al. (1996)
Devey et al. (1999)
Kuang et al. (1999)
Wilcox et al. (2001)
Echt and Nelson (1997)
10
NA
NA
7
7
NA
12
14
Reference
Genetics, Genomics and Breeding of Conifers
Copyright reserved © 2011
Table 5-1 contd....
Copyright reserved © 2011
P. sylvestris
P. taeda
P. thunbergii
Pseudotsuga menziesii
261
94/155
188/245
120/112
75
508
357
223
235
302
373
14
15
12/15
21/16
10
12
20
20
12
12
12
2639
796/1335
1696/1719
929/1452
632
1528
1359
1281
1227
1274
1228
NA
77/86
86/99
66/85
NA
99
82
75
NA
NA
NA
10
18/17
9/7
9/12
NA
9
4
4
5
NA
4
462
Yazdani et al. (1995)
Lerceteau et al. (2000)
Yin et al. (2003)
Komulainen et al. (2003)
Devey et al. (1994a)
Remington et al. (1999)
Sewell et al. (1999)
Devey et al. (1999)
Brown et al. (2001)
Krutovsky et al. (2004)
Eckert et al. (2009)
Echt et al. (2011)
207
141
210
132
376
20
17
16
13
22
2085
1062
2279
2143
1859
77-78
NA
91
NA
NA
10
7.5
10
NA
NA
Hayashi et al. (2001)
Jermstad et al. (1998)
Krutovsky et al. (1998)
Carlson et al. (2007)
Krutovsky et al. (2004)
120
41
19
17
939
306
NA
9
Ukrainetz et al. (2008a)
Göçmen et al. (1996)
Genetic Mapping in Conifers 209
Taxus brevifolia
RAPD
AFLP
AFLP
AFLP, ESTP, SSR
Isozymes, RFLP
AFLP
Isozymes, RAPD, RFLP
RFLP, SSR
ESTP, Isozymes, RFLP
ESTP, Isozymes, RFLP
ESTP, Isozymes, RAPD,
RFLP, SNP, SSR
ESTP, Isozymes, RAPD,
RFLP, SSR
AFLP, RAPD
RFLP, RAPD
RAPD
RAPD
ESTP, Isozymes, RAPD,
RFLP, SSR, STS
AFLP
RAPD
210
Genetics, Genomics and Breeding of Conifers
Figure 5-1, based upon a meta-analysis of Table 5-1, shows how the
types of markers used in genetic maps have changed in the past 20 years.
In general, the numbers of maps have declined in the past five years.
From this graph, it is evident that RAPD and ISSR markers predominated
during 1995–2005, but their use has declined, as they cannot be transferred
among pedigrees to build additional maps. AFLPs had a big impact during
2000–2005; again, as they are anonymous markers, but their transferability
is limited. Isozymes (the hobbit in the corner) and RFLPs have had a
constant impact, but their numbers are still limited. SSRs and SNP/ESTP/
STS markers are obviously the markers of choice for future mapping. They
have seen increasing useage. These are all sequence based markers that can
be transferred among pedigrees and species.
Number of maps containing marker
18
16
1990-1994
1995-1999
2000-2004
2005-2009
14
12
10
8
6
4
2
0
AFLP
RAPD/ISSR
Isozyme
RFLP
SSR
SNP/ESTP/STS
Marker type
Marker
type
Figure 5-1 Trends in conifer maps over the past 20 years. I. Usage of the various classes of
genetic markers for conifer genetic mapping.
Figure 5-2, also based upon Table 5-1, shows how the number of markers
used in conifer genetic mapping has increased in the past 20 years. Figure
5-2a shows that the number of markers has obviously increased as expected,
but the variance of the number of markers has also increased. This does not
include plans from spruce and pine genome projects to radically increase
marker number to 5,000 and above. But total map length has remained almost
constant (Fig. 5.2b) as markers separated by 30 cM or less (ca. 100 markers
total) are sufficient to cover genome length. High density marker maps will
be of main use for assembling contigs from genome sequencing projects.
Copyright reserved © 2011
Genetic Mapping in Conifers 211
1400
Number of markers in map
1200
(a)
1000
800
600
400
200
1992
1994
1996
1998
2000
2002
2004
2006
2008
2010
1996
1998
2000
2002
2004
2006
2008
2010
4000
(b)
Total map length
3000
2000
1000
1992
1994
Year
Year
Figure 5-2 Trends in conifer maps over the past 20 years. II. (a) Numbers of markers linked
to maps, (b) Total map length explained by markers.
5.4.1.1 Loblolly Pine
Pinus taeda genome maps generally contain ca. 300 loci based mostly on
the two standard pedigrees: base (Devey et al. 1994b) and qtl (Groover et al.
1994). Mapping data for previously reported SSR, RFLP and ESTP markers
were combined with new SSR markers to generate a loblolly pine consensus
map of 462 markers covering 1,380 cM across 12 LGs, using both the qtl
pedigree (n = 171) and base pedigree (n = 98) (Echt et al. 2011). Of the 234
mapped SSR loci, 171 were newly developed, 81 of which were derived from
Copyright reserved © 2011
212
Genetics, Genomics and Breeding of Conifers
EST sequence data. Marker data were obtained for an additional 50 new
EST-SSR loci that did not segregate in either mapping population but which
were polymorphic in population surveys. One hundred and ninety four
mapped loci were given a functional GO assignment; 242 mapped loci were
assigned to a NCBI UniGene cluster. Unigene and GO assignments, along
with linkage data, aided in identifying duplicated and paralogous marker
loci on the map. This species may serve as a reference map in comparative
mapping with other pines and even other members of the Pinaceae family
such as spruce and Douglas-fir.
5.4.1.2 Spruce
Linkage mapping in spruce (Picea spp.) has been directed toward three species
of major economic importance: Picea abies, a European species, and P. glauca
and P. mariana, both primarily North American species. The first saturated
composite map for white spruce was reported by Gosselin et al. (2002), who
used 165 RAPD, SCAR and ESTP markers to join maps from two individuals.
They noted that co-dominant markers were needed to join the maps. In
Norway spruce, Acheré et al. (2004) developed the second map, involving
755 markers. Interestingly, 150 of these markers were tested for their pattern
of population differentiation differing from neutral expectations, and nine of
these markers were found to be “outliers”, or genes that showed excessive
population divergence, compared to the majority of markers, suggesting they
were linked to QTLs for adaptation (Acheré et al. 2005).
More recently, the Arborea project in Canada has constructed several
linkage maps involving both individual and composite maps for white
spruce and black spruce. A map for the black spruce × red spruce species
complex was constructed (Pelgas et al. 2005), and for white spruce alone
(Pelgas et al. 2006). Most notably, Pavy et al. (2008) assembled a white
spruce linkage map with markers assayed via the Illumina GoldenGate
SNP genotyping platform. The resulting composite map had 821 loci
including 461 AFLPs, 12 SSRs, 31 ESTPs and 317 gene SNPs, and map
coverage was > 98%. This map also positioned genes with SNPs involved in
among-population differentiation of eastern white spruce; 50 outlier SNPs
were identified (Namroud et al. 2008); these genes are putatively involved
in adaptive differentiation. An expanded white spruce composite map
containing 836 gene loci has recently been published (Pelgas et al. 2011).
The most recent white spruce gene composite map emerging from
the Arborea project integrates two pedigrees of 500 progeny and has an
increased resolution of 0.9 cM with 2,255 positioned loci including 455
AFLPs, 12 SSRs and 1,788 gene SNPs. The map covers 2,065.4 cM over 12
Copyright reserved © 2011
Genetic Mapping in Conifers 213
LGs. The average gene density is 1.16 cM. The current published spruce map
has 826 genes; the largest number of mapped genes in a conifer species.
5.4.1.3 Douglas-fir
In Pseudotsuga menziesii, the most recent marker development has focused
on ESTP and SNP markers (Krutovsky et al. 2004), which together with SSR
markers, have added to the RFLP and RAPD linkage maps (Jermstad et al.
1998). The most recently published genetic map of Douglas-fir consists of
376 markers, including 172 RFLP, 77 RAPD, 2 isozyme, 20 SSR, 4 sequence
tagged site (STS), and 101 expressed sequence tag (EST) markers (Krutovsky
et al. 2004). This map is organized into 22 LGs that have three or more
linked markers and spans 1,859 cM. Several hundred SNP markers were
developed recently (Eckert et al. 2009), and their mapping is under way.
When enough markers are mapped, the number of LGs should coalesce
into 13, corresponding to the 13 chromosome pairs in Douglas-fir. It would
be valuable to map additional ESTP, EST-SSR and SNP markers to create a
high-density map that can be used for QTL, candidate gene and physical
mapping to facilitate eventual complete Douglas-fir genome sequencing.
5.4.1.4 Sugi
Sugi (Cryptomeria japonica) has been planted widely throughout Japan over
an area of 4.5 million ha, accounting for 44% of all the Japanese artificial
forest. A second generation linkage map for Sugi was constructed by
integrating linkage data from two unrelated third-generation pedigrees.
The progeny segregation data of the first pedigree, which involved a cross
between half-sibs, were derived from cleaved amplified polymorphic
sequences (CAPS), SSRs, RFLPs, and SNPs (Tsumura et al. 1997; Iwata et
al. 2001). The data of the second pedigree, which involved a self-pollinated
individual, were derived from CAPS, isozyme markers, morphological
traits, RAPDs, and RFLPs. The co-dominant DNA markers such as CAPS,
RFLP and SNP were developed from ESTs and cDNA clones from several
kinds of cDNA libraries (Ujino-Ihara et al. 2000; Ujino-Ihara et al. 2005).
More than 95 % of the markers were gene-based markers.
Using JoinMap, linkage analyses were done for the first pedigree
assuming cross-pollination, and for the second pedigree assuming selfing.
Four hundred and thirty eight markers were assigned to 11 large LGs
(corresponding to the 11 chromosomes of C. japonica), 1 small LG, and
1 non-integrated LG from the second pedigree; their total length was
Copyright reserved © 2011
214
Genetics, Genomics and Breeding of Conifers
1,372.2 cM (Tani et al. 2003). On average, the consensus map showed one
marker every 3.0 cM. PCR-based co-dominant DNA marker such as CAPS,
microsatellite and SNP were distributed over all LGs and represented about
a half of mapped loci.
5.4.2 Genome Sizes
Besides providing a linear map of markers along a genome, mapping
experiments can also provide estimates of genome size, in terms of map
units. Hulbert et al. (1988) gave the first estimate of genome size based
upon observed recombination between randomly selected pairs of markers.
Chakravarti et al. (1991) improved this with a maximum likelihood method
for estimating genome size. Many conifer mapping studies have provided
estimate of genome size from either method; estimates range from ca. 2,000
to 3,000 map units. Relatively few numbers of markers can estimate genome
size, as long as some are linked.
Genome size can also be estimated by flow cytometry, in terms of
picograms (pg) of DNA per nucleus, which can be translated into millions
of base pairs using the relationship 1 pg = 978 million base pairs. This gives
an idea of how many nucleotides separate linked markers. Genome size in
the Pinaceae ranges from 5.8 to 32.2 pg with 20 pg (20 billion base pairs)
a rough average (Murray 1998); this is 100 times larger than Arabidopsis
thaliana (0.18 pg).
Genome evolution in the gymnosperm lineage of seed plants has given
rise to many of the most complex and largest plant genomes; however the
elements involved are poorly understood. Most of the enormous genome
complexity of pines can be explained by divergence of retrotransposons
(Morse et al. 2009); however the elements responsible for genome size
variation are yet to be identified. This is currently a very active area of
research in conifer genomics.
5.4.3 Physical Mapping Opportunities
Physical mapping complements genetic mapping. Unfortunately the large
physical genome size of conifers as just described prohibits most of these
approaches. Approaches that are free from constraints from large genome
size involve hybridization of certain genes to chromosomes. Earlier works
used fluorescence in situ hybridization (FISH) experiments to identify
location and distribution of ribosomal RNA. In Sitka spruce, 5s rDNA was
found to be restricted to one chromosome, whereas 18S-5.8S-26S rDNA
repeats occurred on five chromosomes (Brown and Carlson 1997). Both
distribution and location of large tandem repeats on the genome of white
spruce and Sitka spruce were comparable (Brown et al. 1998). A reference
Copyright reserved © 2011
Genetic Mapping in Conifers 215
karyotype was presented recently for loblolly pine based on FISH and using
18S–28S rDNA, 5S rDNA, and an Arabidopsis-type telomere repeat sequence,
A-type TRS signals (Islam-Faridi et al. 2007). Statistically, only seven of
the 12 loblolly pine chromosomes could be distinguished by their relative
lengths. However, the position and relative strength of the rDNA and
telomeric sites made it possible to uniquely identify all of the chromosomes,
providing a reference karyotype for use in comparative genome analyses.
A dichotomous key was developed to aid in the identification of loblolly pine
chromosomes and their comparison to chromosomes of other Pinus spp.
A cytomolecular map was developed using the interstitial 18S–28S rDNA
and A-type TRS signals. A total of 54 bins were assigned, ranging from
three to five bins per chromosome. This is the first report of a chromosomeanchored physical map for a conifer that includes a dichotomous key for
accurate and consistent identification of the loblolly pine chromosomes.
Recently, bacterial artificial chromosome (BAC) hybridization has been
developed as an alternative to rDNA hybridization, which allows very specific
identification of chromosomes, and such methods would be fruitful to apply
to conifers, particularly the Pinaceae, as chromosomal morphology is hardly
distinguishable among the dozen or so chromosomes. This method has been
used in many plant species (Zhang et al. 2004) but not in conifer.
The normal activity of physical mapping is to construct a library of
inserts, then to construct “tiling paths” to obtain an ordered set of clonal
inserts that span the entire genome. For coverage of a conifer genome
(5–10×), about two million BAC clones are needed, too large for practical
work. Nevertheless, BACs are useful for conifers, and there are currently
BAC libraries available for white spruce and loblolly pine. The spruce
library is unarrayed and about 5× coverage, while the loblolly pine library
is arrayed and about 8× coverage (Liu et al. 2009). Currently, both random
BACs and targeted BACs (BAC identified as having a gene of interest) are
being sequenced from both libraries (J MacKay et al. unpubl. data; DG
Peterson et al. unpubl. data; K Ritland et al. unpubl. data).
5.5 Conifer Comparative Mapping
Alignments of genetic or QTL maps among species demonstrate the
evolutionary conservation of gene linkages among species. An early
paradigm was set by work with the grass family (Gale and Devos 1998).
Conserved chromosomal number in the pines family (Pinaceae) suggested
that similar comparisons could be made in pine family members. The
“Conifer Comparative Genomics Project” organized by David Neale and
his colleagues at UC Davis has verified that such approaches can be used in
conifers (e.g., Krutovsky et al. 2004). The end goal is to transfer information
between species about co-localization of QTL and candidate genes among
species. In genome sequencing projects, it also predicts the reliability that
Copyright reserved © 2011
216
Genetics, Genomics and Breeding of Conifers
related conifer genomes can be “resequenced”, once a reference genome
is sequenced.
To facilitate the identification of orthologous markers for comparative
mapping, sequence-based gene markers such as ESTPs and SNPs are best
because they are usually orthologous across congeneric species, and more
reliable than anonymous markers. Hidden paralogy is the ghost of map
construction (Huynen and Bork 1998; Remm et al. 2001; Pelgas et al. 2006).
To reduce the risk of paralogous amplification, primer pairs should be
designed with a primer matching in the 3’ UTR gene region (e.g., Perry
and Bousquet 1998; Brown et al. 2001; Chagné et al. 2003; Pavy et al.
2008). In conifers, resequencing from megagametophyte DNA indicates
paralogous polymorphisms by the presence of double peaks on sequence
chromatograms (Pelgas et al. 2004).
Until recently, limited numbers of orthologous markers were available
for useful map comparisons. SNPs are virtually in unlimited number.
Because they can be annotated and are dense along linkage maps, SNPs can
better determine gene orthology, and serve as anchor markers for intra- and
interspecific map comparisons (Pelgas et al. 2006; Pavy et al. 2008).
5.5.1 Pine Species Comparisons
Historically, the most extensive genetic maps have involved loblolly pine.
Detailed comparative maps are needed to study conifer genome evolution
and to leverage genomic information of adaptive and economic traits from the
relatively well-studied species, such as loblolly pine, to other conifers. Most
comparative maps among Pinus species are within the subgenus Pinus and
based on comparisons of ESTP markers. They contain 41 common loci between
P. taeda and P. sylvestris (Komulainen et al. 2003) and 32 common loci between
P. taeda and P. pinaster (Chagné et al. 2004). Both of these studies used prior
published P. taeda maps (Krutovsky et al. 2004). Recently, maps from the
two subgenera of Strobus and Pinus could be compared, based on neaerly
400 gene SNPs (Jermstad et al. 2010). All 19 linkage groups of P. lambertiana
co-aligned with the 12 linkage groups of P. taeda, providing a basis for
integrated structural genomics approaches across pine subgenera.
5.5.2 Spruce Species Comparisons
The first comparative map of white spruce (Pelgas et al. 2006) revealed
remarkable synteny with black spruce (P. mariana) and Norway spruce
(P. abies); identical LGs and conservation of gene content and gene order
was found. One breakdown of synteny between P. glauca and the other
taxa involved an inter-chromosomal rearrangement of an insertional
translocation. Analysis of marker colinearity also revealed a putative
segmental duplication. This three-species comparison showed that genome
Copyright reserved © 2011
Genetic Mapping in Conifers 217
comparisons among Picea species can provide a platform for transfer of
genomic information across species of spruce.
More recently, a detailed analysis of synteny and macro-colinearity
between P. glauca and P. mariana, using 215 anchor markers, consisting
mainly SNPs, found that 98% of the anchor genes were in synteny (Pavy
et al. 2008). Translocations were validated in the case of previously
reported PgMyb4, and three new translocations involving three genes
were indicated. However, the sequencing of haploid megagametophytes
for these genes indicated that these new cases were likely false positives,
involving paralogous variation. Macro-colinearity was also well conserved
among homologous LGs between species, with 82% of syntenic anchor
markers positioned in the same order. Exceptions to colinearity involved
small inversions also observed between individual maps within species,
indicating that that most of these inversions were artefacts.
Figure 5-3 shows a relatively high density genetic map for both white
and black spruce (LGs III-VI only), with the maps also aligned between
the two species. Map distances in centiMorgan are indicated with a scale
on the left side. The composite map of each species was obtained by first
assembling two parental datasets for each species, using JoinMap (Stam
1993); then maps were aligned between species using common markers.
There are five types of markers in these maps: SNPs (bold), ESTPs (bold
and underlined), SSRs (bold and italics), RAPDs (italics and underlined)
and AFLPs (others). Typically AFLPs are the most in such maps with
several types of markers, but they are not useful for joining maps between
species (the loci are named after the primer combination used and the
band migration distance). Syntenic marker loci between spruce species
are indicated in black, and these are typically gene-based markers. These
syntenic markers are identified with a red solid line (colinear markers) or
a red dashed line (non-colinear markers). Orthologous markers positioned
onto non-homologous LGs are indicated in white with red background and
paralogous markers are identified in white with blue background. Overall,
there is a remarkable preservation of gene order between white and black
spruce, and the exceptions may be mistaken cases of orthology and merit
further investigation.
5.5.3 Pine Family Species Comparisons
The first intergeneric comparative map in conifers was constructed between
loblolly pine and Douglas-fir with ESTP and RFLP markers (Krutovsky
et al. 2004). Comparison of Douglas-fir and loblolly pine maps revealed
10 LGs (LG1–LG10) in loblolly pine that shared 2–10 orthologous markers
with 12 apparently syntenic LGs in Douglas-fir based on 46 orthologous
markers. The comparisons revealed extensive synteny and colinearity of
Copyright reserved © 2011
218
Genetics, Genomics and Breeding of Conifers
Figure 5-3 Comparison of homologous linkage groups between white spruce (Picea glauca)
and black spruce (species complex Picea mariana × P. rubens).
Color image of this figure appears in the color plate section at the end of the book.
gene order between the two genomes, consistent with the hypothesis of
conservative chromosomal evolution among even distantly related species
in the Pinaceae family. This study established a working framework that
the Pinaceae can be viewed as a single genetic system.
Homology of Pinaceae LGs was more recently extended to three spruce
species (Pelgas et al. 2006). Between spruce and loblolly pine, 26 of 29 anchor
markers were in synteny, identifying 11 homologous LGs. In this study,
orthology of anchor gene markers was checked by extensive resequencing
of single haploid megagametophytes in the various species. For the three
exceptions to synteny, sequencing of megagametophytes indicated at least
two cases of paralogy, while the third case remained dubious, implicating a
conserved gene family. Between spruce and Douglas-fir, synteny could be
assessed with 20 anchor markers, of which just one proved to be paralogous
after megagametophyte resequencing. Of the remaining markers, three were
not in synteny, including two markers on LG13 of Douglas-fir, confirming
that the supernumerary chromosome of Douglas-fir is the result of fission
(Krutovsky et al. 2004; Pelgas et al. 2005). The remaining marker, in synteny
between spruce and lodgepole pine, was translocated to a different LG in
Copyright reserved © 2011
Genetic Mapping in Conifers 219
Douglas-fir, thus indicating that chromosome rearrangements occurred in
the lineage leading to Douglas-fir. This study established rigorous criteria
for determining orthology of genetic markers among species, and only after
this criteria is met, can we make reliable inferences about chromosomal
rearrangements among species.
Figure 5-4 shows a recent syntenic map for Douglas-fir, loblolly pine and
Norway spruce. This was identified as LG6 of loblolly pine, as the high level
of synteny and conservation of gene order allows homologous LGs among
pine species to be identified (Neale and Krutovsky 2004). Orthologous
comparative mapping markers are underlined and shown in bold (this is
based upon unpublished data kindly provided by Craig Echt, USDA Forest
Service, Southern Institute of Forest Genetics, Saucier, Mississippi, USA [for
pine] and by Michela Troggio, IASMA Research and Innovation Centre,
San Michele, Italy [for spruce]). Overall, the alignment of maps between
species separated by over 100 million years of evolution is remarkable
Pseudotsuga menziesii
0
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
105
110
115
120
125
130
135
140
145
150
155
estPmIFG_119D01_c
rflpPmIFG_1588_a
estPmIFG_111F09_a
estPmIFG_144D01_a
rflpPmIFG_1504_a
ssrPmOSU_1C3
rflpPmIFG_1599_a
rflpPmIFG_1185_a
rapdOSU_OP_AE12_1630
rflpPmIFG_1104_b(PhyN)
rflpPmIFG_1339_a
rflpPmIFG_1420_a
rflpPmIFG_1075_a
rflpPmIFG_1009_b
rflpPmIFG_1009_a
estPmIFG_014A07_a
estPtIFG_0739_a
rflpPmIFG_1545_a
rflpPmIFG_1439_a
estPmaLU_SB07_a
rapdOSU_BC_309_550
rflpPmIFG_1407_a
estPmaLU_SB42_a
estPmIFG_109F09_a
estPmIFG_113C11_a
estPmIFG_101B05_a
rflpPmIFG_0102_a
rapdOSU_OP_G05_540
rflpPmIFG_1506_a
rflpPtIFG_2969_b
estPmIFG_201D12_a
estPtNCS_ctg3_a
estPpINR_AS01D10_a
estPtIFG_8415_e
Pinus taeda
estPtIFG_23C5_a
ssrPtTX3055_a
ssrPtRIP_0567_a
Picea abies
estPtIFG_8531_a
rflpPtIFG_2802_3
estPtIFG_SB12_a
estPtIFG_8647_a
estPtIFG_8972_a
rflpPtIFG_2291_A
ssrPtRIP_0619_a
ssrPtHBy_F1R1A-S1
ssrPtRIP_0609_a
estPtIFG_2358_a
estPtINR_PAL1_a
rflpPtIFG_1918_A
rflpPtIFG_2723_A
estPtIFG_1165_a
ssrPtRIP_0990_a
ssrPtNZPR0290_a
ssrPtSIFG_0635_a
ssrPpSIFG_3147_a
ssrPtTX4137_a
rflpPtIFG_2610_A
aflpPaSRC_pst71536_a
aflpPaSRC_pmc5501_a
ssrPaUDI_EATC1C09_d
ltrPaUDI_LTR006_a
estPmaLU_SB12_a
estPaTUM_PA0043_a
estPtINR_PAL1_a
estPmIFG_111F09_a
estPtINR_PAL1_b
aflpPaSRC_OA070680_a
estPtIFG1956_a
ssrPaUDI_EAC7H07_a
aflpPaSRC_pst80481_a
aflpPaSRC_pst80483_a
estPtIFG_739a
estPmIFG_14A07_a
ssrPaUDI_SpAC1F07_a
estPtIFG_1950_a
estPtIFG_2610E(S)_a
estPtIFG_1764_a
estPtIFG_8473_a
ssrPaUDI_EAC7B09_b
rflpPtIFG_2874_1
isoSkdh_1
ssrPtNZPR0116_a
estPmaLU_SB42_a
estPtIFG_0739_a
estPmIFG_113C11_a
rflpPtIFG_2090_1
ssrPtTX4062_a
ssrPaUDI_EAC6D11_a
aflpPaSRC_pma5701_a
ltrPaUDI_LTR024_a
estPtIFG_8564_a
ssrPaUDI_SpAC1H08_a
estPtIFG_9044_a
aflpPaSRC_pmc5011_a
ssrPaUDI_alE43129_a
rflpPtIFG_1902_1
ssrPtSIFG_4315_a
estPtIFG_8702_a
rflpPtIFG_4D4_A
estPtX_LP15(A)_a
estPtNCS_ctg3_a
ssrPtRIP_0960_a
rflpPtIFG_606_1
estPtIFG_0606_a
ssrPtTX3045_a
rflpPtIFG_2009_A
estPtIFG_2009_a
Figure 5-4 Comparison of homologous linkage groups between Douglas-fir (Pseudotsuga
menziesii), loblolly pine (Pinus taeda) and Norway spruce (Picea abies).
Copyright reserved © 2011
220
Genetics, Genomics and Breeding of Conifers
for the plant kingdom, and suggests that the pine family (Pinaceae) can
be viewed as one genetic system, allowing genomic information to be
readily transferred, in contrast to angiosperm species with even one-tenth
evolutionary separation.
However, on these maps, there are three instances of apparent segmental
inversions, two between Douglas-fir and pine, and one between pine and
spruce. A case where a pair of markers is reversed is likely due to mistaken
orthology. However, between Douglas-fir and pine, four markers are
involved with an apparent rearrangement (involving orthologous markers
3–6 in pine, which are linked to Douglas-fir). To have four, instead of two,
markers involved in an apparent inversion provide much stronger evidence
of true orthology. This suggests that the genetic system is less homologous
in Douglas-fir, as indeed its time since evolutionary divergence is greater
than between pine and spruce, and that there are limits to the transfer of
genomic information between conifer taxa.
5.6 Quantitative Trait Loci Mapping in Conifers
The last aspect of mapping in conifers involves identifying genes underlying
quantitative traits along the marker maps. The co-segregation of genetic
markers with phenotypes within pedigrees can reveal individual genes
underlying quantitative traits. The ultimate objective of QTL mapping
is to infer the “genetic architecture” of the quantitative trait, e.g., the
numbers of gene loci controlling the trait, the magnitudes of their effects,
and their location in LGs, epistatic interactions, and gene-by-environment
interactions. While the idea of using markers to study quantitative traits
dates from Sax (1923), who used single-locus morphological markers as
categories for continuous traits, the landmark paper that provided the
modern paradigm is Lander and Botstein (1989), who considered the
multiple marker mapping of QTL mapping.
QTL mapping involves associating alternative marker alleles with
phenotypes in segregating progenies. The major issue in conifers is that
parents should be heterozygous for both genetic markers and QTLs.
Separate QTL maps (but not marker maps) need to be constructed for each
parent. However, if a given marker is heterozygous in both parents, the QTL
cannot be assigned to a parent, unless there is a priori knowledge about
linkage. Issues about QTL mapping in outbred pedigrees are discussed in
Williams (1998).
Candidate genes can also be used as marker loci in QTL mapping. For
example, Wheeler et al. (2005) used 29 putative cold-hardiness candidate
genes for mapping cold-hardiness related traits in Douglas-fir, and Pot
et al. (2006) used 10 candidate genes involved in the biosynthesis and
deposition of the secondary cell wall in maritime pine. Recently, Pelgas
et al. (2011) used 836 candidate genes as marker loci for QTL mapping of
Copyright reserved © 2011
Genetic Mapping in Conifers 221
various adaptive traits. However, location of the candidate gene within a
QTL interval is not proof of causality; further testing using functional or
association genetic approaches are required for proof that such a candidate
gene underlies the quantitative trait.
5.6.1 Crossing Designs for QTL Analysis
There are several possible experimental designs for QTL detection. In
conifers, breeding programs offer genetic material for QTL analyses. The
most common test is a progeny test, where a small number (8–20) of openpollinated progeny are grown, which can estimate the genetic value of the
female parent (White et al. 2007). This number is too small to estimate QTL
effect in any single family, and variation among parents for QTL content
adds complexity. In outbred conifers, each parent will have different QTLs.
Ideally, large (> 100) full sib families are needed for reliable inference of QTL,
in order to avoid the bias of inference of QTL effect due to small family size
(Beavis 1998). However this ignores variation of QTL among individuals
in the larger population.
For QTL mapping, the two major designs are the “inter-specific F1”
design, and the “three-generation full-sib pedigree” design. Interspecific
F1 designs are rare if non-existent in conifers as they are based upon
hybridization between subspecies that are usually fixed for alternative
QTL and alternative markers. The three-generation design has been
employed for Douglas-fir and loblolly pine. An intermediate situation is
often encountered: factorial crossing designs with 10–50 progeny per family
(a complete factorial design is where N males are individually crossed
with M females, resulting in NM families). This design is used to estimate
general and specific combining abilities on both the male and female side
(Verhoeven et al. 2005).
QTLs found in one pedigree may not exist in other pedigrees.
“Validation” of QTLs is the replication of the finding on a second
population. In association genetic studies, validation in other populations
is a requirement. In QTL studies, this is a difficult task as emphasized
by Williams et al. (2007). They point out that a given QTL may not be
polymorphic in the second pedigree, and that other segregating QTLs can
cause gene interactions that obscure the QTL in another pedigree. In conifers,
replicate pedigrees are few due to the long generation times.
The density of markers needed for QTL mapping need not be that high.
Darvasi et al. (1993) found that QTL detection probability for a map with
10 cM spacing of markers was virtually the same as that for a map with an
infinite number of markers. Since SNPs usually have just two alleles per
locus, a larger number of SNPs are needed to obtain the ideal 10 cM marker
spacing. SSR markers are usually highly heterozygous and if on the order
of 100 markers are used; their distribution is sufficiently dense such that a
Copyright reserved © 2011
222
Genetics, Genomics and Breeding of Conifers
given individual is usually heterozygous for at least one locus over a small
(10 cM) genome interval.
5.6.2 QTL Traits of Interest
As in the choice of markers for genetic mapping, the phenotypic traits of
interest need to be identified. In conifers, the two main phenotypic traits
targeted in breeding programs are growth characteristics and wood quality.
Total volume, height and ring width are usually used as growth measures.
Wood quality is defined in terms of end-uses, and often involves several
traits related to wood density, chemical composition and fiber properties.
In the area of tree adaptation, phenological traits (timing of bud set and
bud burst), as well as cold-hardiness, are traits of interest.
New technologies are increasing the types and numbers of quantitative
traits that can be examined, and thus studied for their QTL architecture.
At the wood quality level, traits such as stem straightness, stiffness,
wood specific gravity, fiber coarseness, and microfibril angle can be
measured with x-ray diffraction, the SilvaScan technology, or near-infrared
technology (Byram et al. 2005). At the gene level, microarray technologies
allow monitoring of a vast number of gene transcripts, whose expression
levels are regarded as quantitative traits. Genes involved with the lignin
biosynthetic pathway are often of interest, as these genes are putatively
involved with wood quality and perhaps phenology. Wood cellulose
carbon isotope composition, δ13C, is another important trait of interest,
as it is regarded as a time integrated estimate of water use efficiency. A
vast number of metabolites can also be assayed via gas chromatography,
especially when interfaced with mass spectrometry or high performance
liquid chromatography. Like gene expression, metabolite levels can also be
considered a quantitative trait; however, they are not directly tied to a gene
locus like gene expression levels are. Considering global climate change it
becomes very important to study genetic control of adaptive traits such as
phenology, cold-hardiness and drought resistance related traits.
5.6.3 QTL Maps
5.6.3.1 Loblolly Pine
In the first QTL map for a conifer, Groover et al. (1994) inferred male and
female QTL maps in loblolly pine from a full-sib family of 177 progeny
assayed for RFLPs. Five genome regions contained one or more RFLP loci
for wood specific gravity. In an analysis of male-female QTL homology,
they inferred that the male can have a different QTL segregating at the same
locus than the female, and that these alleles can have epistatic interactions.
Copyright reserved © 2011
Genetic Mapping in Conifers 223
Following this original work, Knott et al. (1997) analyzed the same data for
evidence of multiple QTL in the same linkage interval, finding discordant
results with Groover et al. (1994). Kaya et al. (1999) used the pedigree of
Groover et al. (1994), termed “qtl”, as well as second pedigree, “base”, used
previously by Devey et al. (1994b). Thirteen height and eight diameter QTLs
were detected, suggesting control by few genes of large effect. However,
a given QTL was rarely expressed in multiple years or multiple genetic
backgrounds.
A series of works then ensued with the “qtl” pedigree. Sewell et al.
(2000) used the “qtl” pedigree to infer physical traits of wood: wood specific
gravity (wsg), volume percentage of latewood (vol%) and microfibril angle
(mfa), in both earlywood and latewood. Nine unique QTLs were detected
for wood specific gravity, five for volume percentage of latewood, and
five for microfibril angle (mfa). Most QTL for specific gravity were specific
to either earlywood or latewood, whereas each mfa QTL occurred in both
earlywood and latewood. Sewell et al. (2002) found eight unique chemical
wood property QTLs, with differences among populations for QTL. Brown
et al. (2003) stressed that verification of QTL is necessary, comparing inferred
QTL among populations and within populations for different years. They
found that QTL expressed within pedigrees were more stable than QTL
expressed among pedigrees.
An unusual approach to QTL mapping, which takes advantage of the
conifer megagametophyte, was undertaken by Gwaze et al. (2003). As
megagametophytes are haploid, QTL haplotypes can be traced from the
offspring back to individual founders in outbred pedigrees by combining
founder-origin probabilities with fully informative flanking markers. A
large QTL accounting for 11.3 % of the phenotypic variance in the growth
rate was detected in a loblolly pine pedigree; the QTL haplotype was traced
from offspring to its founder, GP3.
5.6.3.2 Maritime Pine
Some of the earliest conifer QTL studies also occurred in Pinus pinaster.
Plomion et al. (1996a) assayed 126 F2 progeny for RAPD markers, including
assay of megagametophytes to determine the linkage phase of the parents.
Height growth components related to the initiation (controlled by the apical
meristem) and elongation of shoot cycles (controlled by the subapical
meristem) were mapped to different chromosomes, suggesting that the
activity of these meristems is controlled by separate genetic mechanisms.
Plomion et al. (1996b) further studied this cross to find a major QTL for delta
3-carene, a monoterpene, which is a constituent of turpentine. In addition,
a qualitative approach found that the ‘’C’’ locus that controls the relative
Copyright reserved © 2011
224
Genetics, Genomics and Breeding of Conifers
quantity of delta 3-carene was associated with RAPD markers near the
major QTL. This was the first study of co-localization of QTL.
Markussen et al. (2003) found 10 QTLs for height or diameter and
40 QTLs for seven wood parameters in P. pinaster. They found that two
SSR markers linked to QTL also were linked in a QTL mapped for P. taeda
(Devey et al, 1999); such markers could be used for comparative QTL
studies. Using a second P. pinaster three-generation pedigree, Brendel et
al. (2002) found four QTLs for δ13C (the first time found in a tree) and two
QTLs for ring width, but they did not co-locate with the δ13C QTL. On the
same pedigree, Pot et al. (2006) detected 54 QTLs. QTL for different traits
in the same map position also showed genetic correlations as estimated by
traditional quantitative genetic analyses. Chagné et al. (2003) compared
QTL maps of Maritime pine and loblolly pine, using 32 common mapped
ESTP markers. The positions of two QTLs controlling wood density and
cell wall components were conserved between the two species. This was
the first ever comparison of QTL maps between conifer species.
5.6.3.3 Radiata Pine
In Pinus radiata, efforts for QTL mapping were directed towards eventual
use for marker-assisted selection (MAS; the use of specific allelic variants
detected in mapping population for tree improvement in unrelated
populations). In the first investigation (Emebiri et al. 1998a), haploid
megagametophytes were assayed, then progeny of the same individuals
grown up to evaluate traits for QTL analysis. This is not a pseudo-testcross
design, but rather it evaluates QTLs from the female parent only. From
222 RAPD markers, stem diameter, volume and height were compared at
5 months, and at 1, 2 and 3 years of age. In a second study, four QTLs for
stem growth efficiency were found, which accounted for 8.5–36.4% of the
population variance (Emebiri et al. 1998b). In a third study, wood density
was evaluated at three stages (Kumar et al. 2000). The results suggested
that early selection can be used in order to increase juvenile wood density,
although the putative QTLs detected in this study need to be verified in
an independent population.
Devey et al. (2004) mapped QTL for juvenile wood density (JWD) and
diameter at breast height (DBH) using a large full-sib family. The percent
variance accounted for by several QTL ranged from 0.78% to 3.58%,
suggesting a genomic architecture of many genes with small effect. Two
unrelated “bridging” families were chosen to identify markers for MAS.
Four markers showed consistent association with JWD, providing the first
basis for MAS in a conifer.
Copyright reserved © 2011
Genetic Mapping in Conifers 225
5.6.3.4 Scots Pine
In Pinus sylvestris, Lerceteau et al. (2000) generated both male and females
using the two-way pseudo-testcross strategy. On the female size, 12 QTLs
were detected, the largest for frost hardiness. A cluster of QTLs for tree
height, trunk diameter and volume was located on one LG. On the male
map, four QTLs for trunk diameter and volume were detected. Yazdani et
al. (2003) also adopted the pseudo-testcross method, and found QTLs for
shoot elongation; growth cessation and cold acclimation were found on
both maps. Their study concluded that major QTLs control growth rhythm
and autumn cold acclimation.
5.6.3.5 Pine hybrids
In the only QTL study of a conifer hybrid (Slash pine x Caribbean Pine), a
pseudo-testcross QTL detection strategy was used to identify QTLs for
wood density, secondary growth, and dry wood mass in a pedigree of size
133 (Shepherd et al. 2003b). Twelve QTLs were identified that clustered into
four LGs in the slash pine parent and in only one group in the Caribbean
pine parent. QTLs that influenced density and ring width did not co-locate,
suggesting independent inheritance of these characters. Two other pedigrees
were more recently mapped for QTLs for adventitious rooting (Shepherd et
al. 2006). Most small to moderate effect QTL were congruent between the
two pedigrees, while a large effect QTL was found only in one pedigree, and
was postulated to be a between-species effect. Targeting between-species
effects for improvement in synthetic hybrid populations may increase the
efficacy and predictability of hybrid breeding.
5.6.3.6 Douglas-fir
A series of studies used a three-generation pedigree to examine various
classes of traits for QTL in Douglas-fir (Pseudotsuga menziesii). Jermstad
et al. (2001a) genotyped 192 progeny for 74 evenly distributed RFLP
markers found by Jermstad et al. (1998). Thirty three QTL for timing of
spring bud flush were found, and measurements for each of 3 years and
2 test sites showed that several QTLs influence the timing of bud flush
over multiple years within sites but not between sites, indicating major
QTL of consistent effect within sites but interactions with environment
between sites. Using the same material, Jermstad et al. (2001b) found 11
and 15 QTLs affecting fall and spring cold-hardiness, respectively. Three
different shoot tissues phenotyped for spring hardiness showed similar
QTL, while different tissues phenotyped for fall hardiness showed little
QTL similarity, supporting previous reports that spring tissues are more
synchronized than fall tissues.
Copyright reserved © 2011
226
Genetics, Genomics and Breeding of Conifers
Jermstad et al. (2003) again used the same pedigree and markers, but
for additional individuals totaling 460, to investigate QTL interactions of
many of the above traits with photoperiod, moisture stress, winter chilling,
and spring temperature. In the first investigation of QTL interaction
with environment, they found two QTL-by-treatment interactions for
growth initiation traits, and several QTL-by-treatment interactions for
growth cessation traits. Finally, Wheeler et al. (2005) evaluated QTL for
cold-hardiness via artificial freezing and various cold injury assessment
methods in two pedigrees of size 170 and 383. Six QTL were found in the
first pedigree, eight in the second, of which four were shared between the
pedigrees; 17 of 29 putative cold-hardiness candidate genes identified from
ESTs were located within the QTL intervals, thus identifying them as high
priority for association studies. These works with Douglas-fir demonstrate
a unique opportunity of working with trees: long-lived species allow
“immortal” pedigrees that can be repeatedly phenotyped for different traits
after genotyping.
Finally, QTL analyses are normally conducted in single pedigrees. In
contrast, Ukrainetz et al. (2008b) examined eight full-sib families, each of
size 40 progeny, for wood-related QTLs, using the software “QTL Express”
(Seaton et al. 2002). They found that wood fiber and density traits both
showed the lowest number of QTLs (3) with relatively small effects; wood
chemistry traits showed more QTLs (7), while ring density traits large
numbers of QTLs (78) and interesting patterns of temporal variation. Growth
traits gave just five QTLs but of major effect. These wood quality traits
are the widest suite of traits yet examined for QTL analysis in a conifer.
Moreover, examination of multiple families for QTL gives a population
perspective of the true extent of QTL variation.
5.6.3.7 Norway spruce
Markussen et al. (2004) employed bulked segregant analysis and AFLP
markers to compare Norway spruce (Picea abies) individuals with high
and low wood density. Of 107 polymorphic AFLP markers, 15 markers
showed significant linkage to wood density, and two of these were found to
predict wood density in unrelated full-sib families. Markussen et al. (2005)
extended this strategy to compare individuals with high and low extractives
content. Of 14 polymorphic AFLP markers were detected between the
pools, one marker was linked to low extractives content and subsequently
verified as above. Recently, a full-sib family of size 250 has been assayed for
Heterobasidion (root rot), with the objective of mapping QTLs and identifying
candidate genes conferring reduced susceptibility to Heterobasidion spp.
(Jenny Arnerup and Jan Stenlid, Univ of Uppsala, pers. comm.).
Copyright reserved © 2011
Genetic Mapping in Conifers 227
5.6.3.8 North American Spruce Species
No QTL mapping studies have been conducted in spruce until recently.
In the Quebec Arborea genome project, two pedigrees of white spruce, of
size 395 and 740, have been established and genotyped for 768 and 1,536
gene SNPs, respectively, using the Illumina GoldenGate assay. Experiments
on different sites involving clonal propagation of root cuttings have been
used to evaluate genotype-by-environment interactions for growth and
adaptive traits (Pelgas et al. 2011). About 34 QTL clusters each explaining
generally below 15% of phenotypic variance were found for bud flush,
bud set and height growth, with about 20% of these replicated between
mapping populations and 50% of them with spatial or temporal stability.
At least three occurences of overlapping QTLs were noted, indicative of
potential pleiotropic effects. On a smaller scale, a black spruce pedigree of
size 283 is being studied for wood quality and phenology traits (J Prunier
et al. unpubl. data). As the genes have already been mapped in both this
pedigree and in the white spruce pedigrees, this will offer an excellent
opportunity to assess QTL homology across species.
The British Columbia Treenomix genome project has worked with
two factorial crosses from the spruce weevil resistance breeding program
(see Alfaro et al. 2004). In the first, involving Interior spruce, 369 progeny
in 3 × 2 factorial were genotyped for 253 informative SNP markers using
the Illumina GoldenGate assay (I Porth et al. unpubl. data). Over 300
metabolites were also assayed (R Dauwe et al. unpubl. data). The second
cross, involving Sitka spruce, is currently being assayed. An approach called
“genetical genomics” may also identify previously unidentified networks
of genes unique to conifers.
5.6.3.9 Sugi
Yoshimaru et al. (1998) mapped QTLs for growth, flowering and rooting
ability in Sugi (Crypomeria. japonica). Growth is one of the most important
traits for timber-producing woody species and also for carbon dioxide
fixation to mitigate global warming. QTLs for juvenile growth, including
height and diameter of basal area, were mapped. Flowering is essential for
reproduction, but is not necessary for timber production. If the expression
of flowering could be controlled, it would be useful not only for breeding
but also for forestry and the environment. QTLs for male and female flowers
have been mapped at two locations each, respectively. The rooting ability of
this species is very important for clonal forestry in the southwestern part of
Japan, especially in Kyushu Island. QTLs for rooting ability were found but
there were not highly significant in the family used in the study (Yoshimaru
et al. 1998). Wood quality QTLs, specifically modulus of elasticity (an
important indicator of wood strength), have also been mapped in Sugi
(Kuramoto et al. 2000).
Copyright reserved © 2011
228
Genetics, Genomics and Breeding of Conifers
Recently, pollinosis (human allergies to pollen) has become a serious
social problem; 10 to 20% of Japanese have pollinosis to pollen from Sugi
because of a large plantation, which now has matured to flowering. As
a countermeasure, the male-sterile lines of C. japonica are planned to be
used for reforestation. Some male-steriles seem to be controlled by a
single recessive locus (Taira et al. 1993). To determine the location of the
locus on the linkage map, co-dominant DNA markers have been used for
mapping of the gene, using SSRs (Moriguchi et al. 2003; Tani et al. 2004),
EST-SSRs (Y Moriguchi et al. unpubl. data), and SNPs (T Ujino-Ihara et al.
unpubl. data). After the genome location of this male-sterile gene is found,
a selective marker will be developed and used for selection of the malesterile individuals from the plantation forests and plus trees as breeding
materials.
5.7 Prospects
In a seminal review, Remington and Purugganan (2003) stated that future
research in plants should expand the number of traits that are intensively
studied and make greater use of QTL mapping in wild plant taxa, especially
those undergoing adaptive radiations, while continuing to draw on insights
from model plants. Conifers are inherently non-domesticated (e.g., wild
plant taxa) and the resources provided by breeding programs and genome
projects will provide rich resources for testing of candidate gene-trait
associations in wild populations, genetic mapping in hybrid zones, and
microarray analyses of gene expression.
In conifers, comparative analyses of genetic maps will continue to be a
fertile ground for future studies. In sunflower species, a comparative study
showed that in the face of extensive hybridization and gene flow, species
integrity is maintained (Strasburg et al. 2009). There are many examples
of hybrid zones in conifer species, such as the hybridization between
Englemann spruce and white spruce in British Columbia. There have been
no such studies in conifers that compare patterns of genetic divergence
and diversity along chromosomal segments, which can reveal divergent
selection for speciation. In conifers, few studies involving “genome scans”
have been done (but see Namroud et al. 2008).
Another approach possible for conifers is to use “hitchhiking mapping”
to identify regions of recent selective sweeps, due to adaptive divergence.
This method starts from a genome scan using a randomly spaced set of
molecular markers followed by a fine-scale analysis in the flanking regions
of the candidate regions under selection. In fish, the hitchhiking approach
identified a selective sweep around candidate locus Stn90 (Makinen et
al. 2008). Fine scale genome maps will help identify candidate loci for
adaptation in conifers, particularly those involved with strong ecological
Copyright reserved © 2011
Genetic Mapping in Conifers 229
gradients, such as that found Sitka spruce from coastal California to coastal
Alaska (see Mimura and Aitken 2007).
Yet another new avenue for using QTL maps is “genetical genomics”,
which combines genetic mapping with gene expression analysis. It uses
variation of gene expression induced by segregation within mapping
populations to infer interactions among expressed genes or metabolites.
Gene networks, and even directed gene networks, can be inferred by the
joint analysis of marker genotypes and gene expression and metabolite levels
(Rockman 2008). In the Treenomix II project, two genetical genomic studies
are nearing completion. These involve a 22K member cDNA microarray,
hundreds of assayed metabolites, and scores for weevil resistance in both
white spruce pedigree (I Porth et al. unpubl. data; R Dauwe et al. unpubl.
data) and a Sitka spruce pedigree (S Verne et al. unpubl. data).
Recently a number of “next-generation” sequencing technologies have
been invented, which can sequence fragments of DNA at astoundingly
higher rates compared to Sanger sequencing. These include the Illumina/
Solexa, ABI/SOLiD, 454/Roche, Pacific Biosciences/SMRT and Helicos
(Morozova and Marra 2008). To date, these technologies have been
applied mostly in non-marker contexts, such as whole-genome sequencing
(Bentley et al. 2008), targeted resequencing (Gnirke et al. 2009), discovery
of transcription factor binding sites, transcript and non-coding RNA
expression profiling, and other functional genomic studies (Eveland et al.
2008). These technologies should greatly facilitate genotyping of mapping
populations for mapping through direct and parallel sequencing of multiple
individuals.
Finally, and last but not least, for the past several years, there has been
an initiative to sequence a conifer genome, starting with the seminal paper
of Neale et al. (1994). There are several initiatives such as the Pine Genome
Initiative (http://pinegenomeinitiative.org/) and the International Conifer
Genome Initiative (http://www.pinegenome.org). It is not clear what strategy
is the best, and current initiatives are exploring alternatives. Fine scale
genetic mapping will clearly enable the assembly of contigs based upon
shotgun sequencing (for example, in the monkeyflower genome project,
John Willis pers. comm.). A current goal of the Arborea project is to map
10,000 genes in white spruce (J Bousquet pers. comm.). Other workers in the
USA, Canada and Spain have embarked upon exploratory BAC sequencing
and gene enrichment of the repetitive genome to discover the structure of
conifer genomes, using “gene space” explorations developed such as for
maize (Liu et al. 2007). These approaches will interface with genetic mapping
to help assemble the first conifer genome.
Copyright reserved © 2011
230
Genetics, Genomics and Breeding of Conifers
References
Acheré V, Faivre-Rampant P, Jeandroz S, Besnard G, Markussen T, Aragones A, Fladung M,
Ritter E, Favre J-M (2004) A full saturated linkage map of Picea abies including AFLP, SSR,
ESTP, 5S rDNA, and morphological markers. Theor Appl Genet 108: 1602–1613.
Acheré V, Favre J-M, Besnard G, Jeandroz S (2005) Genomic organization of molecular
differentiation in Norway spruce (Picea abies). Mol Ecol 14: 3191–3201.
Adams WS, Strauss SH, Copes DL, Griffin AR (1992) Population Genetics of Forest Trees.
Klewer, Dordrecht, The Netherlands.
Agrawal AF, Hadany L, Otto SP (2005) The evolution of plastic recombination. Genetics 171:
803–812.
Alfaro RI, vanAkker L, Jaquish B, King J (2004) Weevil resistance of progeny derived from
putatively resistant and susceptible interior spruce parents. For Ecol Manag 202:
369–377.
Amarasinghe V, Carlson JE (1998) Physical mapping and characterization of 5S rRNA genes
in Douglas-fir. J Hered 89: 495–500.
Arcade A, Anselin F, Faivre Rampant P, Lesage MC, Pâques LE, Prat D (2000) Application
of AFLP, RAPD and ISSR markers to genetic mapping of European and Japanese larch.
Theor Appl Genet 100: 299–307.
Beavis WD (1998) QTL analyses: power, precision, and accuracy. In: AH Paterson (ed) Molecular
Dissection of Complex Traits. CRC Press, Boca Raton, Florida, USA, pp 145–162.
Bentley DR, Balasubramanian S, Swerdlow HP, et al. (2008) Accurate whole human genome
sequencing using reversible terminator chemistry. Nature 456: 53–59.
Binelli G, Bucci G (1994) A genetic linkage map of Picea abies Karst., based on RAPD markers,
as a tool in population genetics. Theor Appl Genet 88: 283–288.
Bornet B, Branchard M (2001) Nonanchored Inter Simple Sequence Repeat (ISSR) markers:
reproducible and specific tools for genome fingerprinting. Plant Mol Biol Rep 19:
209–215.
Brendel O, Pot D, Plomion C, Rozenberg P, Guehl J-M (2002) Genetic parameters and QTL
analysis of delta C-13 and ring width in maritime pine. Plant Cell Environ 25: 945–953.
Brown GR, Carlson JE (1997) Molecular cytogenetics of the genes encoding 18s-5.8s-26s rRNA
and 5s rRNA in two species of spruce (Picea). Theor Appl Genet 95: 1–9.
Brown GR, Newton CH, Carlson JE (1998) Organization and distribution of a Sau3A tandem
repeated DNA sequence in Picea (Pinaceae) species. Genome 41: 560–565.
Brown GR, Kadel III EE, Bassoni DL, Kiehne KL, Temesgen B, van Buijtenen JP, Sewell MM,
Marshall KA, Neale DB (2001) Anchored reference loci in loblolly pine (Pinus taeda L.)
for integrating pine genomics. Genetics 159: 799–809.
Brown GR, Bassoni DL, Gill GP, Fontana JR, Wheeler NC, Megraw RA, Davis MF, Sewell MM,
Tuskan GA, Neale DB (2003) Identification of quantitative trait loci influencing wood
property traits in loblolly pine (Pinus taeda L.) III. QTL verification and candidate gene
mapping. Genetics 164: 1537–1546.
Buetow KH (1991) Influence of aberrant observations on high-resolution linkage analysis
outcomes. Am J Hum Genet 49: 985–994.
Byram TD, Myszewski JH, Gwaze DP, Lowe WJ (2005) Improving wood quality in the western
gulf forest tree improvement program: the problem of multiple breeding objectives. Tree
Genet Genomes 1: 85–92.
Cairney J, Pullman GS (2007) The cellular and molecular biology of conifer embryogenesis.
New Phytol 176: 511–536.
Carlson JE, Traore A, Agrama HA, Krutovsky KV (2007) Douglas-fir. In: C Kole (ed) Genome
Mapping and Molecular Breeding in Plants, vol 7: Forest Trees. Springer, Berlin, Tokyo,
pp 199–210.
Copyright reserved © 2011
Genetic Mapping in Conifers 231
Chagné D, Lalanne C, Madur D, Kumar S, Frigério J-M, Krier C, Decroocq S, Savouré A, BouDagher-Kharrat M, Bertocchi E, Brach J, Plomion C (2002) A high density genetic map
of maritime pine based on AFLPs. Ann For Sci 59: 627–636.
Chagné D, Brown G, Lalanne C, Madur D, Pot D, Neale D, Plomion C (2003) Comparative
genome and QTL mapping between maritime and loblolly pines. Mol Breed 12:
185–195.
Chagné D, Chaumeil P, Ramboer A, Collada C, Guevara A, Cervera MT, Vendramin GG, Garcia
V, Frigerio J-M, Echt C, Richardson T, Plomion C (2004) Cross-species transferability and
mapping of genomic and cDNA SSRs in pines. Theor Appl Genet 109: 1204–1214.
Chakravarti A, Lasher LK, Reefer JE (1991) A maximum likelihood method for estimating
genome length using genetic linkage data. Genetics 128: 175–182.
Clark AG (1990) Inference of haplotypes from PCR-amplified samples of diploid populations.
Mol Biol Evol 7: 111–122.
Conkle MT (1981) Proceedings of the Symposium on Isozymes of North American Forest Trees
and Forest Insects, 27 July, 1979, Berkeley, California. Gen Tech Rep PSW-GTR-48: Pacific
Southwest Forest and Range Exp Stn, Forest Service, US Department of Agriculture,
Berkeley, CA, USA.
Costa P, Pot D, Dubos C, Frigerio JM, Pionneau C, Bodenes C, Bertocchi E, Cervera M-T,
Remington DL, Plomion C (2000) A genetic map of Maritime pine based on AFLP, RAPD
and protein markers. Theor Appl Genet 100: 39–48.
Darvasi A, Weinreb A, Minke V, Weller JI, Soller M (1993) Detecting marker-QTL linkage and
estimating QTL gene effect and map location using a saturated genetic map. Genetics
134: 943–951.
Devey ME, Fiddler TA, Liu BH, Knapp SJ, Neale DB (1994a) An RFLP linkage map for loblolly
pine based on a three-generation outbred pedigree. Theor Appl Genet 88: 273–278.
Devey ME, Fiddler TA, Liu BH, Knapp SJ, Neale DB (1994b) An RFLP linkage map for loblolly
pine based on a 3-generation outbred pedigree. Theor Appl Genet 88: 273–278.
Devey ME, Bell JC, Smith DN, Neale DB, Moran GF (1996) A genetic linkage map for Pinus
radiata based on RFLP, RAPD, and microsatellite markers. Theor Appl Genet 92:
673–679.
Devey ME, Sewell MM, Uren TL, Neale DB (1999) Comparative mapping in loblolly and radiata
pine using RFLP and microsatellite markers. Theor Appl Genet 99: 656–662.
Devey ME, Carson SD, Nolan MF, Matheson AC, Te Riini C, Hohepa J (2004) QTL associations
for density and diameter in Pinus radiata and the potential for marker-aided selection.
Theor Appl Genet 108: 516–524.
Echt CS, Nelson CD (1997) Linkage mapping and genome length in eastern white pine (Pinus
strobus L.). Theor Appl Genet 94: 1031–1037.
Echt CS, Saha S, Krutovsky KV, Wimalanathan K, Erpelding JE, Liang C, Nelson CD (2011)
An Annotated genetic map of loblolly pine based on microsatellite and cDNA markers.
BMC Genetics (in press).
Eckert AJ, Pande B, Ersoz ES, Wright MH, Rashbrook VK, Nicolet CM, Neale DB (2009) Highthroughput genotyping and mapping of single nucleotide polymorphisms in loblolly
pine (Pinus taeda L.). Tree Genet Genomes 5: 225–234.
Ellis JR, Burke JM (2007) EST-SSRs as a resource for population genetic analyses. Heredity
99: 125–132.
Emebiri LC, Devey ME, Matheson AC, Slee MU (1998a) Age-related changes in the expression
of QTLs for growth in radiata pine seedlings. Theor Appl Genet 97: 1053–1061.
Emebiri LC, Devey ME, Matheson AC, Slee MU (1998b) Interval mapping of quantitative trait
loci affecting NESTUR, a stem growth efficiency index of radiata pine seedlings. Theor
Appl Genet 97: 1062–1068.
Ercht CS, Saha S, Krutovsky KV, Wimalanathan K, Erpelding JE, Liang C, Nelson CD (2011)
An annotated genetic map of loblolly pine based on microsatellite and cDNA markers.
BMC Genetics (in press).
Copyright reserved © 2011
232
Genetics, Genomics and Breeding of Conifers
Eveland AL, McCarty DR, Koch KE (2008) Transcript profiling by 3’-untranslated region
sequencing resolves expression of gene families. Plant Physiol 146: 32–44.
Fulton TM, Van der Hoeven R, Eannetta NT, Tanksley SD (2002) Identification, analysis, and
utilization of conserved ortholog set markers for comparative genomics in higher plants.
Plant Cell 14: 1457–1467.
Gale MD, Devos KM (1998) Comparative genetics in the grasses. Proc Natl Acad Sci USA 95:
1971–1974.
Gasbarra D, Sillanpää MJ (2006) Constructing the parental linkage phase and the genetic map
over distances < 1 cM using pooled haploid DNA. Genetics 172: 1325–1335.
Gnirke A, Melnikov A, Maguire J, Rogov P, LeProust E, Brockman W, Fennell T, Giannoukos
G, Fisher S, Russ C, Gabriel S, Jaffe DB, Lander ES, Nusbaum C (2009) Solution hybrid
selection with ultra-long oligonucleotides for massively parallel targeted sequencing.
Nat Biotechnol 27: 182–189.
Göçmen B, Jermstad KD, Neale DB, Kaya Z (1996) Development of random amplified
polymorphic DNA markers for genetic mapping in Pacific yew (Taxus brevifolia). Can J
For Res 26: 497–503.
Gogarten JP, Olendzenski L (1999) Orthologs paralogs and genome comparisons. Curr Opin
Genet Dev 9: 630–636.
Gosselin I, Zhou Y, Bousquet J, Isabel N (2002) Megagametophyte-derived linkage maps of
white spruce (Picea glauca) based on RAPD, SCAR and ESTP markers. Theor Appl Genet
104: 987–997.
Grattapaglia D, Sederoff R (1994) Genetic linkage maps of Eucalyptus grandis and Eucalyptus
urophylla using a pseudo-testcross: mapping strategy and RAPD markers. Genetics 137:
1121–1137.
Groover A, Devey M, Fiddler T, Lee J, Megraw R, Mitchel-Olds T, Sherman B, Vujcic S, Williams
C, Neale D (1994) Identification of quantitative trait loci influencing wood specific gravity
in an outbred pedigree of loblolly pine. Genetics 138: 1293–1300.
Groover AT, Williams CG, Devey ME, Lee JM, Neale DB (1995) Sex-related differences in
meiotic recombination frequency in Pinus taeda. J Hered 86: 157–158.
Gwaze DP, Zhou Y, Reyes-Valdes MH, Al-Rababah MA, Williams CG (2003) Haplotypic QTL
mapping in an outbred pedigree. Genet Res 81: 43–50.
Hayashi E, Kondo T, Terada K, Kuramoto N, Goto Y, Okamura M, Kawasaki H (2001) Linkage
map of Japanese black pine based on AFLP and RAPD markers including markers linked
to resistance against the pine needle gall midge. Theor Appl Genet 102: 871–875.
Hu X-S, Goodwillie C, Ritland KM (2004) Joining genetic linkage maps using a joint likelihood
function. Theor Appl Genet 109: 996–1004.
Hudson EE (2005) Development of a genetic linkage map in Abies nordmanniana accession 9M
and assessment of disease resistance to Phytophthora cinnamomi. North Carolina State
Univ, Raleigh, North Carolina, USA.
Hulbert SH, Ilott TW, Legg EJ, Lincoln SE, Lander ES, Michelmore RW (1988) Genetic analysis
of the fungus Bremia lactucae, using restriction fragment length polymorphisms. Genetics
120: 947–958.
Huynen MA, Bork P (1998) Measuring genome evolution. Proc Natl Acad Sci USA 95:
5849–5856.
Iwata H, Ujino-Ihara T, Yoshimura K, Nagasaka K, Mukai Y, Tsumura Y (2001) Cleaved
amplified polymorphic sequence markers in sugi, Cryptomeria japonica D. Don, and their
locations on a linkage map. Theor Appl Genet 103: 881–895.
Jermstad KD, Bassoni DL, Wheeler NC, Neale DB (1998) A sex-averaged genetic linkage map
in coastal Douglas-fir (Pseudotsuga menziesii [Mirb.] Franco var ‘menziesii’) based on RFLP
and RAPD markers. Theor Appl Genet 97: 762–770.
Jermstad KD, Bassoni DL, Jech KS, Wheeler NC, Neale DB (2001a) Mapping of quantitative
trait loci controlling adaptive traits in coastal Douglas-fir. I. Timing of vegetative bud
flush. Theor Appl Genet 102: 1142–1151.
Copyright reserved © 2011
Genetic Mapping in Conifers 233
Jermstad KD, Bassoni DL, Wheeler NC, Anekonda TS, Aitken SN, Adams WT, Neale DB (2001b)
Mapping of quantitative trait loci controlling adaptive traits in coastal Douglas-fir. II.
Spring and fall cold-hardiness. Theor Appl Genet 102: 1152–1158.
Jermstad KD, Bassoni DL, Jech KS, Ritchie GA, Wheeler NC, Neale DB (2003) Mapping of
quantitative trait loci controlling adaptive traits in coastal Douglas fir. III. Quantitative
trait loci-by-environment interactions. Genetics 165: 1489–1506.
Jermstad KD, Eckert AF, Wegrzyn JL, Delfino-Mix A, Davis DA, Burton DC, Neale DB (2010)
Comparative mapping in Pinus: sugar pine (Pinus lambertiana Dougl.) and loblolly pine
(Pinus taeda L.). Tree Genet Genomes (in press).
Kang B-Y, Mann IK, Major JE, Rajora OP (2010) Near-saturated and complete genetic linkage
map of black spruce (Picea mariana). BMC Genomics 11: 515.
Kaya Z, Neale DB (1995) Utility of random amplified polymorphic DNA (RAPD) markers for
linkage mapping in Turkish red pine (Pinus brutia Ten.). Silvae Genet 44: 110–116.
Kaya Z, Sewell MM, Neale DB (1999) Identification of quantitative trait loci influencing
annual height- and diameter-increment growth in loblolly pine (Pinus taeda L.). Theor
Appl Genet 98: 586–592.
Kim Y-Y, Choi HS, Kang B-Y (2005) An AFLP-based linkage map of Japanese red pine (Pinus
densiflora) using haploid DNA samples of megagametophytes from a single maternal
tree. Mol Cells 20: 201–209.
Kinlaw CS, Neale DB (1997) Complex gene families in pine genomes. Trends Plant Sci 2:
356–359.
Kirst M, Johnson AF, Baucom C, Ulrich E, Hubbard K, Staggs R, Paule C, Retzel E, Whetten
R, Sederoff R (2003) Apparent homology of expressed genes from wood-forming tissues
of loblolly pine (Pinus taeda L.) with Arabidopsis thaliana. Proc Nat Acad Sci USA 100:
7383–7388.
Knott SA, Neale DB, Sewell MM, Haley CS (1997) Multiple marker mapping of quantitative
trait loci in an outbred pedigree of loblolly pine. Theor Appl Genet 94: 810–820.
Komulainen P, Brown GR, Mikkonen M, Karhu A, Garcia-Gil MR, O’Malley D, Lee B, Neale
DB, Savolainen O (2003) Comparing EST-based genetic maps between Pinus sylvestris
and Pinus taeda. Theor Appl Genet 107: 667–678.
Koonin EV (2005) Orthologs, paralogs, and evolutionary genomics. Annu Rev Genet 39:
309–338.
Krutovsky KV, Vollmer SS, Sorensen FC, Adams WT, Knapp SJ, Strauss SH (1998) RAPD
genome maps of Douglas-fir. J Hered 89: 197–205.
Krutovsky KV, Troggio M, Brown GR, Jermstad KD, Neale DB (2004) Comparative mapping
in the Pinaceae. Genetics 168: 447–461.
Krutovsky KV, Elsik CG, Matvienko M, Kozik A, Neale DB (2006) Conserved ortholog sets in
forest trees. Tree Genet Genomes 3: 61–70.
Kuang H, Richardson T, Carson S, Wilcox P, Bongarten B (1999) Genetic analysis of inbreeding
depression in plus tree 850.55 of Pinus radiata D. Don. I. Genetic map with distorted
markers. Theor Appl Genet 98: 697–703.
Kubisiak TL, Nelson CD, Nance WL, Stine M (1995) RAPD linkage mapping in a longleaf pine
× slash pine F1 family. Theor Appl Genet 90: 1119–1127.
Kumar S, Spelman RJ, Garrick DJ, Richardson TE, Lausberg M, Wilcox PL (2000) Multiplemarker mapping of wood density loci in an outbred pedigree of radiata pine. Theor
Appl Genet 100: 926–933.
Kuramoto N, Kondo T, Fujisawa Y, Nakata R, Hayashi E, Goto Y (2000) Detection of quantitative
trait loci for wood strength in Cryptomeria japonica. Can J For Res 30: 1525–1533.
Lander ES, Botstein D (1989) Mapping mendelian factors underlying quantitative traits using
RFLP linkage maps. Genetics 121: 185–199.
Lander ES, Green P, Abrahamson J, Barlow A, Daly MJ, Lincoln SE, Newburg L (1987)
MAPMAKER: An interactive computer package for constructing primary genetic linkage
maps of experimental and natural populations. Genomics 1: 174–181.
Copyright reserved © 2011
234
Genetics, Genomics and Breeding of Conifers
Lathrop GM, Lalouel JM, Julier C, Ott J (1985) Multilocus linkage analysis in humans: detection
of linkage and estimation of recombination. Am J Hum Genet 37: 482–498.
Le Dantec L, Chagné D, Pot D, Cantin O, Garnier-Géré P, Bedon F, Frigerio J-M, Chaumeil P,
Léger P, Garcia V, Laigret F, de Daruvar A, Plomion C (2004) Automated SNP detection
in expressed sequence tags: statistical considerations and application to maritime pine
sequences. Plant Mol Biol 54: 461–470.
Lerceteau E, Plomion C, Andersson B (2000) AFLP mapping and detection of quantitative
trait loci (QTLs) for economically important traits in Pinus sylvestris: a preliminary study.
Mol Breed 6: 451–458.
Li C, Yeh FC (2001) Construction of a framework map in Pinus contorta subsp. latifolia using
random amplified polymorphic DNA markers. Genome 44: 147–153.
Liewlaksaneeyanawin C, Zhuang J, Tang M, Farzaneh N, Lueng G, Cullis C, Findlay S, Ritland
CE, Bohlmann J, Ritland K (2009) Identification of COS markers in the Pinaceae. Tree
Genet Genomes 5: 247–255.
Liu BH (1998) Statistical Genomics: Linkage, Mapping, and QTL Analysis. CRC Press, Boca
Raton, Florida, USA.
Liu R, Vitte C, Ma J, Mahama AA, Dhliwayo T, Lee M, Bennetzen JL (2007) A GeneTrek analysis
of the maize genome. Proc Nat Acad Sci USA 104: 11844–11849.
Liu W, Magbanua ZV, Ozkan S, Chouvarine P, Bartlett BD, Peterson DG (2009) BAC libraries
for two distantly related conifers, loblolly pine and bald cypress. In: Plant Anim Genome
XVII Conf, San Diego, CA, USA.
Mäkinen HS, Shikano T, Cano JM, Merilä J (2008) Hitchhiking mapping reveals a candidate
genomic region for natural selection in three-spined stickleback chromosome VIII.
Genetics 178: 453–465.
Margarido GRA, Souza AP, Garcia AAF (2007) OneMap: software for genetic mapping in
outcrossing species. Hereditas 144: 78–79.
Markussen T, Fladung M, Achere V, Favre JM, Faivre-Rampant P, Aragones A, Silva Perez DD,
Harvengt L, Espinel S, Ritter E (2003) Identification of QTLs controlling growth, chemical
and physical wood property traits in Pinus pinaster (Ait.). Silvae Genet 52: 8–15.
Markussen T, Tusch A, Stephan BR, Fladung M (2004) Identification of molecular markers
for selected wood properties of Norway spruce Picea abies L. (Karst.) I. Wood density.
Silvae Genet 53: 45–50.
Markussen T, Tusch A, Stephan BR, Fladung M (2005) Identification of molecular markers for
selected wood properties of Norway spruce Picea abies L. (Karst.) II. Extractives content.
Silvae Genet 54: 145–152.
Mester D, Ronin Y, Minkov D, Nevo E, Korol A (2003) Constructing large-scale genetic maps
using an evolutionary strategy algorithm. Genetics 165: 2269–2282.
Mester DI, Ronin YI, Nevo E, Korol AB (2004) Fast and high precision algorithms for
optimization in large-scale genomic problems. Comp Biol Chem 28: 281–290.
Mester DI, Ronin YI, Korostishevsky MA, Pikus VL, Glazman AE, Korol AB (2006) Multilocus
consensus genetic maps (MCGM): Formulation, algorithms, and results. Comp Biol
Chem 30: 12–20.
Mimura M, Aitken SN (2007) Adaptive gradients and isolation-by-distance with postglacial
migration in Picea sitchensis. Heredity 99: 224–232.
Moran GF, Bell JC, Hilliker AJ (1983) Greater meiotic recombination in male vs. female gametes
in Pinus radiata. J Hered 74: 62.
Morgante M, Olivieri AM (1993) PCR-amplified microsatellites as markers in plant genetics.
Plant J 3: 175–182.
Morgante M, Hanafey M, Powell W (2002) Microsatellites are preferentially associated with
nonrepetitive DNA in plant genomes. Nat Genet 30: 194–200.
Moriguchi Y, Iwata H, Ujino-Ihara T, Yoshimura K, Taira H, Tsumura Y (2003) Development
and characterization of microsatellite markers for Cryptomeria japonica D. Don. Theor
Appl Genet 106: 751–758.
Copyright reserved © 2011
Genetic Mapping in Conifers 235
Morozova O, Marra MA (2008) Applications of next-generation sequencing technologies in
functional genomics. Genomics 92: 255–264.
Morse AM, Peterson DG, Islam-Faridi MN, Smith KE, Magbanua Z, Garcia SA, Kubisiak TL,
Amerson HV, Carlson JE, Nelson CD, Davis JM (2009) Evolution of genome size and
complexity in Pinus. PLoS One 4: e4332.
Mukai Y, Suyama Y, Tsumura Y, Kawahara T, Yoshimaru H, Kondo T, Tomaru N, Kuramoto
N, Murai M (1995) A linkage map for sugi (Cryptomeria japonica) based on RFLP, RAPD,
and isozyme loci. Theor Appl Genet 90: 835–840.
Murray B (1998) Nuclear DNA amounts in gymnosperms. Ann Bot 82: 3–15.
Namroud M-C, Beaulieu J, Juge N, Laroche J, Bousquet J (2008) Scanning the genome for
gene single nucleotide polymorphisms involved in adaptive population differentiation
in white spruce. Mol Ecol 17: 3599–3613.
Neale DB, Krutovsky KV (2004) Comparative genetic mapping in trees: The group of conifers.
In: H Lörz, G Wenzel (ed) Biotechnology in Agriculture and Forestry: Molecular Marker
Systems in Plant Breeding and Crop Improvement. Springer, Berlin, Germany, pp
267–277 .
Neale DB, Kinlaw CS, Sewell MM (1994) Genetic mapping and DNA sequencing of the loblolly
pine genome. For Genet 1: 197–206.
Nelson CD, Nance WL, Doudrick RL (1993) A partial genetic linkage map of slash pine (Pinus
elliottii Engelm. var. elliottii) based on random amplified polymorphic DNAs. Theor Appl
Genet 87: 145–151.
Nelson CD, Kubisiak TL, Stine M, Nance WL (1994) A genetic linkage map of longleaf pine
(Pinus palustris Mill.) based on random amplified polymorphic DNAs. J Hered 85:
433–439.
Nelson CD, Peterson DG, Echt CS, Whetten R, Krutovsky KV, Yuceer C, Dean JF (2008) Pine
physical mapping and genome sequencing. In: Plant Anim Genome XVI Conf, San
Diego, CA, USA.
Nikaido AM, Ujino T, Iwata H, Yoshimura K, Yoshimura H, Suyama Y, Murai M, Nagasaka
K, Tsumura Y (2000) AFLP and CAPS linkage maps of Cryptomeria japonica. Theor Appl
Genet 100: 825–831.
Nurul Islam-Faridi M, Dana Nelson C, Kubisiak T (2007) Reference karyotype and
cytomolecular map for loblolly pine (Pinus taeda L.) Genome 50: 241–251
Ott J (1999) Analysis of human genetic linkage, 3rd edn. Johns Hopkins Univ Press, Baltimore,
Maryland, USA.
Paglia GP, Olivieri AM, Morgante M (1998) Towards second generation STS (sequence-tagged
sites) linkage maps in conifers: a genetic map of Norway spruce (Picea abies K.). Mol Gen
Genet 258: 466–478.
Pavy N, Parsons LS, Paule C, MacKay J, Bousquet J (2006) Automated SNP detection from a
large collection of white spruce expressed sequences: contributing factors and approaches
for the categorization of SNPs. BMC Genom 7: 174.
Pavy N, Pelgas B, Beauseigle S, Blais S, Gagnon F, Gosselin I, Lamothe M, Isabel N, Bousquet
J (2008) Enhancing genetic mapping of complex genomes through the design of highlymultiplexed SNP arrays: application to the large and unsequenced genomes of white
spruce and black spruce. BMC Genom 9: 21.
Pelgas B, Isabel N, Bousquet J (2004) Efficient screening for expressed sequence tag
polymorphisms (ESTPs) by DNA pool sequencing and denaturing gradient gel
electrophoresis (DGGE) in spruces. Mol Breed 13: 263–279.
Pelgas B, Bousquet J, Beauseigle S, Isabel N (2005) A composite linkage map from two crosses
for the species complex Picea mariana × P. rubens and analysis of synteny with other
Pinaceae. Theor Appl Genet 111: 1466–1488.
Pelgas B, Beauseigle S, Acheré V, Jeandroz S, Bousquet J, Isabel N (2006) Comparative genome
mapping among Picea glauca, P. mariana x P. rubens and P. abies, and correspondence with
other Pinaceae. Theor Appl Genet 113: 1371–1393.
Copyright reserved © 2011
236
Genetics, Genomics and Breeding of Conifers
Pelgas B, Bousquet J, Meirmans P, Ritland K, Isabel N (2011) QTL mapping in white spruce:
gene maps and genomic regions underlying adaptive traits across pedigrees, years and
environments. BMC Genomics (in press).
Perry DJ, Bousquet J (1998) Sequence-tagged-site (STS) markers of arbitrary genes: development,
characterization and analysis of linkage in black spruce. Genetics 149: 1089–1098.
Plomion C, O’Malley DM (1996) Recombination rate differences for pollen parents and seed
parents in Pinus pinaster. Heredity 77: 341–350.
Plomion C, Bahrman N, Durel C-E, O’Malley DM (1995a) Genomic mapping in Pinus pinaster
(maritime pine) using RAPD and protein markers. Heredity 74: 661–668.
Plomion C, O’Malley DM, Durel CE (1995b) Genomic analysis in maritime pine (Pinus pinaster).
Comparison of two RAPD maps using selfed seeds and open-pollinated of the same
individual. Theor Appl Genet 90: 1028–1034.
Plomion C, Durel C-E, OMalley DM (1996a) Genetic dissection of height in maritime pine
seedlings raised under accelerated growth conditions. Theor Appl Genet 93: 849–858.
Plomion C, Yani A, Marpeau A (1996b) Genetic determinism of delta 3-carene in maritime
pine using RAPD markers. Genome 39: 1123–1127.
Pot D, Rodrigues J-C, Rozenberg P, Chantre G, Tibbits J, Cahalan C, Pichavant F, Plomion C
(2006) QTLs and candidate genes for wood properties in maritime pine (Pinus pinaster
Ait.). Tree Genet Genomes 2: 10–24.
Ralph SG, Chun HJE, Kolosova N, Cooper D, Oddy C, Ritland CE, Kirkpatrick R, Moore R, Barber
S, Holt RA, Jones SJM, Marra MA, Douglas CJ, Ritland K, Bohlmann J (2008) A conifer
genomics resource of 200,000 spruce (Picea spp.) ESTs and 6,464 high-quality, sequencefinished full-length cDNAs for Sitka spruce (Picea sitchensis). BMC Genom 9: 484.
Ran J-H, Wei X-X, Wang X-Q (2006) Molecular phylogeny and biogeography of Picea
(Pinaceae): Implications for phylogeographical studies using cytoplasmic haplotypes.
Mol Phylogenet Evol 41: 405–419.
Remington DL, Purugganan MD (2003) Candidate genes, quantitative trait loci, and functional
trait evolution in plants. Int J Plant Sci 164: S7–S20.
Remington DL, Whetten RW, Liu B-H, O’Malley DM (1999) Construction of an AFLP genetic
map with nearly complete genome coverage in Pinus taeda. Theor Appl Genet 98:
1279–1292.
Remm M, Storm CEV, Sonnhammer ELL (2001) Automatic clustering of orthologs and inparalogs from pairwise species comparisons. J Mol Biol 314: 1041–1052.
Ritland C, Ritland K (2000) DNA-fragment markers in plants. In: AJ Baker (ed) Molecular
Methods in Ecology. Blackwell Publ, Oxford, UK, pp 208–234.
Ritter E, Aragonés A, Markussen T, Acheré V, Espinel S, Fladung M, Wrobel S, Faivre-Rampant
P, Favre J-M (2002) Towards construction of an ultra high density linkage map for Pinus
pinaster. Ann For Sci 596: 37–643.
Rockman MV (2008) Reverse engineering the genotype-phenotype map with natural genetic
variation. Nature 456: 738–744.
Rose AM, Baillie DL (1979) The effect of temperature and parental age on recombination and
nondisjunction in Caenorhabditis elegans. Genetics 92: 409–418.
Rungis D, Bérubé Y, Zhang J, Ralph S, Ritland CE, Ellis BE, Douglas C, Bohlmann J, Ritland
K (2004) Robust simple sequence repeat markers for spruce (Picea spp.) from expressed
sequence tags. Theor Appl Genet 109: 1283–1294.
Savard L, Li P, Strauss SH, Chase MW, Michaud M, Bousquet J (1994) Chloroplast and nuclear
gene sequences indicate Late Pennsylvanian time for the last common ancestor of extant
seed plants. Proc Nat Acad Sci USA 91: 5163–5167.
Sax K (1923) The association of size differences with seed-coat pattern and pigmentation in
Phaseolus vulgaris. Genetics 8: 552–560.
Scotti I, Burelli A, Cattonaro F, Chagné D, Fuller J, Hedley PE, Jansson G, Lalanne C, Madur D,
Neale D, Plomion C, Powell W, Troggio M, Morgante M (2005) Analysis of the distribution
of marker classes in a genetic linkage map: a case study in Norway spruce (Picea abies
karst). Tree Genet Genomes 1: 93–102.
Copyright reserved © 2011
Genetic Mapping in Conifers 237
Seaton G, Haley CS, Knott SA, Kearsey M, Visscher PM (2002) QTL Express: mapping
quantitative trait loci in simple and complex pedigrees. Bioinformatics 18: 339–340.
Sewell MM, Sherman BK, Neale DB (1999) A consensus map for loblolly pine (Pinus taeda
L.) I. Construction and integration of individual linkage maps from two outbred threegeneration pedigrees. Genetics 151: 321–330.
Sewell MM, Bassoni DL, Megraw RA, Wheeler NC, Neale DB (2000) Identification of QTLs
influencing wood property traits in loblolly pine (Pinus taeda L.). I. Physical wood
properties. Theor Appl Genet 101: 1273–1281.
Sewell MM, Davis MF, Tuskan GA, Wheeler NC, Elam CC, Bassoni DL, Neale DB (2002)
Identification of QTLs influencing wood property traits in loblolly pine (Pinus taeda L.)
II. Chemical wood properties. Theor Appl Genet 104: 214–222.
Shepherd M, Cross M, Dieters MJ, Henry R (2003a) Genetic maps for Pinus elliottii var. elliottii
and P. caribaea var. hondurensis using AFLP and microsatellite markers. Theor Appl Genet
106: 1409–1419.
Shepherd M, Cross M, Dieters MJ, Harding K, Kain D, Henry R (2003b) Genetics of physical
wood properties and early growth in a tropical pine hybrid. Can J For Res 33: 1923–
1932.
Shepherd M, Huang S, Eggler P, Cross M, Dale G, Dieters M, Henry R (2006) Congruence in
QTL for adventitious rooting in Pinus elliottii X Pinus caribaea hybrids resolves between
and within-species effects. Mol Breed 18: 11–28.
Skov E, Wellendorf H (1998) A partial linkage map of Picea abies clone V6470 based on
recombination of RAPD-markers in haploid megagametophytes. Silvae Genet 47:
273–282.
Sobel E, Papp JC, Lange K (2002) Detection and integration of genotyping errors in statistical
genetics. Am J Hum Genet 70: 496–508.
Soderlund C, Humphray S, Dunham A, French L (2000) Contigs built with fingerprints,
markers, and FPC V4.7. Genome Res 10: 1772–1787.
Stam P (1993) Construction of integrated genetic linkage maps by means of a new computer
package: JoinMap. Plant J 3: 739–744.
Strasburg JL, Scotti-Saintagne C, Scotti I, Lai Z, Rieseberg LH (2009) Genomic patterns of
adaptive divergence between chromosomally differentiated sunflower species. Mol Biol
Evol 26: 1341–1355.
Taira H, Teranishi H, Kenda S (1993) A case study of male sterility in sugi (Cryptomeria japonica)
(in Japanese with English summary). J Jap For Soc 75: 377–379.
Tani N, Takahashi T, Iwata H, Mukai Y, Ujino-Ihara T, Matsumoto A, Yoshimura K, Yoshimaru
H, Murai M, Nagasaka K, Tsumura Y (2003) A consensus linkage map for sugi (Cryptomeria
japonica) from two pedigrees, based on microsatellites and expressed sequence tags.
Genetics 165: 1551–1568.
Tani N, Takahashi T, Ujino-Ihara T, Iwata H, Yoshimura K, Tsumura Y (2004) Development and
characteristics of microsatellite markers for sugi (Cryptomeria japonica D. Don) derived
from microsatellite-enriched libraries. Ann For Sci 61: 569–575.
Theissen G (2005) Birth, life and death of developmental control genes: New challenges for
the homology concept. Theory Biosci 124: 199–212.
Tong C, Shi J (2004) Constructing genetic linkage maps in Chinese-fir using F1 progeny. Acta
Genet Sin 31: 1149–1156.
Travis SE, Ritland K, Whitham TG, Keim P (1998) A genetic linkage map of pinyon pine
(Pinus edulis) based on amplified fragment length polymorphisms. Theor Appl Genet
97: 871–880.
Tsumura Y, Ogihara Y, Sasakuma T, Ohba K (1993) Physical map of chloroplast DNA in sugi,
Cryptomeria japonica. Theor Appl Genet 86: 166–172.
Tsumura Y, Suyama Y, Yoshimura K, Shirato N, Mukai Y (1997) Sequence-tagged-sites (STSs)
of cDNA clones in Cryptomeria japonica and their evaluation as molecular markers in
conifers. Theor Appl Genet 94: 764–772.
Copyright reserved © 2011
238
Genetics, Genomics and Breeding of Conifers
Tulsieram LK, Glaubitz JC, Kiss G, Carlson JE (1992) Single tree genetic linkage mapping in
conifers using haploid DNA from megagametophytes. Bio/Technology 10: 686–690.
Ujino-Ihara T, Yoshimura K, Ugawa Y, Yoshimaru H, Nagasaka K, Tsumura Y (2000) Expression
analysis of ESTs derived from the inner bark of Cryptomeria japonica. Plant Mol Biol 43:
451–457.
Ujino-Ihara T, Kanamori H, Yamane H, Taguchi Y, Namiki N, Mukai Y, Yoshimura K, Tsumura
Y (2005) Comparative analysis of expressed sequence tags of conifers and angiosperms
reveals sequences specifically conserved in conifers. Plant Mol Biol 59: 895–907.
Ukrainetz NK, Ritland K, Mansfield SD (2008a) An AFLP linkage map for Douglas-fir based
upon multiple full-sib families. Tree Genet Genomes 4: 181–191.
Ukrainetz NK, Ritland K, Mansfield SD (2008b) Identification of quantitative trait loci for
wood quality and growth across eight full-sib coastal Douglas-fir families. Tree Genet
Genomes 4: 159–170.
Verhoeven KJF, Jannink J-L, McIntyre LM (2005) Using mating designs to uncover QTL and
the genetic architecture of complex traits. Heredity 96: 139–149.
Vision TJ, Brown DG, Shmoys DB, Durrett RT, Tanksley SD (2000) Selective mapping: a strategy
for optimizing the construction of high-density linkage maps. Genetics 155: 407–420.
Vos P, Hogers R, Bleeker M, Reijans M, van de Lee T, Hornes M, Frijters A, Pot J, Peleman J,
Kuiper M, Zabeau M (1995) AFLP: a new technique for DNA fingerprinting. Nucl Acids
Res 23: 4407–4414.
Weising K, Nybom H, Wolff K, Kahl G (2005) DNA Fingerprinting in Plants: Principles,
Methods, and Applications, 2nd edn. CRC Press, Boca Raton, Florida, USA.
Wheeler NC, Jermstad KD, Krutovsky K, Aitken SN, Howe GT, Krakowski J, Neale DB (2005)
Mapping of quantitative trait loci controlling adaptive traits in coastal Douglas-fir. IV.
Cold-hardiness QTL verification and candidate gene mapping. Mol Breed 15: 145–156.
White TL, Adams WT, Neale DB (2007) Forest Genetics. CABI Pub, Wallingford, Oxfordshire,
UK; Cambridge, Massachusetts, USA.
Wilcox PL, Richardson TE, Corbett GE, Ball RD, Lee JR, Djorovic A, Carson SD (2001)
Framework linkage maps of Pinus radiata D. Don based on pseudotestcross markers.
For Genet 8: 109–117.
Williams CG (1998) QTL Mapping in outbred pedigrees. In: AH Paterson (ed) Molecular
Dissection of Complex Traits. CRC Press, Boca Raton, Florida, USA, pp 81–94.
Williams CG, Reyes-Valdes MH, Huber DA (2007) Validating a QTL region characterized by
multiple haplotypes. Theor Appl Genet 116: 87–94.
Williams JGK, Kubelik AR, Livak KJ, Rafalski JA, Tingey SV (1990) DNA polymorphisms
amplified by arbitrary primers are useful as genetic markers. Nucl Acids Res 18:
6531–6535.
Wu R, Ma C-X, Painter I, Zeng Z-B (2002) Simultaneous maximum likelihood estimation of
linkage and linkage phases in outcrossing species. Theor Pop Biol 61: 349–363.
Yazdani R, Yeh FC, Rimsha J (1995) Genomic mapping of Pinus sylvestris (L.) using random
amplified polymorphic DNA markers. For Genet 2: 109–116.
Yazdani R, Nilsson JE, Plomion C, Mathur G (2003) Marker trait association for autumn cold
acclimation and growth rhythm in Pinus sylvestris. Scand J For Res 18: 29–38.
Yin T-M, Wang X-R, Andersson B, Lerceteau-Köhler E (2003) Nearly complete genetic maps of
Pinus sylvestris L. (Scots pine) constructed by AFLP marker analysis in a full-sib family.
Theor Appl Genet 106: 1075–1083 .
Yoshimaru H, Ohba K, Tsurumi K, Tomaru N, Murai M, Mukai Y, Suyama Y, Tsumura Y,
Kawahara T, Sakamaki Y (1998) Detection of quantitative trait loci for juvenile growth,
flower bearing and rooting ability based on a linkage map in sugi (Cryptomeria japonica
D. Don). Theor Appl Genet 97: 45–50.
Zhang P, Li W, Fellers J, Friebe B, Gill BS (2004) BAC-FISH in wheat identifies chromosome
landmarks consisting of different types of transposable elements. Chromosoma 112:
288–299.
Copyright reserved © 2011
452
Genetics, Genomics and Breeding of Conifers
Chapter 5
Figure 5-3 Comparison of homologous linkage groups between white spruce (Picea glauca)
and black spruce (species complex Picea mariana × P. rubens).
Copyright reserved © 2011