Upload
luis
View
222
Download
0
Embed Size (px)
Citation preview
8/3/2019 QTLrevTIPS
1/4
Believe it or not, QTLs are accurate!Adam H. Price
School of Biological Sciences, University of Aberdeen, Aberdeen, UK AB24 3UU
It is generally believed that mapping quantitative trait
loci (QTLs) does not accurately position genes under-
lying polygenic traits on the genome, which limits the
application of QTL analysis in marker-assisted selection
and gene discovery. However, now that a few plant
QTLs have been cloned or accurately tagged, it appears
that they might be accurate to within 2 cM or less. This
means that there will be circumstances when map-
based cloning using only original mapping data would
be a realistic option that avoids time-consuming and
expensive fine mapping. Acceptance of this view would
enhance the value of past and future mapping exper-iments, particularly those revealing small and environ-
mentally sensitive QTLs that are often considered
intractable at the molecular level.
Map-based cloning and the accuracy of QTLs
The genes responsible for genetic variation of quantitat-
ively variable traits constitute the vast majority of the
functional genetic diversity of the biosphere and probably
represent the main sites at which selection influences
evolutionary heredity. In the early days of genetics, major
genes were the focus of the first inheritance studies and
only later did the focus widen to include characterization
of the biometrics of more complex traits [1]. Likewise, the
technology of gene cloning, which originally targetedmajor genes, is increasingly including those genes
responsible for quantitative, multigenic traits [2]. Identi-
fying the genes behind these quantitative trait loci (QTLs)
has been described as the greatest challenge for geneti-
cists this century [3]. A powerful way to characterize these
genes (in terms of numbers and relative contribution) is to
use a mapping population to identify QTLs: there has been
a tenfold increase in the number of QTL studies published
annually over the past 10 years. Once QTLs have been
identified, the next challenge is to identify the genes. One
of the most promising ways to do this is positional cloning,
where the QTL is linked to the physical sequence of the
genome via the sequence of large insert clones [e.g.bacterial artificial chromosomes (BACs)] [2]. For those
species that have been sequenced, there should be no need
to generate the large insert clones because gene order is
already known. All that is required is to locate a QTL on
the sequence and then look for candidate genes. However,
a big obstacle exists in attempting to link a QTL on a
genetic map of a primary mapping population to a position
on a sequence map. Theory suggests that the positioning
of a QTL in a primary population is not accurate, covering
a region up to and over 20 cM depending on the type of
population, the number of individuals scored, and the
quality of the data [35]. The 1 likelihood of odds (LOD)
support interval with which QTLs are commonly reported
is often a large region covering 10 to 30 cM [5]. This region
spans the location on the genome where the statistical
support for the QTL is within one order of magnitude of
the peak statistic (Figure 1). In most species, this covers in
the order of one hundredth of the entire genetic map and
could include up to 2000 genes. Such a large number of
genes cannot be tested for candidacy so it is accepted that
to identify candidate genes based on map position, the
mapping must be done more accurately. This involvesMendelianizing the QTL in a near isogenic pair that is
used to make a secondary population for fine mapping. In
this approach, many individuals (sometimes O1000) are
genotyped for markers around the QTL. Those that show
recombination in the region are phenotyped for the trait,
allowing a much more accurate QTL localization, nor-
mally to !1 cM. This distance can represent from 50
genes to as few as 1 gene. However, this fine mapping
requires considerable expense and is practically difficult if
the QTL effect is small because the small genetic effect
limits the ability to assess phenotypic differences accu-
rately between allelic variants in the fine mapping
population. Most small QTLs will not be tractable using
fine mapping. But perhaps the original QTL mapping ismore accurate than was previously supposed.
Tagged or cloned genes are near their original QTL
position
A growing number of natural allelic variations in plants
have now been successfully characterized to the gene or
individual sequence polymorphism [4], and this includes a
few examples of cloning genes for QTLs. A recent article in
Trends in Plant Science also highlights the success of
cloning QTLs [2]. These first plant QTLs are now being
isolated and QTLs are being tagged using the fine
mapping approach and, therefore, the precision of QTL
analysis can now be evaluated (Table 1). The concept isillustrated in Figure 1. However, it is important to
distinguish between major QTLs, where a single locus
explains a large proportion of the genetic variation, and
the small QTLs exemplified in Figure 1. This is because
the precision of QTL location is considered to be
proportional to its contribution to the heritability of the
trait [5]. For example, it has been shown theoretically that
if five QTLs of equal size (effect) each control a trait of
heritability of 50% and, therefore, each have a heritability
of 10%, they can be placed with 95% confidence into a
region of 30 cM when tested on 300 F2 plants [5]. Accuracy
of gene position is theoretically increased by choosing onlyCorresponding author: Price, A.H. ([email protected] ).
Available online 17 April 2006
Opinion TRENDS in Plant Science Vol.11 No.5 May 2006
www.sciencedirect.com 1360-1385/$ - see front matter Q 2006 Elsevier Ltd. All rights reserved. doi:10.1016/j.tplants.2006.03.006
mailto:[email protected]://www.sciencedirect.com/http://www.sciencedirect.com/mailto:[email protected]8/3/2019 QTLrevTIPS
2/4
to study QTLs with a relatively large contribution to the
trait in question (i.e. major QTLs) or by increasing the
heritability of the small QTL (by reducing environmental
variation, by having more replicates or by combining
analysis of several traits that the gene affects pleiotropi-
cally (e.g. [8]). Here I attempt to assess the position of
cloned and a few tagged genes relative to the original
mapping position where the data published are suffi-
ciently detailed to allow it (usually it is not). Adifferentiation is made between major and small QTLs
using the arbitrary criteria whereby a small QTL is one
where its contribution to overall variation is in the order of
25% or less.
Accuracy of major QTLs
A fruit size QTL of tomato called fw2.2 was one of the first
tagged QTL in plants and explained 3047% of the
phenotypic variation in a population of 264 BC1s [9];
subsequently, this QTL has been cloned [10] and was
found to be within 1.6 cM of its original QTL peak. Ovate
is another major QTL affecting tomato fruit shape andexplained 4767% of variation [11]; this has now been
characterized [12] and was found to be at the marker
originally identified. In Arabidopsis, the FLOWERING1
(FLW1) QTL, which explained 27% and 62% of variation in
flowering time of long and short days, respectively, has
been detected at a marker for the underlying gene [13].
Another Arabidopsis flowering time gene (Cry2) has been
shown to be responsible [14] for a QTL explaining between
20% and 55% of variation in leaf number [15] and was
between 0.8 and 1.6 cM from the QTL peak for this trait,
with the mean QTL position being only 0.1 cM from the
gene. Another QTL in Arabidopsis that now has a
molecular explanation is the transpiration efficiency
QTL on chromosome 1, which explained 2164% of
variation in carbon isotope discrimination, and was
found to be within 1 cM of the ERECTA gene, which is
responsible [16]. In wheat, a cold-regulated transcription
factor, Cbf3, has been identified as a candidate gene
located at 2.8 and 1.6 cM from QTL peaks for frost
tolerance assessed in different years and less than
0.5 cM from the mean position of those screens [17]. In
this example, the QTLs explained between 40% and 48%
of the variation. A major QTL in wheat explaining 66% ofthe variation for grain protein content was first mapped to
marker Xmwg79 in a population derived from a chromo-
some substitution line that was therefore segregating in
only one chromosome [18]. The peak has proved to be
within 0.2 cM of the tagged gene [19]. The gene Ppd-H1,
which regulates photoperiod response, has been identified
[20] after fine mapping [21] and is located 1.9 cM from the
original position of a QTL described as having a highly
significant effect on flowering time and identified in 94
double haploid populations [22]. In soybean, a QTL (FT1)
explaining 62% of the variation in flowering time
originally mapped to marker satt365 [23], which has
subsequently been found to be 0.1 cM from the gene afterfine mapping [24]. A gene controlling flowering time has
been isolated from Brassica [25] that was within 1 cM of
the peak LOD of a QTL explaining 45% of the variation
[26]. Another example from Brassica is provided by Johan
Pelemanet al. [27]: a QTL explaining 43% of the variation
for euric acid content was mapped to 11.3 cM in 184 F 2plants, and subsequent fine mapping revealed its accurate
location to be 12.3 cM.
Accuracy of small QTLs (low heritability)
QTLs for several traits centred on the teosinte branched 1
(tb1) locus in two maize ! teosinte crosses have been
reported [28]. Despite being a major gene for branching,
TRENDS in Plant Science
0
2
4
6
8
10
100 120 140 160 180 200 220
sd1
Position on chromosome 1 (cM)
One LOD supportinterval for each trait
DroughtControlAverageLow nitrogenLow light
LOD
Figure 1. The concept of QTL accuracy. QTL scans for plant height from 160 recombinant inbred lines (RILs) of the Bala!Azucena mapping population of rice [6] are shown,
together with the position of the sd1 (semi-dwarfing) locus. The gene is a gibberellin oxidase [7] and Bala has the 383 bp deletion mutant allele. It maps to 176 cM in this
population.In different environments, plant height QTLsexplain 7.8to 14.6% of the variation and peaks occurat 166,171, 173and 183 cM witha meanposition of 173 cM. The
1 LOD confidence intervals range from 10-18 cM. The position of the QTL obtained by combining all data across all environments (orange) is 174 cM, only 2 cM from the
strong candidate gene. For the drought treatment (blue), the blue broken lines indicate the generation of the 1 LOD support interval.
Opinion TRENDS in Plant Science Vol.11 No.5 May 2006214
www.sciencedirect.com
http://www.sciencedirect.com/http://www.sciencedirect.com/8/3/2019 QTLrevTIPS
3/4
this locus displays pleiotropy and QTLs for several traits
mapped to the locus. The location of QTLs explaining
1225% of the variation [29] ranged up to 9 cM either side
of the tb1 locus, but the mean position of all 14 QTLs was
less than 1 cM from the gene that was subsequently
identified [30].
In rice, Masahiro Yanos group have tagged or isolated
the genes for five heading date (Hd) QTLs [31,32]. Genes
Hd1Hd5 were found to be 0.5, 0.3, 0.0, 2.6 and 1.2 cM,
respectively, from the position originally identified by a
single mapping experiment [33]. Although the contri-bution of each QTL to the overall variation was not given,
because five QTLs were detected at least three must be
small QTLs by the definition used here. Yanos group has
also tagged a gene for phosphate uptake [34] within 1 cM
of the QTL peaks for phosphate uptake and three other
related traits that explained 1928% of the variation in a
population of 98 backcross inbred lines [35]. Tagging of an
environmentally sensitive grain weight QTL explaining
1017% of variation in an advanced backcross population
of258 BC2F2s, which was originally mapped to the nearest
marker RZ452 [36], revealed the fine map location to be
1.6 cM from that marker [37]. In the out-breeding crop
potato, an invertase gene is an exceptionally strong
candidate for a sugar content QTL [38] that explained
5.714.5% of the phenotypic variation [39] and mapped
less than 3 cM from it although the position of the peak is
not given, just the nearest marker CP137.
Map-based cloning without fine mapping
From the data presented in Table 1, the position of genes
underlying major QTLs ranges from 0.0 to 1.9 cM and the
mean is less than 0.7 cM, whereas for small QTLs the
range is 0 to !3 cM and the mean is !1.2 cM. Although
the list given in Table 1 might not be exhaustive, an
attempt was made to find as many examples as possible. It
is possible that the literature itself has a bias because
examples where QTL are distant from the underlying
gene are not represented or because they have proved
more difficult to clone or to tag. None the less, these data
indicate that the position of a QTL obtained from a
primary mapping population can be an order of magnitude
more accurate than is often stated. This appears to be true
even for small QTLs, particularly where accuracy can be
improved by averaging peak positions from different
screens or by combining multiple data sets to increase
the heritability of the trait. This implies that a successfulapproach to identifying candidate genes for small QTLs
for which fine mapping is likely to be problematic but
where multiple data sets are available to improve
accuracy, is to test the candidacy of genes within 12 cM
either side of the mean QTL position detected in the
primary mapping population. The genes around the QTL
can be tested to see if: (i) they are expressed in the
conditions in which the QTL is detected; (ii) if they have
allelic diversity in expression; or (iii) if they show amino
acid sequence polymorphism between the parents of the
mapping population (or populations) displaying the QTL.
It has been demonstrated that expression arrays
represent a powerful way to address points (i) and (ii) inplants and animals [13,40]. Proof of function can then be
strengthened by: characterizing mutants of the candidate
gene; association mapping (e.g. [25,41,42]); and conduct-
ing gene complementation (replacing one allele by
another) either by crossing (e.g. [43]) or by transgenics
(e.g. [16]). Positional cloning of small QTL without fine
mapping appears to be a realistic possibility for species
such as rice and Arabidopsis that have been sequenced
and, although the probability of success will inevitably
depend on the quality of the mapping and trait data, this
realization should greatly increase the potential value of
past and future QTL mapping experiments.
Table 1. The distance between original QTL peak position and subsequently tagged or cloned genes in plant species
Species Trait Gene or tagged locus Mapping populationa Distance to original LOD
peak (cM)
Refs
Major QTLs
Tomato Fruit size fw2.2 264 BC1s !1.6 [9,10]
Tomato Fruit shape Ovate 82 F2s 0.0 [11,12]
Arabidopsis Flowering time FLW1 98 RILs 0.0 [13]
Arabidopsis Flowering time CRY2 162 RILs 0.1b [14,15]
Arabidopsis Transpiration ERECTA 100 RILs !1.0 [16]
Wheat Frost tolerance Cbf3 74 RILs 0.1b
[17]Wheat Grain protein GPC 85 RICLs 0.2 [18,19]
Barley Photoperiod response Ppd-H1 94 DH 1.9 [2022]
Soybean Flowering time FT1 156 RILs 0.4 [23,24]
Brassica Flowering time COL1 88 BC1s 1.0 [25,26]
Brassica Euric acid content E1 184 F2s 1.0 [27]
Small QTLs
Maize! teosinte Shoot morphology tb1 290 F2s 0.6b [2830]
Rice Heading date Hd1 186 F2s 0.5 [31,33]
Rice Heading date Hd2 186 F2s 0.3 [31,33]
Rice Heading date Hd3 186 F2s 0.0 [31,33]
Rice Heading date Hd4 186 F2s 2.6 [30,32]
Rice Heading date Hd5 186 F2s 1.2 [30,32]
Rice P uptake Pup1 98 BILs 1.0 [34,35]
Rice Grain weight gw3.1 258 BC2F2s !1.6 [36,37]
Potato Sugar content inv/GE 146 F1s !3.0 [38,39]a
Abbreviations: BC1, backcross 1; BC2F2, selfed backcross 2; BIL, backcross inbred lines; DH, double haploids;RICL, recombinant inbred chromosome lines; RIL, recombinantinbred lines. Note, because potato is inbreeding, an F 1 is a segregating population.bPosition based on mean position of multiple traits or trait screens.
Opinion TRENDS in Plant Science Vol.11 No.5 May 2006 215
www.sciencedirect.com
http://www.sciencedirect.com/http://www.sciencedirect.com/8/3/2019 QTLrevTIPS
4/4
AcknowledgementsThe data presented in Figure 1 was gathered by Keith MacMillan in a
project funded by the BBSRC (project no. P13058).
References1 Thoday, J.M. (1961) Location of polygenes. Nature 191, 368370
2 Salvi, S. and Tuberosa, R. (2005) To clone or not to clone plant QTLs:
present and future challenges. Trends Plant Sci. 10, 297304
3 Luo, Z.W. et al. (2002) Precision and high-resolution mapping of
quantitative trait loci by use of recurrent selection, backcross orintercross schemes. Genetics 161, 915929
4 Alonso-Blanco, C. et al. (2005) From phenotypic to molecular
polymorphisms involved in naturally occurring variation of plant
development. Int. J. Dev. Biol. 49, 717732
5 Kearsey, M.J. and Farquhar, A.G.L. (1998) QTL analysis in plants;
where are we now? Heredity 80, 137142
6 Price, A.H. et al. (2000) A combined RFLP and AFLP linkage map of
upland rice (Oryza sativa L.) used to identify QTLs for root-
penetration ability. Theor. Appl. Genet. 100, 4956
7 Sasaki, A. et al. (2002) Green revolution: a mutant gibberellin-
synthesis gene in rice new insight into the rice variant that helped to
avert famine over thirty years ago. Nature 416, 701702
8 Korol, A.B. et al. (2001) Enhanced efficiency of quantitative trait loci
mapping analysis based on multivariate complexes of quantitative
traits. Genetics 157, 17891803
9 Alpert, K.B. et al. (1995) fw-2.2: A major QTL controlling fruit weightis common to both red-fruited and green-fruited tomato species. Theor.
Appl. Genet. 91, 9941000
10 Frary, A. et al. (2000) fw2.2: A quantitative trait locus key to the
evolution of tomato fruit size. Science 289, 8588
11 Ku, H.M. et al. (1999) The genetic basis of pear-shaped tomato fruit.
Theor. Appl. Genet. 99, 844850
12 Liu, J. et al. (2002) A new class of regulatory genes underlying the
cause of pear-shaped tomato fruit. Proc. Natl. Acad. Sci. U. S. A. 99,
1330213306
13 Werner, J.D. et al. (2005) Quantitative trait locus mapping and DNA
array hybridization identify an FLM deletion as a cause for natural
flowering-time variation. Proc. Natl. Acad. Sci. U. S. A. 102,
24602465
14 El-Din El-Assal, S. et al. (2001) A QTL for flowering time in
Arabidopsis reveals a novel allele of CRY2. Nat. Genet. 29, 435440
15 Alonso-Blanco, C. et al. (1998) Analysis of natural allelic variation atflowering time loci in the Landsberg erecta and Cape Verde islands
ecotypes of Arabidopsis thaliana. Genetics 149, 749764
16 Masle, J.et al. (2005) TheERECTA gene regulates plant transpiration
efficiency in Arabidopsis. Nature 436, 866870
17 Vagujfalvi, A.et al. (2003) The cold-regulated transcriptional activator
Cbf3 is linked to the frost-tolerance locus Fr-A2 on wheat chromosome
5A. Mol. Genet. Genomics 269, 6067
18 Joppa, L.R. et al. (1997) Mapping gene(s) for grain protein in
tetraploid wheat (Triticum turgidum L.) using a population of
recombinant inbred chromosome lines. Crop Sci. 37, 15861589
19 Distelfeld, A. et al. (2004) Microcolinearity between a 2-cM region
encompassing the grin protein content locus Gpc-6B1 on wheat
chromosome 6B and a 350-kb region on rice chromosome 2. Funct.
Integr. Genomics 4, 5966
20 Turner, A. et al. (2005) The pseudo-response regulator Ppd-H1
provides adaptation to photoperiod in barley. Science 310, 10311034
21 Decousset, L. et al. (2000) Development of STS markers closely linked
to the Ppd-H1 photoperiod response gene of barley (Hordeum vulgare
L.). Theor. Appl. Genet. 101, 12021206
22 Laurie, D.A. et al. (1995) RFLP mapping of 5 major genes and 8
quantitative trait loci controlling flowering time in a winter x spring
barley (Hordeum-vulgare l) cross. Genome 38, 575585
23 Watanabe, S. et al. (2004) Analysis of QTLs for reproductive
development and seed quality traits in soybean using recombinant
inbred lines. Breed. Sci. 54, 399407
24 Yamanaka, N. et al. (2005) Fine mapping of the FT1 locus for soybean
flowering time using a residual heterozygous line derived from a
recombinant inbred line. Theor. Appl. Genet. 110, 634639
25 Osterberg, M.K.et al. (2002)Naturally occurringindelvariation in the
Brassica nigra COL1 gene is associated with variation in flowering
time. Genetics 161, 299306
26 Lagercrantz, U.etal. (1996) Comparativemapping inArabidopsis andBrassica, fine scale genome collinearity and congruence of genes
controlling flowering time. Plant J. 9, 1320
27 Peleman, J.D. et al. (2005) Quantitative trait locus (QTL) isogenic
recombinant analysis: a method for high-resolution mapping of QTL
within a single population. Genetics 171, 13411352
28 Doebley, J. et al. (1995) Teosinte branched1 and the origin of maize
evidence for epistasis and the evolution of dominance. Genetics 141,
333346
29 Doebley, J. and Stec, A. (1993) Inheritance of the morphological
differences between maize and teosinte comparison of results for two
F2 populations. Genetics 134, 559570
30 Doebley, J. et al. (1997) The evolution of apical dominance in maize.
Nature 386, 485488
31 Yamamoto, T.et al. (1998) Fine mapping of quantitative trait lociHd-1
Hd-2 and Hd-3 controlling heading date of rice as single Mendelian
factors. Theor. Appl. Genet. 97, 374432 Lin,H.et al. (2003) Fine mapping and characterization of quantitative
trait loci Hd4 andHd5 controlling heading date in rice. Breed. Sci. 53,
5159
33 Yano, M. et al. (1997) Identification of quantitative trait loci
controlling heading date in rice using a high-density linkage map.
Theor. Appl. Genet. 95, 10251032
34 Wissuwa, M. et al. (2002) Substitution mapping ofPup1: a major QTL
increasing phosphorus uptake of rice from a phosphorus-deficient soil.
Theor. Appl. Genet. 105, 890897
35 Wissuwa, M. et al. (1998) Mapping of QTLs for phosphorus-deficiency
tolerance in rice (Oryza sativa L.). Theor. Appl. Genet. 97, 777783
36 Thomson, M.J. et al. (2003) Mapping quantitative trait loci for yield,
yield components and morphological traits in an advanced backcross
population between Oryza rufipogon and the Oryza sativa cultivar
Jefferson. Theor. Appl. Genet. 107, 479493
37 Li, J. et al. (2004) Fine mapping of a grain-weight quantitative traitlocus in thepericentromeric regionof rice chromosome 3. Genetics 168,
21872195
38 Li, L. et al. (2005) DNA variation at the invertase locus invGE/GF is
associated with tuber quality traits in populations of potato breeding
clones. Genetics 170, 813821
39 Menendez, C.M. et al. (2002) Cold sweetening in diploid potato:
mapping quantitative trait loci and candidate genes. Genetics 162,
14231434
40 Wayne, M.L. and McIntyre, L. (2002) Combining mapping and
arraying: an approach to candidate gene identification. Proc. Natl.
Acad. Sci. U. S. A. 99, 1490314906
41 Olsen, K.M. et al. (2004) Linkage disequilibrium mapping of
Arabidopsis CRY2 flowering time alleles. Genetics 167, 13611369
42 Simko, I. et al. (2004) Linkage disequilibrium mapping of a
Verticillium dahliae resistance quantitative trait locus in tetraploid
potato ( Solanum tuberosum) through a candidate gene approach.
Theor. Appl. Genet. 108, 217224
43 Fridman, E. et al. (2000) A recombination hotspot delimits a wild-
species quantitative trait locus for tomato sugar content to 484 bp
within an invertase gene. Proc. Natl. Acad. Sci. U. S. A. 97,
47184723
Opinion TRENDS in Plant Science Vol.11 No.5 May 2006216
www.sciencedirect.com
http://www.sciencedirect.com/http://www.sciencedirect.com/