The family Apiaceae comprises approximately 3700 species of herbaceous plants, including important crops, aromatic herbs and field weeds. Here we report a study of 10 preserved historical or recent virus samples of apiaceous plants collected in United Kingdom (UK) import interceptions from the Mediterranean region (Egypt, Israel and Cyprus) or during surveys of Australian apiaceous crops. Seven complete new genomic sequences and one partial sequence of the apiaceous potyviruses apium virus Y (ApVY), carrot thin leaf virus (CaTLV), carrot virus Y (CarVY) and celery mosaic virus (CeMV) were obtained. When these 7 and 16 earlier complete non-recombinant apiaceous potyvirus sequences were subjected to phylogenetic analyses, they split into 2 separate lineages: 1 containing ApVY, CeMV, CarVY and panax virus Y and the other CaTLV, ashitaba mosaic virus and konjac virus Y. Preliminary dating analysis suggested the CarVY population first diverged from CeMV and ApVY in the 17th century ...
Potato virus X (PVX) occurs worldwide and causes an important potato disease. Complete PVX genomes were obtained from 326 new isolates from Peru, which is within the potato crop's main domestication center, 10 from historical PVX isolates from the Andes (Bolivia, Peru) or Europe (UK), and three from Africa (Burundi). Concatenated open reading frames (ORFs) from these genomes plus 49 published genomic sequences were analyzed. Only 18 of them were recombinants, 17 of them Peruvian. A phylogeny of the non-recombinant sequences found two major (I, II) and five minor (I-1, I-2, II-1, II-2, II-3) phylogroups, which included 12 statistically supported clusters. Analysis of 488 coat protein (CP) gene sequences, including 128 published previously, gave a completely congruent phylogeny. Among the minor phylogroups, I-2 and II-3 only contained Andean isolates, I-1 and II-2 were of both Andean and other isolates, but all of the three II-1 isolates were European. I-1, I-2, II-1 and II-2 all cont...
Potato virus V (PVV) causes a disease of potato (Solanum tuberosum) in South and Central America, Europe, and the Middle East. We report here the complete genomic sequences of 42 new PVV isolates from the potato’s Andean domestication center in Peru and of eight historical or recent isolates from Europe. When the principal open reading frames of these genomic sequences together with those of nine previously published genomic sequences were analyzed, only two from Peru and one from Iran were found to be recombinant. The phylogeny of the 56 nonrecombinant open reading frame sequences showed that the PVV population had two major phylogroups, one of which formed three minor phylogroups (A1 to A3) of isolates, all of which are found only in the Andean region of South America (Peru and Colombia), and the other formed two minor phylogroups, a basal one of Andean isolates (A4) that is paraphyletic to a crown cluster containing all the isolates found outside South America (World). This sugg...
Forty-seven potato virus A (PVA) isolates from Europe, Australia, and South America’s Andean region were subjected to high-throughput sequencing, and 46 complete genomes from Europe (n = 9), Australia (n = 2), and the Andes (n = 35) obtained. These and 17 other genomes gave alignments of 63 open reading frames 9,180 nucleotides long; 9 were recombinants. The nonrecombinants formed three tightly clustered, almost equidistant phylogroups; A comprised 14 Peruvian potato isolates; W comprised 37 from potato in Peru, Argentina, and elsewhere in the world; and T contained three from tamarillo in New Zealand. When five isolates were inoculated to a potato cultivar differential, three strain groups (= pathotypes) unrelated to phylogenetic groupings were recognized. No temporal signal was detected among the dated nonrecombinant sequences, but PVA and potato virus Y (PVY) are from related lineages and ecologically similar; therefore, “relative dating” was obtained using a single maximum-likel...
In this review, encouraged by the dictum of Theodosius Dobzhansky that “Nothing in biology makes sense except in the light of evolution”, we outline the likely evolutionary pathways that have resulted in the observed similarities and differences of the extant molecules, biology, distribution, etc. of the potyvirids and, especially, their largest genus, the potyviruses. The potyvirids are a family of plant-infecting RNA-genome viruses. They had a single polyphyletic origin, and all share at least three of their genes (i.e., the helicase region of their CI protein, the RdRp region of their NIb protein and their coat protein) with other viruses which are otherwise unrelated. Potyvirids fall into 11 genera of which the potyviruses, the largest, include more than 150 distinct viruses found worldwide. The first potyvirus probably originated 15,000–30,000 years ago, in a Eurasian grass host, by acquiring crucial changes to its coat protein and HC-Pro protein, which enabled it to be transmitt...
Potato virus Y (PVY) causes disease in potatoes and other solanaceous crops. The appearance of its necrogenic strains in the 1980s made it the most economically important virus of potatoes. We report the isolation and genomic sequences of 32 Peruvian isolates of PVY which, together with 428 published PVY genomic sequences, gave an alignment of 460 sequences. Of these, 190 (41%) were non-recombinant, and 162 of these provided a dated phylogeny that corresponds well with the likely history of PVY and shows that PVY originated in South America, which is where potatoes were first domesticated. The most basal divergences of the PVY population produced the N and C: O phylogroups; the origin of the N phylogroup is clearly Andean, but that of the O and C phylogroups is unknown, although they may have been first to establish in European crops. The current PVY population originated around 156 CE. PVY was probably first taken from South America to Europe in the 16th century in tubers. Most of t...
In 1976, a virus with flexuous, filamentous virions typical of the family Potyviridae was isolated from symptomatic pepino (Solanum muricatum) plants growing in two valleys in Peru’s coastal desert region. In 2014, a virus with similar-shaped virions was isolated from asymptomatic fruits obtained from pepino plants growing in six coastal valleys and a valley in Peru’s Andean highlands. Both were identified subsequently as Wild potato mosaic virus (WPMV) by serology or high-throughput sequencing (HTS). The symptoms caused by two old and seven new isolates from pepino were examined in indicator plants. Infected solanaceous hosts varied considerably in their sensitivities to infection and individual isolates varied greatly in virulence. All seven new isolates caused quick death of infected Nicotiana benthamiana plants and more than half of them killed infected plants of Physalis floridana and S. chancayense. These three species were the most sensitive to infection. The most virulent is...
Narcissus plants (Narcissus tazetta var. chinensis) showing mosaic or striping leaves were collected from around Japan, and tested for virus infections using potyvirus-specific primers. Many were found to be infected with a macluravirus and mixtures of different potyviruses, one third of them narcissus yellow stripe virus (NYSV)-like viruses. Genomes of nine of the NYSV-like viruses were sequenced and, together with four already published, provided data for phylogenetic and pairwise identity analyses of their place in the turnip mosaic virus (TuMV) phylogenetic group. Using existing ICTV criteria for defining potyvirus species, the narcissus viruses in the TuMV group were found to be from five species; the previously described NLSYV, and four new species we call narcissus virus 1 (NV-1) and narcissus yellow stripe-1 to -3 (NYSV-1, NYSV-2 and NYSV-3). However, as all are from a single host species, and natural recombinants with NV-1 and NYSV-3 'parents' have been found in China and In...
A recent proposal that the genus Rymovirus be assimilated into the genus Potyvirus is examined, discussed, and rejected. It illustrates the danger of using 'sequence identity' as a proxy for phylogenetic relatedness to distinguish closely related but distinct groups of viruses.
Plant viruses have important global impacts on crops, and identifying their centre and date of emergence is important for planning control measures. Turnip mosaic virus (TuMV) is a member of the genus Potyvirus in the family Potyviridae and is a major worldwide pathogen of brassica crops. For two decades, we have collected TuMV isolates, mostly from brassicas, in Turkey and neighbouring countries. This region is thought to be the centre of emergence of this virus. We determined the genomic sequences of 179 of these isolates and used these to estimate the timescale of the spread of this virus. Our Bayesian coalescent analyses used synonymous sites from a total of 417 novel and published whole-genome sequences. We conclude that TuMV probably originated from a virus of wild orchids in Germany and, while adapting to wild and domestic brassicas, spread via Southern Europe to Asia Minor no more than 700 years ago. The population of basal-B group TuMVs in Asia Minor is older than all other...
Potato virus Y (PVY) is a major pathogen of potatoes and other solanaceous crops worldwide. It is most closely related to potyviruses first or only found in the Americas, and it almost certainly originated in the Andes, where its hosts were domesticated. We have inferred the phylogeny of the published genomic sequences of 240 PVY isolates collected since 1938 worldwide, but not the Andes. All fall into five groupings, which mostly, but not exclusively, correspond with groupings already devised using biological and taxonomic data. Only 42 percent of the sequences are not recombinant, and all these fall into one or other of three phylogroups; the previously named C (common), O (ordinary), and N (necrotic) groups. There are also two other distinct groups of isolates all of which are recombinant; the R-1 isolates have N (5′ terminal minor) and O (major) parents, and the R-2 isolates have R-1 (major) and N (3′ terminal minor) parents. Many isolates also have additional minor intra- and i...
A phylogeny has been calculated by maximum likelihood comparisons of the concatenated consensus protein sequences of 29 tobamoviruses shown to be non-recombinant. This phylogeny has statistically significant support throughout, including its basal branches. The viruses form eight lineages that are congruent with the taxonomy of the hosts from which each was first isolated and, with the exception of three of the twenty-nine species, all fall into three clusters that have either asterid or rosid or caryophyllid hosts (i.e. the major subdivisions of eudicotyledonous plants). A modified Mantel permutation test showed that the patristic distances of virus and host phylogenies are significantly correlated, especially when the three anomalously placed viruses are removed. When the internal branches of the virus phylogeny were collapsed the congruence decreased. The simplest explanation of this congruence of the virus and host phylogenies is that most tobamovirus lineages have co-diverged with their primary plant hosts for more than 110 million years, and only the brassica-infecting lineage originated from a major host switch from asterids to rosids. Their co-divergence seems to have been 'fuzzy' rather than 'strict', permitting viruses to switch hosts within major host clades. Our conclusions support those of a coalescence analysis of tobamovirus sequences that used proxy node dating, but not a similar analysis of nucleotide sequences from dated samples, which concluded that the tobamoviruses originated only 100 thousand years ago.
Cucumber mosaic virus (CMV) is a damaging pathogen of over 200 mono- and dicotyledonous crop species worldwide. It has the broadest known host range of any virus, but the timescale of its evolution is unknown. To investigate the evolutionary history of this virus, we obtained the genomic sequences of 40 CMV isolates from brassicas sampled in Iran, Turkey and Japan, and combined them with published sequences. Our synonymous ('silent') site analyses revealed that the present CMV population is the progeny of a single ancestor existing 1550-2600 years ago, but that the population mostly radiated 295-545 years ago. We found that the major CMV lineages are not phylogeographically confined, but that recombination and reassortment are restricted to local populations and that no reassortant lineage is more than 251 years old. Our results highlight the different evolutionary patterns seen among viral pathogens of brassica crops across the world.
Background: Potyviruses are found worldwide, are spread by probing aphids and cause considerable crop damage. Potyvirus is one of the two largest plant virus genera and contains about 15% of all named plant virus species. When and why did the potyviruses become so numerous? Here we answer the first question and discuss the other. Methods and Findings: We have inferred the phylogenies of the partial coat protein gene sequences of about 50 potyviruses, and studied in detail the phylogenies of some using various methods and evolutionary models. Their phylogenies have been calibrated using historical isolation and outbreak events: the plum pox virus epidemic which swept through Europe in the 20th century, incursions of potyviruses into Australia after agriculture was established by European colonists, the likely transport of cowpea aphid-borne mosaic virus in cowpea seed from Africa to the Americas with the 16th century slave trade and the similar transport of papaya ringspot virus from India to the Americas. Conclusions/Significance: Our studies indicate that the partial coat protein genes of potyviruses have an evolutionary rate of about 1.15 × 10⁻⁴ nucleotide substitutions/site/year, and the initial radiation of the potyviruses occurred only about 6,600 years ago, and hence coincided with the dawn of agriculture. We discuss the ways in which agriculture may have triggered the prehistoric emergence of potyviruses and fostered their speciation.
Turnip mosaic virus (TuMV), a species of the genus Potyvirus, occurs worldwide. Seventy-six isolates of TuMV were collected from around the world, mostly from Brassica and Raphanus crops, but also from several non-brassica species. Host tests grouped the isolates into one or other of two pathotypes; Brassica (B) and Brassica-Raphanus (BR). The nucleotide sequences of the first protein (P1) and coat protein (CP) genes of the isolates were determined. One-tenth of the isolates were found to have anomalous and variable phylogenetic relationships as a result of recombination. The 5'-terminal 300 nt of the P1 gene of many isolates was also variable and phylogenetically anomalous, whereas the 380 nt 3' terminus of the CP gene was mostly conserved. Trees calculated from the remaining informative parts of the two genes of the non-recombinant sequences by neighbour-joining, maximum-likelihood and maximum-parsimony methods were closely similar, and so these parts of the sequences were...
The genetic structure of populations of Turnip mosaic virus in Eurasia was assessed by making host range and gene sequence comparisons of 142 isolates. Most isolates collected in West Eurasia infected Brassica plants whereas those from East Eurasia infected both Brassica and Raphanus plants. Analyses of recombination sites (RSs) in five regions of the genome (one third of the full sequence) showed that the protein 1 (P1) gene had recombined more frequently than the other gene regions in both subpopulations, but that the RSs were located in different parts of the genomes of the subpopulations. Estimates of nucleotide diversity showed that the West Eurasian subpopulation was more diverse than the East Eurasian subpopulation, but the Asian-BR group of the genes from the latter subpopulation had a greater nonsynonymous/synonymous substitution ratio, especially in the P1, viral genome-linked protein (VPg) and nuclear inclusion a proteinase (NIa-Pro) genes. These subpopulations seem to ha...
Turnip mosaic virus (TuMV) is a potyvirus that is transmitted by aphids and infects a wide range of plant species. We investigated the evolution of this pathogen by collecting 32 isolates of TuMV, mostly from Brassicaceae plants, in Australia and in New Zealand. We performed a variety of sequence-based phylogenetic and population genetic analyses of the complete genomic sequences and of three non-recombinogenic regions of those sequences. The substitution rates, divergence times, and phylogeographic patterns of the virus populations were estimated. Six inter- and five intralineage recombination type patterns were found in the genomes of the Australian and New Zealand isolates, and all were novel. Only one recombination type pattern has been found in both countries. Australian and New Zealand populations were genetically different, and were different from the European and Asian populations. Our Bayesian coalescent analyses, based on a combination of novel and published sequence data ...
Proceedings of the National Academy of Sciences, 1992
Many protein families are common to all cellular organisms, indicating that many genes have ancient origins. Genetic variation is mostly attributed to processes such as mutation, duplication, and rearrangement of ancient modules. Thus it is widely assumed that much of present-day genetic diversity can be traced by common ancestry to a molecular "big bang." A rarely considered alternative is that proteins may arise continuously de novo. One mechanism of generating different coding sequences is by "overprinting," in which an existing nucleotide sequence is translated de novo in a different reading frame or from noncoding open reading frames. The clearest evidence for overprinting is provided when the original gene function is retained, as in overlapping genes. Analysis of their phylogenies indicates which are the original genes and which are their informationally novel partners. We report here the phylogenetic relationships of overlapping coding sequences from ster...
Proceedings of the National Academy of Sciences, 1995
Immunology. In the article "Analysis of patterns of DNA sequence variation in flanking and coding regions of murine germ-line immunoglobulin heavy-chain variable genes: Evolutionary implications" by
The family Apiaceae comprises approximately 3700 species of herbaceous plants, including importan... more The family Apiaceae comprises approximately 3700 species of herbaceous plants, including important crops, aromatic herbs and field weeds. Here we report a study of 10 preserved historical or recent virus samples of apiaceous plants collected in the United Kingdom (UK) import interceptions from the Mediterranean region (Egypt, Israel and Cyprus) or during surveys of Australian apiaceous crops. Seven complete new genomic sequences and one partial sequence, of the apiaceous potyviruses apium virus Y (ApVY), carrot thin leaf virus (CaTLV), carrot virus Y (CarVY) and celery mosaic virus (CeMV) were obtained. When these 7 and 16 earlier complete non-recombinant apiaceous potyvirus sequences were subjected to phylogenetic analyses, they split into 2 separate lineages: 1 containing ApVY, CeMV, CarVY and panax virus Y and the other CaTLV, ashitabi mosaic virus and konjac virus Y. Preliminary dating analysis suggested the CarVY population first diverged from CeMV and ApVY in the 17th century ...
Potato virus X (PVX) occurs worldwide and causes an important potato disease. Complete PVX genome... more Potato virus X (PVX) occurs worldwide and causes an important potato disease. Complete PVX genomes were obtained from 326 new isolates from Peru, which is within the potato crop′s main domestication center, 10 from historical PVX isolates from the Andes (Bolivia, Peru) or Europe (UK), and three from Africa (Burundi). Concatenated open reading frames (ORFs) from these genomes plus 49 published genomic sequences were analyzed. Only 18 of them were recombinants, 17 of them Peruvian. A phylogeny of the non-recombinant sequences found two major (I, II) and five minor (I-1, I-2, II-1, II-2, II-3) phylogroups, which included 12 statistically supported clusters. Analysis of 488 coat protein (CP) gene sequences, including 128 published previously, gave a completely congruent phylogeny. Among the minor phylogroups, I-2 and II-3 only contained Andean isolates, I-1 and II-2 were of both Andean and other isolates, but all of the three II-1 isolates were European. I-1, I-2, II-1 and II-2 all cont...
Potato virus V (PVV) causes a disease of potato (Solanum tubersosum) in South and Central America... more Potato virus V (PVV) causes a disease of potato (Solanum tubersosum) in South and Central America, Europe, and the Middle East. We report here the complete genomic sequences of 42 new PVV isolates from the potato’s Andean domestication center in Peru and of eight historical or recent isolates from Europe. When the principal open reading frames of these genomic sequences together with those of nine previously published genomic sequences were analyzed, only two from Peru and one from Iran were found to be recombinant. The phylogeny of the 56 nonrecombinant open reading frame sequences showed that the PVV population had two major phylogroups, one of which formed three minor phylogroups (A1 to A3) of isolates, all of which are found only in the Andean region of South America (Peru and Colombia), and the other formed two minor phylogroups, a basal one of Andean isolates (A4) that is paraphyletic to a crown cluster containing all the isolates found outside South America (World). This sugg...
Forty-seven potato virus A (PVA) isolates from Europe, Australia, and South America’s Andean regi... more Forty-seven potato virus A (PVA) isolates from Europe, Australia, and South America’s Andean region were subjected to high-throughput sequencing, and 46 complete genomes from Europe (n = 9), Australia (n = 2), and the Andes (n = 35) obtained. These and 17 other genomes gave alignments of 63 open reading frames 9,180 nucleotides long; 9 were recombinants. The nonrecombinants formed three tightly clustered, almost equidistant phylogroups; A comprised 14 Peruvian potato isolates; W comprised 37 from potato in Peru, Argentina, and elsewhere in the world; and T contained three from tamarillo in New Zealand. When five isolates were inoculated to a potato cultivar differential, three strain groups (= pathotypes) unrelated to phylogenetic groupings were recognized. No temporal signal was detected among the dated nonrecombinant sequences, but PVA and potato virus Y (PVY) are from related lineages and ecologically similar; therefore, “relative dating” was obtained using a single maximum-likel...
In this review, encouraged by the dictum of Theodosius Dobzhansky that “Nothing in biology makes ... more In this review, encouraged by the dictum of Theodosius Dobzhansky that “Nothing in biology makes sense except in the light of evolution”, we outline the likely evolutionary pathways that have resulted in the observed similarities and differences of the extant molecules, biology, distribution, etc. of the potyvirids and, especially, its largest genus, the potyviruses. The potyvirids are a family of plant-infecting RNA-genome viruses. They had a single polyphyletic origin, and all share at least three of their genes (i.e., the helicase region of their CI protein, the RdRp region of their NIb protein and their coat protein) with other viruses which are otherwise unrelated. Potyvirids fall into 11 genera of which the potyviruses, the largest, include more than 150 distinct viruses found worldwide. The first potyvirus probably originated 15,000–30,000 years ago, in a Eurasian grass host, by acquiring crucial changes to its coat protein and HC-Pro protein, which enabled it to be transmitt...
Potato virus Y (PVY) causes disease in potatoes and other solanaceous crops. The appearance of it... more Potato virus Y (PVY) causes disease in potatoes and other solanaceous crops. The appearance of its necrogenic strains in the 1980s made it the most economically important virus of potatoes. We report the isolation and genomic sequences of 32 Peruvian isolates of PVY which, together with 428 published PVY genomic sequences, gave an alignment of 460 sequences. Of these 190 (41%) were non-recombinant, and 162 of these provided a dated phylogeny, that corresponds well with the likely history of PVY, and show that PVY originated in South America which is where potatoes were first domesticated. The most basal divergences of the PVY population produced the N and C: O phylogroups; the origin of the N phylogroup is clearly Andean, but that of the O and C phylogroups is unknown, although they may have been first to establish in European crops. The current PVY population originated around 156 CE. PVY was probably first taken from South America to Europe in the 16th century in tubers. Most of t...
In 1976, a virus with flexuous, filamentous virions typical of the family Potyviridae was isolate... more In 1976, a virus with flexuous, filamentous virions typical of the family Potyviridae was isolated from symptomatic pepino (Solanum muricatum) plants growing in two valleys in Peru’s coastal desert region. In 2014, a virus with similar-shaped virions was isolated from asymptomatic fruits obtained from pepino plants growing in six coastal valleys and a valley in Peru’s Andean highlands. Both were identified subsequently as Wild potato mosaic virus (WPMV) by serology or high-throughput sequencing (HTS). The symptoms caused by two old and seven new isolates from pepino were examined in indicator plants. Infected solanaceous hosts varied considerably in their sensitivities to infection and individual isolates varied greatly in virulence. All seven new isolates caused quick death of infected Nicotiana benthamiana plants and more than half of them killed infected plants of Physalis floridana and S. chancayense. These three species were the most sensitive to infection. The most virulent is...
Narcissus plants (Narcissus tazetta var. chinensis) showing mosaic or striping leaves were collec... more Narcissus plants (Narcissus tazetta var. chinensis) showing mosaic or striping leaves were collected from around Japan, and tested for virus infections using potyvirus-specific primers. Many were found to be infected with a macluravirus and mixtures of different potyviruses, one third of them narcissus yellow stripe virus (NYSV)-like viruses. Genomes of nine of the NYSV-like viruses were sequenced and, together with four already published, provided data for phylogenetic and pairwise identity analyses of their place in the turnip mosaic virus (TuMV) phylogenetic group. Using existing ICTV criteria for defining potyvirus species, the narcissus viruses in TuMV group were found to be from five species; the previously described NLSYV, and four new species we call narcissus virus 1 (NV-1) and narcissus yellow stripe-1 to -3 (NYSV-1, NYSV-2 and NYSV-3). However, as all are from a single host species, and natural recombinants with NV-1 and NYSV-3 'parents have been found in China and In...
A recent proposal that the genus Rymovirus be assimilated into the genus Potyvirus is examined, d... more A recent proposal that the genus Rymovirus be assimilated into the genus Potyvirus is examined, discussed, and rejected. It illustrates the danger of using 'sequence identity' as a proxy for phylogenetic relatedness to distinguish closely related but distinct groups of viruses.
Plant viruses have important global impacts on crops, and identifying their centre and date of em... more Plant viruses have important global impacts on crops, and identifying their centre and date of emergence is important for planning control measures. Turnip mosaic virus (TuMV) is a member of the genus Potyvirus in the family Potyviridae and is a major worldwide pathogen of brassica crops. For two decades, we have collected TuMV isolates, mostly from brassicas, in Turkey and neighbouring countries. This region is thought to be the centre of emergence of this virus. We determined the genomic sequences of 179 of these isolates and used these to estimate the timescale of the spread of this virus. Our Bayesian coalescent analyses used synonymous sites from a total of 417 novel and published whole-genome sequences. We conclude that TuMV probably originated from a virus of wild orchids in Germany and, while adapting to wild and domestic brassicas, spread via Southern Europe to Asia Minor no more than 700 years ago. The population of basal-B group TuMVs in Asia Minor is older than all other...
Potato virus Y (PVY) is a major pathogen of potatoes and other solanaceous crops worldwide. It is... more Potato virus Y (PVY) is a major pathogen of potatoes and other solanaceous crops worldwide. It is most closely related to potyviruses first or only found in the Americas, and it almost certainly originated in the Andes, where its hosts were domesticated. We have inferred the phylogeny of the published genomic sequences of 240 PVY isolates collected since 1938 worldwide, but not the Andes. All fall into five groupings, which mostly, but not exclusively, correspond with groupings already devised using biological and taxonomic data. Only 42 percent of the sequences are not recombinant, and all these fall into one or other of three phylogroups; the previously named C (common), O (ordinary), and N (necrotic) groups. There are also two other distinct groups of isolates all of which are recombinant; the R-1 isolates have N (5′ terminal minor) and O (major) parents, and the R-2 isolates have R-1 (major) and N (3′ terminal minor) parents. Many isolates also have additional minor intra- and i...
A phylogeny has been calculated by maximum likelihood comparisons of the concatenated consensus p... more A phylogeny has been calculated by maximum likelihood comparisons of the concatenated consensus protein sequences of 29 tobamoviruses shown to be non-recombinant. This phylogeny has statistically significant support throughout, including its basal branches. The viruses form eight lineages that are congruent with the taxonomy of the hosts from which each was first isolated and, with the exception of three of the twenty-nine species, all fall into three clusters that have either asterid or rosid or caryophyllid hosts (i.e. the major subdivisions of eudicotyledonous plants). A modified Mantel permutation test showed that the patristic distances of virus and host phylogenies are significantly correlated, especially when the three anomalously placed viruses are removed. When the internal branches of the virus phylogeny were collapsed the congruence decreased. The simplest explanation of this congruence of the virus and host phylogenies is that most tobamovirus lineages have co-diverged with their primary plant hosts for more than 110 million years, and only the brassica-infecting lineage originated from a major host switch from asterids to rosids. Their co-divergence seems to have been 'fuzzy' rather than 'strict', permitting viruses to switch hosts within major host clades. Our conclusions support those of a coalesence analysis of tobamovirus sequences, that used proxy node dating, but not a similar analysis of nucleotide sequences from dated samples, which concluded that the tobamoviruses originated only 100 thousand years ago.
Cucumber mosaic virus (CMV) is a damaging pathogen of over 200 mono- and dicotyledonous crop species worldwide. It has the broadest known host range of any virus, but the timescale of its evolution is unknown. To investigate the evolutionary history of this virus, we obtained the genomic sequences of 40 CMV isolates from brassicas sampled in Iran, Turkey and Japan, and combined them with published sequences. Our synonymous ('silent') site analyses revealed that the present CMV population is the progeny of a single ancestor existing 1550-2600 years ago, but that the population mostly radiated 295-545 years ago. We found that the major CMV lineages are not phylogeographically confined, but that recombination and reassortment are restricted to local populations and that no reassortant lineage is more than 251 years old. Our results highlight the different evolutionary patterns seen among viral pathogens of brassica crops across the world.
Background: Potyviruses are found worldwide, are spread by probing aphids and cause considerable crop damage. Potyvirus is one of the two largest plant virus genera and contains about 15% of all named plant virus species. When and why did the potyviruses become so numerous? Here we answer the first question and discuss the other. Methods and Findings: We have inferred the phylogenies of the partial coat protein gene sequences of about 50 potyviruses, and studied in detail the phylogenies of some using various methods and evolutionary models. Their phylogenies have been calibrated using historical isolation and outbreak events: the plum pox virus epidemic which swept through Europe in the 20th century, incursions of potyviruses into Australia after agriculture was established by European colonists, the likely transport of cowpea aphid-borne mosaic virus in cowpea seed from Africa to the Americas with the 16th century slave trade and the similar transport of papaya ringspot virus from India to the Americas. Conclusions/Significance: Our studies indicate that the partial coat protein genes of potyviruses have an evolutionary rate of about 1.15 × 10⁻⁴ nucleotide substitutions/site/year, and the initial radiation of the potyviruses occurred only about 6,600 years ago, and hence coincided with the dawn of agriculture. We discuss the ways in which agriculture may have triggered the prehistoric emergence of potyviruses and fostered their speciation.
Turnip mosaic virus (TuMV), a species of the genus Potyvirus, occurs worldwide. Seventy-six isolates of TuMV were collected from around the world, mostly from Brassica and Raphanus crops, but also from several non-brassica species. Host tests grouped the isolates into one or other of two pathotypes; Brassica (B) and Brassica-Raphanus (BR). The nucleotide sequences of the first protein (P1) and coat protein (CP) genes of the isolates were determined. One-tenth of the isolates were found to have anomalous and variable phylogenetic relationships as a result of recombination. The 5'-terminal 300 nt of the P1 gene of many isolates was also variable and phylogenetically anomalous, whereas the 380 nt 3' terminus of the CP gene was mostly conserved. Trees calculated from the remaining informative parts of the two genes of the non-recombinant sequences by neighbour-joining, maximum-likelihood and maximum-parsimony methods were closely similar, and so these parts of the sequences were...
The genetic structure of populations of Turnip mosaic virus in Eurasia was assessed by making host range and gene sequence comparisons of 142 isolates. Most isolates collected in West Eurasia infected Brassica plants whereas those from East Eurasia infected both Brassica and Raphanus plants. Analyses of recombination sites (RSs) in five regions of the genome (one third of the full sequence) showed that the protein 1 (P1 gene) had recombined more frequently than the other gene regions in both subpopulations, but that the RSs were located in different parts of the genomes of the subpopulations. Estimates of nucleotide diversity showed that the West Eurasian subpopulation was more diverse than the East Eurasian subpopulation, but the Asian-BR group of the genes from the latter subpopulation had a greater nonsynonymous/synonymous substitution ratio, especially in the P1, viral genome-linked protein (VPg) and nuclear inclusion a proteinase (NIa-Pro) genes. These subpopulations seem to ha...
Turnip mosaic virus (TuMV) is a potyvirus that is transmitted by aphids and infects a wide range of plant species. We investigated the evolution of this pathogen by collecting 32 isolates of TuMV, mostly from Brassicaceae plants, in Australia and in New Zealand. We performed a variety of sequence-based phylogenetic and population genetic analyses of the complete genomic sequences and of three non-recombinogenic regions of those sequences. The substitution rates, divergence times, and phylogeographic patterns of the virus populations were estimated. Six inter- and five intralineage recombination type patterns were found in the genomes of the Australian and New Zealand isolates, and all were novel. Only one recombination type pattern has been found in both countries. Australian and New Zealand populations were genetically different, and were different from the European and Asian populations. Our Bayesian coalescent analyses, based on a combination of novel and published sequence data ...
Proceedings of the National Academy of Sciences, 1992
Many protein families are common to all cellular organisms, indicating that many genes have ancient origins. Genetic variation is mostly attributed to processes such as mutation, duplication, and rearrangement of ancient modules. Thus it is widely assumed that much of present-day genetic diversity can be traced by common ancestry to a molecular "big bang." A rarely considered alternative is that proteins may arise continuously de novo. One mechanism of generating different coding sequences is by "overprinting," in which an existing nucleotide sequence is translated de novo in a different reading frame or from noncoding open reading frames. The clearest evidence for overprinting is provided when the original gene function is retained, as in overlapping genes. Analysis of their phylogenies indicates which are the original genes and which are their informationally novel partners. We report here the phylogenetic relationships of overlapping coding sequences from ster...
Proceedings of the National Academy of Sciences, 1995
Immunology. In the article "Analysis of patterns of DNA sequence variation in flanking and coding regions of murine germ-line immunoglobulin heavy-chain variable genes: Evolutionary implications" by
Papers by Adrian Gibbs