Cancergastrico

Download as pdf or txt
Download as pdf or txt
You are on page 1of 10

BMC Genomics BioMed Central

Research article Open Access


Genomes of Helicobacter pylori from native Peruvians suggest
admixture of ancestral and modern lineages and reveal a western
type cag-pathogenicity island
S Manjulata Devi†1, Irshad Ahmed†2,3, Aleem A Khan2, Syed Asad Rahman4,
Ayesha Alvi1, Leonardo A Sechi5 and Niyaz Ahmed*1,5

Address: 1Pathogen Evolution Group, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, India, 2Centre for Liver Research and
Diagnostics, Deccan College of Medical Sciences and allied hospitals, Hyderabad, India, 3Department of Microbiology, Shri Shivaji College of Arts,
Commerce and Science (SGB Amravati University), Akola, MS, India, 4Cologne University Bioinformatics Centre, Cologne, Germany and
5ISOGEM Collaborative Network on Genetics of Helicobacters, The International Society for Genomic and Evolutionary Microbiology, University

of Sassari, Sassari, Italy


Email: S Manjulata Devi - [email protected]; Irshad Ahmed - [email protected]; Aleem A Khan - [email protected];
Syed Asad Rahman - [email protected]; Ayesha Alvi - [email protected]; Leonardo A Sechi - [email protected];
Niyaz Ahmed* - [email protected]
* Corresponding author †Equal contributors

Published: 27 July 2006 Received: 31 March 2006


Accepted: 27 July 2006
BMC Genomics 2006, 7:191 doi:10.1186/1471-2164-7-191
This article is available from: http://www.biomedcentral.com/1471-2164/7/191
© 2006 Devi et al; licensee BioMed Central Ltd.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0),
which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract
Background: Helicobacter pylori is presumed to be co-evolved with its human host and is a highly
diverse gastric pathogen at genetic levels. Ancient origins of H. pylori in the New World are still
debatable. It is not clear how different waves of human migrations in South America contributed
to the evolution of strain diversity of H. pylori. The objective of our 'phylogeographic' study was to
gain fresh insights into these issues through mapping genetic origins of H. pylori of native Peruvians
(of Amerindian ancestry) and their genomic comparison with isolates from Spain, and Japan.
Results: For this purpose, we attempted to dissect genetic identity of strains by fluorescent
amplified fragment length polymorphism (FAFLP) analysis, multilocus sequence typing (MLST) of
the 7 housekeeping genes (atpA, efp, ureI, ppa, mutY, trpC, yphC) and the sequence analyses of the
babB adhesin and oipA genes. The whole cag pathogenicity-island (cagPAI) from these strains was
analyzed using PCR and the geographic type of cagA phosphorylation motif EPIYA was determined
by gene sequencing. We observed that while European genotype (hp-Europe) predominates in
native Peruvian strains, approximately 20% of these strains represent a sub-population with an
Amerindian ancestry (hsp-Amerind). All of these strains however, irrespective of their ancestral
affiliation harbored a complete, 'western' type cagPAI and the motifs surrounding it. This indicates
a possible acquisition of cagPAI by the hsp-Amerind strains from the European strains, during
decades of co-colonization.
Conclusion: Our observations suggest presence of ancestral H. pylori (hsp-Amerind) in Peruvian
Amerindians which possibly managed to survive and compete against the Spanish strains that
arrived to the New World about 500 years ago. We suggest that this might have happened after
native Peruvian H. pylori strains acquired cagPAI sequences, either by new acquisition in cag-
negative strains or by recombination in cag positive Amerindian strains.

Page 1 of 10
(page number not for citation purposes)
BMC Genomics 2006, 7:191 http://www.biomedcentral.com/1471-2164/7/191

Background We attempted to dissect gene pool diversity of Amerin-


Helicobacter pylori is a Gram-negative bacterium that estab- dian isolates of H. pylori from Peru with an objective to
lished itself in the human stomach possibly thousands of explore ancient and modern features of the Peruvian H.
years ago [1]. This opportunistic pathogen infects over pylori strains corresponding to different waves of human
50% of the worlds' population, causing no harm to most migration. We also looked if it is possible to link some of
colonized people [2]. Only a small subset of infected peo- the native Peruvian strains to their ancestors in Asia.
ple experience H. pylori-associated illnesses such as
chronic gastritis, peptic ulcer disease, gastric carcinoma, Results
and mucosa-associated lymphoid tissue (MALT) lym- FAFLP based genotyping, candidate gene sequence
phoma. Associations of various clinical outcomes with analysis and multi-locus sequencing
disease-specific virulence factors remain dogmatic [3] Phylogenetic relationships assessed by FAFLP genotyping
years after the completion of genome sequences [4]. The of Peruvian isolates (Figure 1A) revealed 2 different line-
debate has been further intensified as some studies have ages corresponding to the hp-Europe and hsp-Amerind.
posed the possibility that H. pylori infection has some pro- Representative isolates from both the lineages obtained
tective effects in esophageal diseases [3]. Also, possible by FAFLP were subjected to reconfirmation by MLST anal-
symbiotic associations have been proposed based on the ysis. We observed that all the isolates corresponding to the
finding that H. pylori harbor protective, bacteriocin like Amerind type FAFLP profile were found to be genetically
effect and may therefore be beneficial to its host [5]. closer to strains from Alaska and were therefore, accepted
as genotype hsp-Amerind. All the other isolates from Peru
Subsequent to the decipherment of the potential of poly- clustered with those from Spain and therefore joined
morphic DNA markers in reconstruction of human migra- MLST genotype hp-Europe.
tion and phylogeography [6,7], pathogen genotypes were
successfully used in tracking and analyzing patterns of We looked at the geographic signatures of oipA gene
human migrations [8-10] in different continents. sequences in Peruvian isolates and found that 7 of the 27
Recently, sequence variation in H. pylori has provided a isolates including the 2 hsp-Amerind isolates had an East-
window into human population migration [11] and also ern type oipA (CT < 6) signature; 4 of these had a non func-
revealed that impact of religions on stratification of tional gene (frame out) (Table 1). Nineteen of the
human ethnic groups can be analyzed based on H. pylori Peruvian isolates had a western type oipA (CT ≥ 6), 15 of
haplotypes [12]. which had a functionally intact coding frame. It appears
that oipA sequence conveys the European ancestry for a
Ancient origins and dissemination of H. pylori are quite majority of these isolates, including some hsp-Amerind,
debatable in the context of the vast South American con- however, oipA is a virulence linked gene and its resolution
tinent that has witnessed many different waves of popula- power for lineage identification is not as robust as that of
tion migration [13], especially in view of the fact that H. the 7 housekeeping genes we used, or the babB gene. It is
pylori has been present in this continent since pre-Colum- likely that oipA might have been exchanged or recom-
bian times [14]. However, evolution of virulence and fit- bined between different lineages (like most of the other
ness in such 'ancient' strains that arrived first in the virulence genes such as the cagPAI) to give hybrid geno-
Americas and then, possibly out-competed by the influx types.
of 'modern' strains from Europe [14] remains largely
unexplored. The babB gene sequences of our hsp-Amerind isolates were
compared to the sequences previously reported for Amer-
A landmark study based on PCR based DNA motif analy- indian isolates from Venezuela by Ghose et al. [16] by
sis proposed that H. pylori jumped recently from animals multiple sequence alignment and phylogenetic analysis
to humans and, therefore, the acquisition of H. pylori by (Figure 1B). These phylogenies, when studied in the light
humans may be a recent phenomenon [15]. This study of MLST based inferences, revealed distinct affinities
has been the basis for the idea of 'H. pylori free New World' between Peruvian Amerindian strains and Venezuelan
[15]. However, two independent studies based on large- Amerind strains, pointing thus to their common Asian
scale analyses of candidate gene polymorphisms con- origins (Figure 1B).
trasted the idea of recent acquisition and suggest that H.
pylori might have co-evolved with humans [11,16]. In In summary, our overall phylogenetic analyses, demon-
view of these intriguing ideas on ancient origin of H. strated that about 20% of the strains from native Peruvian
pylori, additional evidences based on strains from different patients that we examined share common genotypic pat-
geographical regions (especially those with a rich history terns with Asian (Amerind) strains and therefore, Ancient
of multiple waves of human migrations such as the South Asian gene pool must have been the origin of the present
Americas) are clearly needed. day strains inhabiting the native Peruvians.

Page 2 of 10
(page number not for citation purposes)
BMC Genomics 2006, 7:191 http://www.biomedcentral.com/1471-2164/7/191

SJM18
0
A

SJM
80
SJM19
SJM41
SJ

14
M2

M
SJM37
SJ
7
SJ

SJ
M5

M2
3

0
SJ 1 38
SJM M1 M2 SJM
148 SJ
Hp-Europe SJM18
SJM189

3
184 M6 SJM
57
SJ
SJM S
JM SJ
45 M9
3
50

SJ
M

M
SJ

69
SJM

SJM
SJM23
SJ

SJ
10

12
M

M8
92

3
0.0 1

hsp-Amerind

B
Modern branch
VC72

24
VP
95
VC

3
VC9
4
VC9

SJM83

SJM92 V P43
VP
25

VP49

19
VP

0.01
Amerind branch

A.
Figure
FAFLP 1 analysis of H. pylori strains analyzed from native Peruvians (n = 27)
A. FAFLP analysis of H. pylori strains analyzed from native Peruvians (n = 27). The phylogenetic tree was developed based on
various amplitypes generated for individual isolates after allele scoring and generation of similarity profiles in the form of binary
tables. Genetic relationships in the form of a tree were deduced using MEGA 3.0 software using bootstrapping method at
10000 bootstrap trials. Two different lineages observed in the tree are colored as per the previous conventions [11, 20]. Six
isolates from among those represented in the FAFLP tree (highlighted with circles) were also analyzed subsequently by MLST.
B. Phylogenetic analysis of representative hsp-Amerind sequences from our isolates (SJM 83, 92) and those of other Amerind
and non-Amerind (western) H. pylori sequences previously described by Ghose et al. from Caracas and Puerto Ayacucho in
Venezuela [16]. SJM 83 and 92 formed a separate branch (maroon color) only with Amerind isolates (right). Both the Amerin-
dian and western lineages (green color) are differentially colored as per previous conventions [11, 20]. Sequence alignment
(left) of these Amerind sequences revealed significant conservation at the level of nucleotides.

Page 3 of 10
(page number not for citation purposes)
Table 1: Characteristics of 26 Peruvian H. pylori isolates obtained from Amerindians and distribution, presence/absence and or rearrangement of various gene loci in their
genomes; 0 – region not amplified, 1 – region present.

Page 4 of 10
(page number not for citation purposes)
http://www.biomedcentral.com/1471-2164/7/191

Strains Diagnostic PCR Phylogenetic placement oipA status cagA genotype cag-PAI spanning PCR

cagA GlmM FAFLP/MLST babB oipA No. of CT Frame gene EPIYA motif cagAP1 cagAP2 cagA1 cagA2 cagE cagT LEC1 LEC2

A) Native Peruvian Strains

SJM184 1 1 hp Europe Modern Eastern 5 Out 1 EPIYA A, B, C 1 1 1 1 1 1 1 1


SJM80 1 1 hp Europe Modern Western 6 Out 1 EPIYA A, B, C 1 1 0 1 1 1 1 1
SJM83 1 1 hsp Amerind Amerind Western 6 Out 1 EPIYA A, B, C 1 1 1 1 1 1 1 1
SJM92 1 1 hsp Amerind Amerind Eastern 3 In 1 EPIYA A, B, C 1 1 1 1 1 1 1 1
SJM93 1 1 hp Europe Modern Western 6 In 1 EPIYA A, B, C 1 1 0 1 1 1 1 0
SJM10 1 1 hsp Amerind Amerind Western 6 In 1 EPIYA A, B, C 1 1 0 1 1 1 1 1
SJM12 1 1 hsp Amerind Amerind Eastern 4 In 1 EPIYA A, B, C 1 1 0 1 1 1 1 1
SJM148 1 1 hp Europe Modern Western 6 In 1 EPIYA A, B, C 1 1 1 1 1 1 1 1
SJM14 1 1 hp Europe Modern Western 7 Out 1 EPIYA A, B, C 1 1 0 1 1 1 1 1
SJM179 1 1 hp Europe Modern Western 6 In 1 EPIYA A, B, C 1 1 1 1 1 1 1 1
SJM180 1 1 hp Europe Modern Western 6 In 1 EPIYA A, B, C 1 1 0 1 1 1 1 1
SJM189 1 1 hp Europe Modern Western 6 In 1 EPIYA A, B, C 1 0 1 1 1 1 1 1
SJM18 1 1 hp Europe Modern Eastern 4 In 1 EPIYA A, B, C 1 1 0 1 1 1 1 1
SJM19 1 1 hp Europe Modern Eastern 4 Out 1 EPIYA A, B, C, 1 0 1 1 1 1 1 1
C
SJM1 0 1 hp Europe Modern Western 6 In 1 EPIYA A, B, C 0 0 0 1 1 1 1 1
SJM20 1 1 hp Europe Modern Western 6 In 1 EPIYA A, B, C 1 1 1 1 1 1 1 1
SJM21 1 1 hp Europe Modern Western 6 In 1 EPIYA A, B, C 1 1 0 1 1 1 1 1
SJM23 1 1 hsp Amerind Amerind Western 6 In 1 EPIYA A, B, C 1 1 0 1 1 1 1 1
SJM37 1 1 hp Europe Modern Western 6 In 1 EPIYA A, B, C 1 1 1 1 1 0 1 1
SJM38 1 1 hp Europe Modern Western 6 In 1 EPIYA A, B, C 1 0 1 1 1 0 1 1
SJM41 1 1 hp Europe/ Modern Eastern 2 Out 1 EPIYA A, B, C 1 1 1 1 1 1 1 1
WAfrica
SJM45 1 1 hp Europe Modern Western 8 In 1 EPIYA A, B, C, 1 1 1 1 1 1 1 1
C
SJM50 1 1 hp Europe Modern Eastern 5 Out 1 EPIYA A, B, C, 1 1 1 1 0 1 1 1
C
SJM53 0 1 hp Europe Modern Western 6 In 1 EPIYA A, B, C 0 0 1 1 1 1 1 1
SJM63 1 1 hp Europe Modern Western 6 Out 1 EPIYA A, B, C 1 1 0 1 1 1 1 1
SJM69 1 1 hp Europe Modern Western 6 In 1 EPIYA A, B, C 1 1 0 1 1 1 1 1
BMC Genomics 2006, 7:191

B) Other Strains

Spain 1 1 hp Europe Modern Western 9 In 1 EPIYA A, B, C, 1 0 1 1 1 0 1 1


(HUPB7 C
6)
Japan 1 1 hp East Asia East Asia Eastern 4 In 1 EPIYA A, B, D 1 1 1 1 1 1 1 1
(HU178)
Africa 1 1 hp S Africa Modern Western 6 In 1 EPIYA A, B, C 0 1 1 1 1 1 1 1
(10)
India 1 1 hp Europe Modern Western 6 In 1 EPIYA A, B, C 1 1 1 1 1 1 0 0
(MS8)
BMC Genomics 2006, 7:191 http://www.biomedcentral.com/1471-2164/7/191

Analysis of the cagPAI and the cagA gene in isolates from in Asia. This supports the hypothesis that H. pylori was
Spain, Peru and Japan associated with its host well before Asian people crossed
Overlapping primer amplification strategy to span entire the Bering strait (20,000 years BP) to colonize America.
cagPAI worked very well with our hsp-Amerind isolates We also report that the cagPAI was acquired by native
(Figure 3A) where all the constituent genes of the PAI were Peruvian strains probably from a European source. This
successfully amplified. This indicates that the 5 Amerind might have occurred as a single import, most probably
strains we looked at represent a 'chimera genome' made during the last 500 years of the spread of H. pylori in the
up of an ancient like core genome component (MLST and South American continent. It is evident from the fact that
babB typing) and a modern type flexible genome compo- the Amerindian Peruvian strains, though of an Asian
nent (cagPAI and its right junction typing). Tyrosine phos- descent, do not share characteristic features of Asian cag-
phorylation of the immunodominant cagA protein is PAIs but show homology to the PAIs of European strains.
known to occur at the EPIYA motifs at the C-terminus by Alternatively, the Amerindian strains might have gained
the SRC family of kinases [17,18]. This EPIYA motif con- the PAI through a series of recombination events over a
sists of 4 distinct EPIYA sites – EPIYA-A, -B, -C and -D, period of time. But this hypothesis appears weak when we
based on the amino acid sequence that neighbor it (Figure take into account the short time span of 500 years within
3B). Based on the presence of these EPIYA motifs, cagA which the European strains spread in the Americas.
can be distinguished into the Western type (W) in case the
-C site is present and the East Asian (EA) type in case We tried to potentate these ideas and to provide evidence
where the -C site is replaced by the -D site. Our data for the in favor of the ancient origin of the pathogen and the pos-
type of EPIYA motifs present in Spanish (n = 6), Peruvian sibility that some extraneous gene cassettes might have
(n = 27), and Japanese (n = 16) strains revealed that the been acquired by otherwise symbiotic H. pylori, some-
western type of cagA was predominant among all the times during its natural history. To support this proposi-
Spanish and Peruvian strains. The Japanese isolates alone tion, our methodology was targeted with a two pronged
revealed East Asian type EPIYA D signatures (Figure 3B). approach to i) further substantiate ancient link of the
pathogen and ii) to prove that the pathogenicity island
Results of the PCR of cagPAI from 26 Peruvian isolates was a 'recent' addition to the genome of H. pylori. Our
studied in the present context have been depicted in Table analyses based on FAFLP and MLST linked about 20% of
1. Briefly, the strains from Peru and Spain did not readily the Peruvian isolates to Amerindian genotype, and con-
amplify regions of the cagPAI as did Japanese isolates, sug- veyed that H. pylori was most probably introduced to the
gesting thereby a distinct allelic diversity at the primer New World by Asian people. We, therefore, disagree with
binding sites in the western PAI of these isolates. This the idea of an 'H. pylori free New World' [15]. This disa-
observation places native Peruvian strains closer to Span- greement of interpretation arose possibly because Kersu-
ish strains and therefore hints that both the Spanish and lyte et al. [15] looked at only a few loci in the genome and
Peruvian strains harbor a similar type of cagPAI. stressed mainly on the motifs surrounding the cagPAI on
its right junction.
The right junction of the cagPAI also revealed similar
acquaintances based on PCR based insertion deletion and We also looked at the cagPAI of such strains and found
substitution analysis of the region spanning cagA right that the Peruvian isolates we tested carried western type
junction to glr, for the 6 Peruvian isolates we analyzed. (EPIYA C) cag islands (Figure 3B). We did not record any
Such genotypes for all the isolates we used from Spain, eastern type signature in the cagPAI (EPIYA D motif) in
Japan and Peru were determined earlier by Kersulyte et al Peruvian isolates, given a distinct presence of ethnic Japa-
[15]. Collectively, this study and the previous observa- nese in Peru. This inference also came from MLST data
tions [15] demonstrated that the cag right junction motif (Figure 2), where none of the Peruvian isolates clustered
types were shared by Spanish and Peruvian isolates and with Japanese genotype hp-EastAsia (Figure 2). Similarly,
that Japanese isolates did not share genetic affinities with the absence of Asian/Amerindian type islands (or their
Peruvian strains. This observation again places native remnants in native Peruvian isolates we analyzed, leads us
Peruvian strains closer to Spanish strains and therefore to speculate that their ancestors in Asia were seemingly
hints that both the Spanish and Peruvian strains share cag- benign due to (natural?) absence of functional cag genes.
PAI insertion sites and the regions flanking them. This finding potentates the idea that cag genes in Peru
mainly originated in Europe and therefore confirms the
Discussion scenario proposed by Kersulyte et al. [15] as far as the cag-
H. pylori is presumed to be an ancient colonizer of the PAI and its right junction is concerned.
human stomach which possibly co-evolved with its host.
We report that at least some of the H. pylori strains found Kauser et al., [19] from our group have previously
in Peru share considerable homology with strains found described PCR analyses of the cagPAI for more than 300

Page 5 of 10
(page number not for citation purposes)
BMC Genomics 2006, 7:191 http://www.biomedcentral.com/1471-2164/7/191

Hp-Asia2

L45
L144L30
L
Hp-Europe

5
LL1732196 1 1 3
L3L1
L L L 5704095 u k

L27L67
2
L3
6 22
L K114 1 0 1eU600uk e
H
E 113357 K 5

a 156 3 c h 7

uk
nJe36218 L A21 7

fi1 0i112203En1 651303241K K


Ec1 1 8 tc 1 1 6 3

fi UU
1

ku212

AA 1

79
fi1 6 2 6 05 6

E
71

1 2170 8
ku 20 5
-2L28
5-1
ku214
n

8 A4 P1B
k uk u 2

coH 23bo 8 o3K0 A PHBU53


0

b o fqi 73 512 404Kuk


52

2 7 5 AD
4 c h 96A

277o35 b L U 2 6
1 c hk u 2 50 220 46 4

ku2fi88

c
16
L1

3C

n 6 0U
n

kJu3 4b 18 8SH b o
96 -2 32
1 66 k u1 5E2

4
5 8 M6 1S85 5 0 8

J 3 385c9chh
fi
4 eL K 3M7

8
Hp-Europe

0 9 L K111 U9 1CuHk ch
ne K 10 1

L 60 a D

73
5 0
09 8

E A6 40

Li 19AJD02
tt 8 M
7 U E 1 4K

A S
4U8 - 0 1693213

93
31s
1K 3 1 1 4 5 4

nfUI S
09
1 1 D777H 9
g2 E26qAq113666aCD193

HH11144117

0
H1
H4
4 nn 27149

2b6o2K1
MH

4 118
2 6

L
S1 X
MH 2841 ku 3K 9cchhh

32
SJ aD
M1 cc2S1247206 9c s 3 28K
1 2 11233c74h0K 9AA 5U
8345S ccLs9c M S L 5 11
1fi3
58u2 0 UK
10 c 02 A9 131
nq3 nq29nCq3H j16 -1 L2K32A15 6015cUhI19695865
n
66 q29 72 6 b o 1 5 H
U I198
I1
n q H
H9U4 111
be1 10 331 7 6
s
94 5 U u 1 HbUoI1 299k2ucU
2mP7B84
K J320 AD
H
re8s0u2
33 12cMhS17 23
H 4K 4R re7S006
3 UI121AD 43JH14 194
141
nct0ac1D16693 10H 40HUkI1770
2
189 372AD 17ch38 HU9u
01 38
2 0 8
cc5136790 s ininnmm
5
aa150n 3 e 6
6 5
c
Mne 4ene
961
622 6046
ne
inm a550
ne62 1 hsp-Maori
2440 1800L44 ne6 inma
inma51 ne617
12
50 nene600
60 36
17 79 nene n6e260 6 ne
61
9
cc2c 129UK10 nen6e0061191613
V230 4
ne60 1 ininminm
maa115
a14
AALLA in
ALm
ne607 Aa101
j99 ALAALAA245 15 hsp-Amerind
BF25a H H 3 ALAAALLA
BF1a HU HUU I I 1A911
BF19a BF7a GU IU11I716664920
D6b10a 17DUf32VD2U2S57J16M 81
9 Peruvian
BF 3-2 GU0915 2 S2
N 3
Gp125 9UK HU JM9

r e re781082-0200 0230C70P-20442
Lsu10174-15 Amerind

GG2 9 CPH UCr1eP7 0


80
Lsu10C1L3su 1014-1 U1G0 U h p 56 2

D U re r erre

U1U reY 1H7Y 0


r e6 1 2 1e1982003 8
027-167-1 6 10 1

b e1 336 S HH

32 r29 H7 000U462827346
fi1o2s41211 0 51
Lsu1Lsu1160-1a

b b
3 665 40 11
s

0 581
1 F8

58 3 S JU3I1
0

boo35 u8k2E 1 H U
0 be
hspWAfrica Lsu -1B5a

1 8c H

U 43
L s unn1qq137L1255S J M78
62BF12 4a

8
h U I2 09 96s2uc 1 0 470-1
0
HU

H U I1
108 0V1 89

1 4a
Lsu 1 1a D hsp-East Asia
BF D1a BF a
3 2S
I20 12

01 0Ls-1

Y0 0
U P1B0 h

2
4 86 8787c 6
bF2 a

uk
3
nqHU

1 F
5 0 91 4 6 33 85

DB
nq

B6
H39

1 99 c
a

1
30 60 V 1

54092 79 9 X870610 C94


1

u1 0 3 6 9

-
699s cc69

51 c
L1s-1 5-1

M4 6 2 1
Lmg 3

UP
c c 1 3 03-

19 ccc23820
35

PB
8 9 49 a 0

h
1 3c0a 1177 5 9 0

11886 19

I19
2B

S J 5 5 9 s u1 0
00 a 6a00

7
0

c5
2 90
9 sm 1 4

85
u2
u3 D 3F1u3

185
1 6 cc a1

5 40 c c 3a5c

L
c 1c0cc3 1 a
B Ls

-1
cc7c 126 0

ccc265aa
11 4 8

57
L s u 1 C 1c0c83 37c0

1 7979aLg
60

c 9
49

1740

1 9 00
05 4- C 7

C16 4

2c
cc 8a
13

cc
1450 8a
cc 2450
Ls

18 1
cc4
650
ccc4

192 9
cc4
cc24c

hspSAfrica

Hp-Africa2
5019
1812
8990
528 9
16 40
1630
17
1 930
16 20 1350
19

0.01

Figure
MLST analysis
2 based on concatenated gene sequences of 7 housekeeping genes of H. pylori (Kimura-2 parameter)
MLST analysis based on concatenated gene sequences of 7 housekeeping genes of H. pylori (Kimura-2 parameter). The phyloge-
netic tree was based on a total of 19 sequence records (concatenated) obtained under this study (SJM, HUPB, HU, CPY) while
incorporating other ~400 sequence records from pubMLST database (pubmlst.org) which were specific to different genotypes
in the world (Courtesy, Daniel Falush). Different genetic populations (Hp) and subpopulations (hsp) or genotypes are named
and differentially colored after previous conventions [11, 20]. All SJM isolates, Amerindians (SJM23 and 92-highlighted) and
non-Amerindians (arrowheads) analyzed by us from Peru are highlighted in bold face black fonts with green twigs indicating
presence of a western type cagPAI.

Page 6 of 10
(page number not for citation purposes)
BMC Genomics 2006, 7:191 http://www.biomedcentral.com/1471-2164/7/191

A
R e p r e s e n ta tiv e c a g P A I p r o f ile o f h s p - A m e r in d , S J M 2 3 /S J M 9 2

33
05
1

4
2

2/
0

1
52

52
52

52

52

52

52

0
8

53
52

53
53
52

52
P0

P0
P0

P0

P0

P0

P0

P0
P0

P0
P0

P0

P0
H

H
H

H
H

H
H
H

H
M

4
54

54 /
5 6 7 8 9 0 1 2 3 5 6 7

8
4 53 53 53 53 53 54 54 54 54 54 54 54

54
53 P0

9
P0 P0 P0 P0 P0 P0 P0 P0 P0 P0 P0 P0

P0
P0 H H H H H H H H H
H
H H H

P0
H
H

H
M

c a g A E P I Y A m o tif ty p e ( W e s te r n - E P I Y A - C ) , h s p - A m e r in d , S J M 2 3 /S J M 9 2
H L N F S D IK K E L N A K L G N F N N N N N N G L K N E P IY A K V N K K K T G Q V A S P E E P IY A Q V A K K V T Q K ID R L N Q IA S G L G G V G Q A A G
F P L K K H D K V D D L S K V G R S V S P E P IY A T ID D L G G P F P L K R H D K V D D L S K V

B
P ER U
S jm 1 8 4 W E P IY A A, B, C
S jm 3 8 W E P IY A A, B, C
S jm 4 5 W E P IY A A, B, C, C
S jm 5 0 W E P IY A A, B, C, C
S jm 6 3 W E P IY A A, B, C
S jm 6 9 W E P IY A A, B, C
S jm 8 0 W E P IY A A, B, C
S jm 9 3 W E P IY A A, B, C
S jm 1 4 8 W E P IY A A, B, C
S jm 1 8 0 W E P IY A A, B, C
S jm 1 8 9 W E P IY A A, B, C
S jm 1 8 W E P IY A A, B, C
S jm 1 9 W E P IY A A, B, C, C
S jm 2 0 W E P IY A A, B, C
S jm 2 1 W E P IY A A, B, C
S jm 2 3 W E P IY A A, B, C
S jm 2 7 W E P IY A A, B, C
S jm 1 4 W E P IY A A, B, C

S P AIN
HupB 79 W E P IY A A, B, C, C
HupB 51 W E P IY A A, B, C, C
HupB 76 W E P IY A A, B, C, C
HupB 78 W E P IY A A, B, C, C
HupB 71 W E P IY A A, B, C, C
HupB 62 W E P IY A A, B, C, C

JAP AN
C py 6261 EA E P IY A A, B, D
C py 6311 EA E P IY A A, B, D
C py 001 EA E P IY A A, B, D
C py 2362 EA E P IY A A, B, D
C py 0041 EA E P IY A A, B, D
C py 6021 EA E P IY A A, B, D
H u56 EA E P IY A A, B, D
H u133 EA E P IY A A, B, D
H u87 EA E P IY A A, B, D
H u55 EA E P IY A A, B, D
H u78 EA E P IY A A, B, D
H u118 EA E P IY A A, B, D
H u131 EA E P IY A A, B, D
H u176 EA E P IY A A, B, D
H u38 EA E P IY A A, B, D
H u48 EA E P IY A A, B, D

Figure
A. (Top)3 PCR based analysis of whole cagPAI of hsp-Amerind isolates from Peru
A. (Top) PCR based analysis of whole cagPAI of hsp-Amerind isolates from Peru. Overlapping primers spanning all of the con-
stituent genes (see methods) amplified all the corresponding PCR products in the expected size range as described previously
[30]. M indicates molecular weight marker (100 bp ladder). (Bottom) Amino acid signature of cagA (phosphorylation motifs –
colored) – characteristic of modern cagA EPIYA (EPIYA C) as observed for all the SJM isolates from Peru. B. Pictorial depic-
tion (right) of different types of cagA-EPIYA motif types prevalent in different H. pylori populations and their distribution in our
isolates (boxes on the left).

Page 7 of 10
(page number not for citation purposes)
BMC Genomics 2006, 7:191 http://www.biomedcentral.com/1471-2164/7/191

strains from different parts of the world. Of these, the and uniform as compared to virulence associated loci
majority of strains from whom complete PAI was ampli- such as the flagellins and vacA [31], recombinant and
fied were from Japan (57%) whereas only 18.6% strains hybrid alleles that blur phylogeographic inferences, could
from Peru and 13.3% strains from Spain could support be a rare occurrence rather than a rule. Nonetheless, it will
amplification of an intact cagPAI, due mainly to allelic be important to ascertain proportions of nearly pure and
diversity present at the primer annealing sites. These hybrid alleles among native Peruvian H. pylori through
observations were further endorsed by our present analy- admixture analyses based on sophisticated population
sis that revealed that all the Peruvian isolates we analyzed genetics tools [32] that reveal contemporary gene flow
carried only the European type PAI. and proportion of different nucleotides inherited from
ancestral populations on an evolutionary time-scale.
It has been recently reported that true Amerind strains
either do not carry a cagPAI or carry only a vestigial, Conclusion
incomplete PAI [20]. It is already well known that H. pylori In summary, our study based on profiling of several gene
can import short stretches of DNA from strains from very loci and neutral markers revealed certain distinctive
different populations when they (presumably co-) infect genetic features of the H. pylori gene pool in Peru. Impor-
individuals in the same location. However, it appears that tant among these features are a demonstrated genetic link
the cagPAI and the region surrounding it had been with Asian (Amerindian) strains and the presence of an
exchanged by the Amerindian strains in Peru, where, the intact cagPAI with western type of gene motifs that point
human population underwent major changes in recent to recent acquisition of this important pathogenicity
history with the arrival of European conquerors and set- island. This indicates possible lateral gene transfers during
tlers. The long isolated Amerindian H. pylori strains thus colonization of Peruvian Amerindians with both 'ancient'
came in contact with the European strains, which har- and 'modern' strains for several generations. Such an
bored a cagPAI. It is established that the cagPAI might give admixed gene pool could be an important source of
selective advantages during host colonization and there- genetic information on pathogen evolution in real time to
fore, "endogenous" Helicobacter, in Peru could be out- possibly understand how gene acquisition and loss on a
competed in a human community with newly arrived cag- population wide scale shape virulence and fitness in dif-
PAI positive strains introduced by the European conquer- ferent pathogens.
ors. The "endogenous" Helicobacter strains could
moreover acquire cagPAI during mixed infection with This could also possibly provide for a reasonable model of
western cagPAI positive strains leading to the observed geographic evolution to understand acquisition of viru-
strains with Eastern-like core genome content and a west- lence in pathogenic bacteria over a period of time.
ern-type cagPAI. So this finding would really be interest-
ing if it is clear that the entire cag-island had been Methods
exchanged. Then, there would be a very interesting ques- Bacterial strains, genomic DNA and diagnostic PCR
tion about the mechanism by which an entire island had Genomic DNA preparations for strains originating from
been transferred; there is no doubt that such exchanges Peru, Spain and Japan were provided by D. E. Berg and
have happened at some point, but if they happened Asish Mukhopadhyay (Washington University, St. Louis,
within the last 500 years, then, there would be much bet- Mo.). These DNA were isolated from patients diagnosed
ter chances of catching it in action. This is probably where with gastric ulcers from Spain (HupB); gastric cancer and
the present data lead us to, and, we suggest that future DU cases from Japan (CPY, Hu); and from gastritis cases
efforts may be directed towards confirmation of this evo- alone from Peru (SJM). However, in the current study, the
lutionary mechanism. clinical background of the individual isolates was not
taken into account. The Peruvian isolates we looked at (n
Finally, it is possible that phylogenetic methods based on = 27) were originally from Native Peruvian people mainly
highly recombining gene loci [21-27] may not be fully of Amerindian ancestry from Lima [15]. PCR based anal-
perfect to predict genetic relationships in terms of inherit- ysis of genes namely glmM, babB [16] and oipA was carried
ance from different ancestral populations, especially out to ascertain the quality of DNA samples we used. Also
when we use tools such as Mega 3.0 [28] which do not these PCR assays served as amplification level controls for
support admixture analysis. Partly in view of this possibil- the analysis of insertion, deletion and substitution in the
ity and to ensure that our conclusions did not represent cagPAI.
shortcomings of a single approach, we adopted an inte-
grated genotyping strategy for the cagPAI [29,30] as well as Integrated genotyping of H. pylori based on chromosomal
the core genome through MLST of less recombining, neu- DNA signatures
tral genes that encode cytoplasmic enzymes. Given the Whole genome fingerprinting based on FAFLP genotyping
fact that these housekeeping genes are selectively neutral was performed as described previously [21-23]. Briefly,

Page 8 of 10
(page number not for citation purposes)
BMC Genomics 2006, 7:191 http://www.biomedcentral.com/1471-2164/7/191

the profiling of whole genome micro-restriction finger- PAI of the representative isolates (SJM92 and SJM23)
prints with EcoRI/MseI enzymes using fluorescence tagged from hspAmerind by PCR using overlapping primers as
primer pairs EcoRI+A/MseI+0 and EcoRI+G or A/MseI+0 described by Blomstergren and colleagues [30]. The 3' end
was performed for all the strains. The PCR amplified frag- of the cagA gene was amplified using primers mentioned
ments for each of the strains were then subjected to elec- elsewhere [17] and the amplified products for strains
trophoretic separation on a 5% acrylamide gel and scoring from Spain, Peru, and Japan were sequenced with forward
of the fluorescent markers was done using an automated and reverse primers. The consensus sequences were then
DNA analysis workstation (ABI Prism 3100 DNA translated into amino acid sequences using GeneDoc soft-
sequencer). Cluster analysis of DNA profiles was con- ware (version 2.6.002) and were then assigned to the
ducted on the basis of fingerprint characteristics. All the Western or the East Asian group based on the C or D
data obtained through molecular genotyping and DNA repeat present respectively in the EPIYA motif [18]. Chro-
profiling were deposited in the genoBASE pylori database mosomal rearrangements are known to give rise to 5 types
[33]. The genoBASE pylori server was queried for compar- of insertion-deletion and substitution motifs in the region
ative analyses. between the 3' end of cagA gene and the 3' end of the
glutamate racemase (glr) gene. Although the statuses of
Genotyping based on candidate genes oipA and babB was these motifs for the Peruvian strains we analyzed were
carried out as described [24-26]. Short stretches of oipA described previously by Kersulyte and colleagues [15], we
gene were analyzed to determine the 'geographic signa- re-assessed 6 of them by PCR exactly as described earlier
ture' based on CT repeat [27]. In addition, 600 bp region [15].
each from the 7 housekeeping genes spread throughout
the genome [atpA, efp, ureI, ppa and mutY, trpC, yphC] was Authors' contributions
amplified and sequenced for all the isolates exactly as SMD and IA performed MLST typing and phylogenetic
described previously [11]. Sequencing was performed analysis. SMD also helped in analysis of babB and oipA
with both forward and reverse primers, using an ABI genotyping. AAK and AA performed PCR based analysis of
Prism 3100 DNA sequencer (Applied Biosystems, USA). the cagPAI genes. SAR extended bioinformatics support.
PCR and direct sequencing were performed at least twice LAS provided expert clinical consultation and contributed
to determine and confirm the DNA sequences for each to manuscript writing. NA performed FAFLP analysis,
isolate. Consensus sequence for each of the samples was planned and supervised the study, wrote the manuscript
generated using Genedoc (version 2.6.002). Multiple and provided overall leadership. All the authors read and
alignments of sequenced nucleotides were carried out approved the final manuscript.
using Clustal X (version 1.81). Neighbor joining trees
were constructed in Mega 3.0 [28], using bootstrapping at Acknowledgements
10000 bootstrap trials (FAFLP and babB) and through We are thankful to Seyed E. Hasnain for his guidance and support. We are
Kimura-2 parameters (for MLST). For construction of phy- highly grateful to Asish Mukhopadhyay and Douglas E. Berg for providing us
logenetic trees based on MLST genotyping procedures, with DNA samples of Peruvian and other strains as a gift. We are thankful
to Daniel Falush for support with international MLST data and advice. We
sequences of 7 housekeeping genes of strains belonging to
are also thankful to the International Society for Genomic and Evolutionary
different established genotypes were obtained from the Microbiology (ISOGEM) for supporting and endorsing the study. Financial
pubMLST database [34] (courtesy, Daniel Falush). support from the Department of Biotechnology, Government of India
(CDFD core grants) is gratefully acknowledged.
The nucleotide sequences of the 7 housekeeping genes for
the 6 hsp-Amerind isolates we analyzed have been depos- References
ited in the GenBank [Accession numbers, Gen- 1. Covacci A, Telford JL, Giudice GD, Parsonnet J, Rappuoli R: Helico-
bacter pylori virulence and genetic geography. Science 1999,
Bank:DQ462362–462367 (atpA), GenBank:DQ462368– 284:1328-1333.
462373 (efp), GenBank:DQ462374–462379 (mutY), 2. Montecucco C, Rappuoli R: Living dangerously: how Helico-
GenBank:DQ462380–462385 (ppa), Gen- bacter pylori survives in the human stomach. Nat Rev Mol Cell
Biol 2001, 2:457-466.
Bank:DQ462386–462391 (trpC), GenBank:DQ462392– 3. Ahmed N, Sechi LA: Helicobacter pylori and gastroduodenal
462397 (ureI), GenBank:DQ462398–462403 (yphC)]. pathology: New threats of the old friend. Annals Clin Microbiol
These and other sequences can also be requested from the Antimicrobials 2005, 4:1.
4. Alm RA, Ling LSL, Moir DT, King BL, Brown ED, Doig PC, Smith DR,
authors. Noonan B, Guild BC, deJonge BL, Carmel G, Tummino PJ, Caruso A,
Nickelsen MU, Mills DM, Ives C, Gibson R, Merberg D, Mills SD, Jiang
Q, Taylor DE, Vovis GF, Trust TJ: Genomic-sequence compari-
Profiling of the cagPAI son of two unrelated isolates of the human gastric pathogen
PCR analyses were carried out to find the status of the cag- Helicobacter pylori. Nature 1999, 397:176-180.
PAI using 8 sets of primers that amplified the cagA gene, 5. Putsep K, Branden CI, Boman HG, Normark S: Antibacterial pep-
tide from H. pylori. Nature 1999, 398:671-672.
its promoter region, the cagE and cagT genes and the left 6. Cavalli-Sforza LL: The DNA revolution in population genetics.
end of the cagPAI [25,26,29]. We also analyzed whole cag- TIG 1998, 14:60-65.

Page 9 of 10
(page number not for citation purposes)
BMC Genomics 2006, 7:191 http://www.biomedcentral.com/1471-2164/7/191

7. Cavalli-Sforza LL, Feldman MW: The application of molecular Helicobacter pylori isolates recovered from ulcer disease
genetic approaches to the study of human evolution. Nat patients in England. BMC Microbiol 2005, 5:32.
Genet 2003, 33(Suppl):266-275. 26. Prouzet-Mauleon V, Hussain MA, Lamouliatte H, Kauser F, Megraud
8. Holmes EC: The phylogeography of human viruses. Mol Ecol F, Ahmed N: Pathogen evolution in vivo: genome dynamics of
2004, 13:745-756. two isolates obtained nine years apart from a duodenal ulcer
9. Pavesi A: Utility of JC polyomavirus in tracing the pattern of patient infected with a single Helicobacter pylori strain. J Clin
human migrations dating to prehistoric times. J Gen Virol 2005, Microbiol 2005, 43:4237-4241.
86:1315-1326. 27. Ando T, Peek RM, Pride D, Levine SM, Takata T, Lee YC, Kusugami
10. Wirth T, Meyer A, Achtman M: Deciphering host migrations and K, van der Ende A, Kuipers EJ, Kusters JG, Blaser MJ: Polymor-
origins by means of their microbes. Mol Ecol 2005, phisms of Helicobacter pylori HP0638 reflect geographic ori-
14:3289-3306. gin and correlate with cagA status. J Clin Microbiol 2002,
11. Falush D, Wirth T, Linz B, Pritchard JK, Stephens M, Kidd M, Blaser 40:239-246.
MJ, Graham DY, Vacher S, Perez-Perez GI, Yamaoka Y, Me'graud F, 28. Kumar S, Tamura K, Nei M: MEGA3: Integrated software for
Otto K, Reichard U, Katzowitsch E, Wang X, Achtman M, Suerbaum molecular evolutionary genetics analysis and sequence align-
S: Traces of human migrations in Helicobacter pylori popula- ment. Brief Bioinfor 2004, 5:150-163.
tions. Science 2003, 299:1582-1585. 29. Ikenoue T, Maeda S, Gura KO, Akanuma M, Mitsuno Y, Imai Y, Yosh-
12. Wirth T, Wang X, Linz B, Novick RP, Lum JK, Blaser M, Morelli G, ida H, Shiratori Y, Omata M: Determination of Helicobacter
Falush D, Achtman M: Distinguishing human ethnic groups by pylori virulence by simple gene analysis of the cag patho-
means of sequences from Helicobacter pylori: lessons from genicity island. Clin Diag Lab Immunol 2001, 8:181-186.
Ladakh. Proc Natl Acad Sci USA 2004, 101:4746-4751. 30. Blomstergren A, Lundin A, Nilsson C, Engstrand L, Lundeberg J:
13. Underhill PA, Jin L, Zemans R, Oefner PJ, Cavalli-Sforza LL: A pre- Comparative analysis of the complete cag pathogenicity
Columbian Y chromosome-specific transition and its impli- island sequence in four Helicobacter pylori isolates. Gene 2004,
cations for human evolutionary history. Proc Natl Acad Sci USA 328:85-93.
1996, 93:196-200. 31. Achtman M, Azuma T, Berg DE, Ito Y, Morelli G, Pan ZJ, Suerbaum S,
14. Yamaoka Y, Orito E, Mizokami M, Gutierrez O, Saitou N, Kodama T, Thompson S, van der Ende A, van Doorn LJ: Recombination and
Osato MS, Kim JG, Ramirez FC, Mahachai V, Graham DY: Helico- clonal groupings within Helicobacter pylori from different
bacter pylori in north and south America before Columbus. geographical regions. Mol Microbiol 1999, 32:459-470.
FEBS Lett 2002, 517:180-184. 32. Falush D, Stephens M, Pritchard JK: Inference of population struc-
15. Kersulyte D, Mukhopadhyay AK, Velapatino B, Su WW, Pan ZJ, Gar- ture using multilocus genotype data: linked loci and corre-
cia C, Hernandez V, Valdez Y, Mistry RS, Gilman RH, Yuan Y, Gao H, lated allele frequencies. Genetics 2003, 164:1567-1587.
Alarcon T, Lopez-Brea M, Nair GB, Chowdhury A, Datta S, Shirai M, 33. GenoBASE pylori database [http://www.cdfd.org.in/amplibase/
Nakazawa T, Ally R, Segal I, Wong BCY, Lam SK, Olfat F, Boren T, HP]
Engstrand L, Torres O, Schneider R, Thomas JE, Czinn S, Berg DE: 34. pubMLST database [http://www.pubmlst.org]
Differences in genotypes of Helicobacter pylori from different
human populations. J Bacteriol 2000, 182:3210-3218.
16. Ghose C, Perez-Perez GI, Bello MGD, Pride DT, Bravi CM, Blaser MJ:
East Asian genotypes of Helicobacter pylori strains in Amer-
indians provide evidence for its ancient human carriage. Proc
Natl Acad Sci USA 2002, 99:15107-15111.
17. Yamaoka Y, Kodama T, Kashima K, Graham DY, Sepulveda AR: Var-
iants of the 3' region of the cag A gene in Helicobacter pylori
isolates from patients with different H. pylori-associated dis-
eases. J Clin Microbiol 1998, 36:2258-2263.
18. Hatakeyama M: Oncogenic mechanisms of Helicobacter pylori
cagA protein. Nature Rev Cancer 2004, 4:688-694.
19. Kauser F, Khan AA, Hussain MA, Carroll IM, Ahmad N, Tiwari S,
Shouche Y, Das B, Alam M, Ali SM, Habibullah CM, Sierra R, Megraud
F, Sechi LA, Ahmed N: The cag pathogenicity island of Helico-
bacter pylori is disrupted in a majority of patient isolates
from different human populations. J Clin Microbiol 2004,
42:5302-5308.
20. Gressmann H, Linz B, Ghai R, Pleissner KP, Schlapbach R, Yamaoka Y,
Kraft C, Suerbaum S, Meyer TF, Achtman M: Gain and loss of mul-
tiple genes during the evolution of Helicobacter pylori. PLoS
Genet 2005, 1(4):e43.
21. Ahmed N, Khan AA, Alvi A, Tiwari S, Jyothirmayee CS, Kauser F, Ali
M, Habibullah CM: Genomic analysis of Helicobacter pylori from
Andhra Pradesh, south India: molecular evidence for three
major genetic clusters. Curr Sci 2003, 85:101-108.
22. Carroll IM, Ahmed N, Beesley SM, Khan AA, Ghousunnissa S,
O'Morain CA, Smyth CJ: Fine-structure molecular typing of Publish with Bio Med Central and every
Irish Helicobacter pylori isolates and their genetic relatedness
to strains from four different continents. J Clin Microbiol 2003, scientist can read your work free of charge
41:5755-5759. "BioMed Central will be the most significant development for
23. Carroll IM, Ahmed N, Beesley SM, Khan AA, Ghousunnissa S, disseminating the results of biomedical researc h in our lifetime."
O'Morain CA, Habibullah CM, Smyth CJ: Microevolution between
paired antral and paired antral and corpus Helicobacter pylori Sir Paul Nurse, Cancer Research UK
isolates recovered from individual patients. J Med Microbiol Your research papers will be:
2004, 53:669-677.
24. Kauser F, Hussain MA, Ahmed I, Ahmad N, Habeeb A, Khan AA, available free of charge to the entire biomedical community
Ahmed N: Comparing genomes of Helicobacter pylori strains peer reviewed and published immediately upon acceptance
from the high altitude desert of Ladakh, India. J Clin Microbiol
2005, 43:1538-1545. cited in PubMed and archived on PubMed Central
25. Kauser F, Hussain MA, Ahmed I, Srinivas S, Devi SM, Majeed AA, Rao yours — you keep the copyright
KR, Khan AA, Sechi LA, Ahmed N: Comparative genomics of
Submit your manuscript here: BioMedcentral
http://www.biomedcentral.com/info/publishing_adv.asp

Page 10 of 10
(page number not for citation purposes)

You might also like