2.1. Cag A and Vac A
Definition, diversity, classification and their significance: Vacuolating cytotoxin, or VacA, and Cag (cytotoxin-associated genes) pathogenicity island (PAI), encoding a bacterial type IV secretory apparatus (T4SSs), are two main factors involved in
H. pylori pathogenicity (
Figure 1). CagA is a protein of 116-140 kDa molecular weight that is expressed by almost 70% of strains and which produces a specific cytotoxin [
38]. The significant role of this protein in
H. pylori-infected patients has led to the isolates being defined as belonging to one of two groups, either CagA-positive (type I) or CagA-negative (type II). Epithelial cells and cells of the immune system are considered as two main targets for VacA, in which is expressed by all
H. pylori strains [
39,
40]. VacA protein has cytotoxic activity that is due to its ability to drive intracellular vacuolization [
41]. It has been demonstrated that various cell types are vulnerable to this toxin [
40]. It has escape mechanisms to avoid the highly acidic environment of the stomach [
42]. Different receptors are recognized for VacA, yet their roles and importance are not clear [
43]. Similar to CagA, this virulence factor is expressed only in
H. pylori type I [
44]. Notably, there are three types of VacA genotype predicated on their signal sequence, namely
s1a,
s1b and
s2, as well as
m1 and
m2, which is based on middle-region alleles of the
vacA gene [
45]. Regarding
vacA allelic diversity, regions including
s-region (signal),
m-region (middle),
i-region (intermediate),
d-region (deletion) and
c-regions are elucidated. Based on the deletion at the 3' end of the
vacA gene, different types are investigated. The
i region exists in three types (
i1,
i2 and
i3), while all other regions are classified into two types (
s1,
s2,
m1,
m2,
c1,
c2,
d1 and
d2). More variants within these regions are proposed, of which K, E and Q-types are conspicuous [
46].
The
cag PAI is a 40 kb DNA sequence as that encodes type IV secretion system (T4SS) and CagA protein. This generates a pilus via which the bacterium can inject CagA protein into a host cell [
47,
48]. There are twelve recognized components of T4SSs in Gram-negative bacteria, including VirD4 and VirB1-11. It is organized into three parts: outer membrane core complex, inner membrane complex, and extracellular pilus [
49,
50]. Upon delivery of CagA into the cell and phosphorylation of a C-terminal EPIYA motif, the signaling pathway is activated via binding of CagA to the SH2 domain. Host cell changes occur after components interact with both phosphorylated and non-phosphorylated CagA. Of note are changes in cell junction, elongation, polarity, proliferation and proinflammatory response [
51,
52]. Various bacterial proteins such as CagM, CagX, CagY, CagT and Cag3 that form a part of CagPIA are encoded by a 41 nm long core structure. Among these, CagX and CagY are associated with the T4SS channel [
53]. An interaction between CagL on the T4SS and α5β1 integrin leads to CagA transposition and pilus formation. Consequently, cells become more irregular as a result of phosphorylation at the 3' end of
CagA gene (EPIYA), which is located in the PAI [
44,
54]. CagA is a highly immunogenic protein that comprises two types, CagAI and CagAII on the right or left segment, respectively [
38,
44]. This
H. pylori type I virulence factor is linked to gastroduodenal disease and its gene may be acquired horizontally [
44,
55]. Upon bacterial attachment and infection, CagA will activate signal factors such as interleukin (IL)-8, which depend on the Cag PAI activity [
44,
56]. IL-8 and NFκB will pave the way for inflammation and carcinogenesis [
54]. Another gene called
cagE, located in the
cagI and in proximity to
cagA, has similarity with
ptlC in
Bordetella pertussis [
44]. This gene is considered a better marker of pathogenicity, although further verification is needed [
57]. In addition, there is a correlation among these virulence factors and other OMPs including IceA, BabA, HopQ, OipA, SabA and HopZ [
58].
Geographical Variances and Clinical Associations: Several studies have investigated associations between these two antigens and different gastric conditions, yet neither is considered as an indicator of gastric cancer [
59,
60]. A high frequency of CagA-positive isolates in patients with gastric cancer was reported [
61]. Different results obtained vary by geographical region. In one study performed in various countries,
s1c-m1,
s1b-m1 and
s1a-m1 of
vacA were the predominant genotypes in Japan and Korea, US, and Colombia, respectively. Although
cagA genotype was predominant in all nations, no relationships with clinical outcomes were identified [
59]. In Egypt, however, 68.7% of patients with a gastric ulcer, 50% of gastric carcinoma patients and 33.3% of gastritis cases were positive for
cagA gene expression [
62]. On the other hand, in an Australian cohort of
H. pylori-infected individuals, 78% and 85% of cases of duodenal ulcer and gastric cancer, respectively, were positive for the
cagA gene [
63]. Another study showed an association between
vacA s1a,
cagE and
cagA with gastric cancer and duodenal ulceration [
64]. Additionally, a correlation between
d-region and gastric atrophy and neutrophil infiltration was reported. There is a close relationship between geographical region and distribution of VacA subtypes. It is apparent that
s1/
m1 and
i1 are predominant genotypes in northeast Asia. Also, a close relationship between VacA subtypes and gastric disorders is demonstrable. Furthermore, an association between
s1a,
s1c and
m1 with gastric cancer, peptic ulcer and intestinal metaplasia was reported [
3,
65,
66,
67].
2.2. H. pylori Outer Membrane Proteins
Hop is the largest family of
H. pylori OMPs, with 32 known members, yet they are collectively encoded by only 4% of the bacterial genome [
36,
37]. Hop A-E act as porin proteins as well as a channel through which antimicrobial agents permeate into the cell. Hence, many of them are potential candidates for development vaccine [
68,
69]. This group contains two divisions, Hop and Hor proteins. Interestingly, members of the latter lack a hop motif but still have an N-terminal motif, as do Hop proteins, and which is greatly variable in size. The former is divided into two groups based on the C-terminus [
37].
Adhesion to host epithelial cells is the very first step for
H. pylori colonization and persistence, which is mostly mediated by OMPs and T4SS [
70]. There are three distinct steps of infection: colonization; attack of the gastric mucosa; and escape from the immune system. Attachment to mucins depends on several variables including type of mucin, anatomical site, pH,
H. pylori strain and gastritis status. Also, interaction between
H. pylori and host Lewis antigens, Le
a,b,x,y, attributed to Hop proteins such as SabA and BabA, is vital to this process [
71,
72].
Protected by a mucus layer and composed mostly of MUC5AC and MUC6, the gastric epithelium is responsible for a glycosylation pattern that varies between gastric disorders. MUC2 is a type of mucin that does not exist in normal mucosa but instead is found mostly in intestinal metaplasia in which goblet cells are predominant. Understanding more about mucin expression patterns is important as
H. pylori adhesion is mediated through interaction between these antigens and virulence factors [
73,
74,
75].
2.2.1. Hop B and Hop C
HopB and HopC, also known as AlpA and AlpB, are encoded by the
alp A/B locus (OMP 20 and 21, respectively). Homology of AlpA/B among various
H. pylori strains is reported as more than 90%. While the role of these proteins remains to be substantiated, they are assumed to be involved in adhesion [
37,
76,
77], for which laminin serves as a receptor. Any interruption to Hop B/C leads to diminished binding of
H. pylori to laminin [
92]. In addition, these proteins are responsible for producing cytokines such as IL-6, IL-8 and for activating signal transduction [
76,
78,
79]. Gastric damage and modulation of cell signaling are consequent to AlpA/B adhesion [
80]. Both play a key role in
H. pylori colonization, although HopB appears to be more important [
81]. New insights into the molecular mechanism of HopC indicate a function in biofilm formation. As described later,
H. pylori can construct biofilm in human gastric cells, HopC being one of the OMPs with the capability to contribute to this in outer membrane vesicles (OMVs) [
82].
Regarding the pathogenicity of HopB/C, there is insufficient information correlating their presence with clinical outcomes. Analysis of 200
H. pylori isolates revealed that all express these proteins, which suggests their important roles [
79]. Interestingly, in another study severe gastric symptoms were associated with some
H. pylori virulence factors such as HopB and VacA, with a high prevalence of HopB in cases of gastric cancer and peptic ulcers (> 80%), implying the importance of this OMP to predictions of infection outcome [
67].
2.2.2. Hop H, a Phase-Variable Protein
HopH, originally identified as outer inflammatory protein or Oip A (Hpo638), is a phase-variable protein the alleles of which are present in almost all
H. pylori strains. A high rate of diversity within CT dinucleotide repeats occurs in the
oipA gene. Similar to other OMPs, it is assumed that HopH is involved in epithelial cell adhesion, although there are discrepancies arising from diversity between strains. This protein can also induce IL-8 production, cell-signaling and toxic events, as well as apoptosis [
83,
84,
85]. These properties are independent of Cag PIA activity. This means that those strains which contain both virulence factors are capable of producing, for instance, higher levels of IL-8 [
86]. Both functional and non-functional types of OipA are known [
86,
87]. Interestingly, an association between this protein and other virulence factors such as CagA, VacAs1 and BabA has been demonstrated [
58,
88,
89].
Hop H association with clinical outcomes: A correlation between the presence of HopH and gastric disorders such as gastric cancer and peptic ulceration has been established. A study in which several virulence factors were examined together showed that gene expression could be a useful predictor of progression to gastric cancer in patients with precancerous gastric lesions, although paradoxical findings have raised doubts [
90,
91,
92,
93]. An investigation of
hopH gene polymorphism led to two proposals for its pathogenicity, enhanced bacterial adhesion and correlation with the presence of other virulence factors [
94]. In another study performed on gastritis and peptic ulcers, a high prevalence of the
oipA gene was reported, which could imply a relationship between this gene and disease progression [
95]. Similarly, a study performed on patients with gastritis, gastric carcinoma or duodenal ulcers showed an association with virulence factors such as CagA, VacA, IceA, BabA and OipA. However, only OipA was recognized as a distinctive factor for clinical outcomes. Nonetheless, as this factor is common among patients, it should be applied as a predictor only in combination with other virulence factors [
87]. Several trials reported a connection between CagA and OipA expression in which slipped strand mispairing of complementary bases during DNA replication enhances bacterial adaptability. Conversely, OipA was reported as a non-significant marker in one study which used PCR to detect and differentiate
H. pylori virulence factors and to predict clinical outcomes [
67].
2.2.3. Hop P
This protein is also known as sialic acid-binding adhesin or SabA. for which the human Lewis (Le) histo-blood group antigens Le
x and Le
a are the main receptors. Sialyl-dimeric-Lewis x glycosphingolipid, defined as
H. pylori receptor, is overexpressed in the stomach of infected people as
sLex and
sLea gene expression is upregulated during inflammation. In contrast, in the gastric mucosa of healthy people sialylated glycoconjugates are not abundant [
96,
97,
98,
99,
100]. Other receptors for SabA have been identified. It can bind to α2-3-linked sialic acids and other sialic acid receptors [
101], while laminin in the extracellular matrix also serves as a receptor [
102].
H. pylori can bind specifically to glycosylated mucins, located in the proximity of epithelial cells, which helps it to maintain long-term infection [
103]. Additionally, the polymorphic nature of
H. pylori is attributed to SabA binding to sialylated carbohydrates. This is a unique strategy of adaptation for
H. pylori [
104], which tends to colonize those stomach areas with low acidity and high levels of HopP receptors [
105].
SabA is classified as a protein that is regulated by phase variation. This means that
H. pylori can switch expression of the gene on or off depending on circumstances [
106]. Interestingly,
sabA also undergoes gene conversion, which plays a key function in regulating SabA levels. Adhesion is affected by emerging subpopulations of
H. pylori with variable expression of the protein, which is a consequence of having recombination amongst
sab A, sab B and
omp27 genes [
107]. SabA also contributes significantly to spasmolytic polypeptide-expressing metaplasia (SPEM), which succeeds chronic atrophy and is a strategy for the stomach to reform its normal structural units following injury. It is thought that
H. pylori can help SPEM progression, in which SabA adhesion to sLe
x plays a pivotal role [
108].
Hop P and gastric disorders: Numerous studies have investigated an association between SabA and clinical outcomes. It appears that SabA is responsible for inflammation and its presence is correlated with clinical outcomes [
109,
110]. Also, a close relationship between this protein and gastric cancer has been found. In one study, 66% of
H. pylori strains in patients with gastritis were SabA-positive, 44% were positive in individuals with duodenal ulcers and 70% in cases of gastric cancer [
111]. Other studies that examined the frequency of SabA reported 93%, 86%, 80% and 23% detection in
H. pylori strains in the Netherlands, France, Taiwan and Iran, respectively [
58,
112,
113,
114]. Recently, a Brazilian report revealed that SabA can accelerate gastric cancer in infected people [
115].
2.2.4. Hop Q
Otherwise known as Omp27, HopQ is classified into two families, HopQI and HopQII [
36,
116]. Both 3' and 5' ends of
hopQ alleles are highly conserved in
H. pylori, but divergence occurs in the 1.1 kb mid-region, with a 75-80% similarity of nucleotide sequence. However, they are different in terms of geographical distribution, HopQI being isolated mostly in East Asia and HopQII commonly present in western countries [
117,
118]. Similar to other OMPs, these proteins mediate adherence to the gastric mucosa. It seems that there is a correlation between HopQ and other virulence factors like CagA and VacA [
119]. Prevalence of this protein is common in those
H. pylori strains with
cag PAI, which is responsible for encoding CagA and a type IV secretion system [
47].
A family of receptors defined as carcinoembryonic antigen-related cell adhesion molecules (CEACAMs) is recognized for HopQ and HopQ. CEACAM activation interferes with immune functions of T and NK cells [
120,
121]. Moreover, CEACAMs mediates various cell functions such as adhesion, proliferation, immune response and motility. CEACAM1, 5 and 6 are expressed by gastric epithelial cells. CEACAM1, 3 and 4 have both cytoplasmic and transmembrane domains, while CEACAM5, 6, 7 and 8 have glycosylphosphatidylinositol linkage to the host cell membrane. A strong connection between HopQ and CEACAM1, 3, 5 and 6 N-terminal domains facilitates
H. pylori adhesion to gastric epithelial cells. Interestingly, CEACAM1, 5 and 6 are found in multiple organs. Binding between HopQ and CEACAMs plays a crucial role in CagA delivery into host cells [
121,
122,
123].
The relationship between HopQ and CagA modulation is a focus of research interest [
124]. It has been shown that inflammatory reactions follow T4SS activation and transfer of CagA oncoprotein via HopQ-CEACAMs interaction. The inflammatory response ultimately leads to gastric cancer, which supports the idea of therapeutic approaches targeting HopQ-CEACAMs [
125,
126]. This interaction affects human CEACAMs, responsible for CagA activation and phosphorylation in polymorphonuclear neutrophils (PMNs) but not dendritic cells and macrophages. In PMNs it lessens CagA translocation and alters expression of CEACAM receptors. Also, the presence of human CEACAMs on PMNs increases bacterial survival within phagosome, thus resisting phagocytosis [
127].
Hop Q and clinical disorders: The correlation between both types of HopQ and gastric cancer is established [
128,
129]. Also, a high incidence of gastric cancer has been reported in patients with
hopQI and
vacA s1m1, or with
hopQII and
vacA S2 genotypes [
130]. In two studies, in specific geographical regions in Iran, the rate of
hopQII was higher than that of
hopQI and a correlation between these OMPs and clinical outcomes was observed. However, another study showed the inverse result by which HopQI prevalence was higher with no association with gastrointestinal disorders [
131,
132]. Although its correlation with gastric diseases was demonstrated in several investigations, paradoxically HopQ could even be used therapeutically, as trials have shown good efficacies against melanoma metastasis [
133].
2.2.5. Hop S, Hop T and Hop U
HopS, HopT and HopU were first identified as blood group antigen-binding adhesin A (BabA) or OMP 28 (~80 kDa), BabB or OMP 19, and BabC or OMP 9, respectively. They each mediate attachment of
H. pylori to histo-blood group antigens on gastric epithelial cells except for BabC, the function of which is not yet clear. Notably, there is extensive homology at the 3' and 5' segments of
babA and
babB [
134]. There are two types of
babA, namely
babA1 and
babA2, with the latter divided into two subtypes with high and low protein production (Bab A-H and Bab A-L) [
135,
136]. An evaluation of glycosphingolipids as a receptor reported that
H. pylori varies in its attachment to different blood groups including A Rh
+/- and O Rh
–. Moreover,
H. pylori could not adhere to glycosphingolipids in people with blood group O but could bind extremely well in A Rh
+/- individuals. In this study, Le
b hexaosyceramide, pentaosylceramide, heptaosylceramide, lactosylceramide, lactotetraosylceramide, neolactohexaosylceramide and pentaosylceramide were reported as BabA receptors [
137]. In addition to Le
b, fucosylated blood group A, B and O antigens are noteworthy receptors [
138]. Depending on the mid region and ability to bind ABO antigens, there are two classifications of BabA, specialist and generalist. The former refers to those
H. pylori strains that can attach to ALe
b (A-Lewis a), whereas the latter refers to those that bind to ALe
b, BLe
b (B-Lewis b) and Le
b [
139]. Also, analysis of variation of
babA and
babB revealed that there are five and three groups of alleles, including AD1-5 and BD1-3, for BabA and BabB, respectively [
136].
Helicobacter pylori is able to achieve compatibility with the variable gastric acidic environment through recombination and mutation in
babA genes. This enables mediated attachment via this protein, which is responsible for this phenomenon, thereby increasing the risk of progression to gastric cancer [
140]. BabA is an antigen that is commonly expressed by
H. pylori and which is related to specific clinical outcomes including peptic ulcers and gastric cancer. Also, colonization occurs predominantly in the lowermost antrum of the stomach [
141,
142]. Based on recent studies, recombination between the three
bab genes frequently happens [
143]. BabA undergoes genetic regulation through phase variation, which modulates its role in adherence. Also, it can be affected by recombination between
babA and
babB genes [
144]. This genetic regulation is beneficial for
H. pylori adaptation to its gastric environment in which the bacterium is exposed to a high level of physiological stress [
145].
Correlation with gastric disorders: Several studies have investigated a correlation between
babA gene expression and gastric disorders such as peptic ulcers and gastric cancer. Reportedly, inflammation induced by BabA adhesion results in gastric conditions such as precancerous transformations and intestinal metaplasia [
146,
147,
148]. Also, a correlation between
Leb and low binding activity and risk of duodenal ulcers was found [
149]. Notably, undertaken the correlation between this genotype and gastric cancer was demonstrated separately in Germany, Portugal, Japan, Taiwan, China, USA and Brazil [
90,
150,
151,
152,
153,
154]. Similarly,
babA2 gene was recently found at high frequency in patients with gastric cancer or peptic ulcers, although discrepancies arise regarding whether or not development to the severe gastric condition is associated with this genotype. A possible reason for this could be a lack of expression of BabA protein despite the presence of the gene [
67]. This agrees with a meta-analysis of twenty studies that indicated a strong association between BabA2 and increased risk of gastric cancer in Asian populations compared to South American ones, suggesting a significant role of this virulence factor in pathogenicity [
155].
2.2.6. Hop Z
This protein, also known as HP9, has a role in adherence to gastric epithelial cells, although its receptor is not yet recognized [
72]. The
hopZ gene undergoes slipped-strand mispairing and is regulated by a phase-variable CT repeat, which means whether it is switched on or off depends on the prevailing
in vivo situation. There are two types, HopZI and HopZII. This differentiation dates to the era in prehistory before migration of humans from Africa [
69,
156]. Its relationship with infection is suggested by some findings [
157,
158]. In one investigation, an association between this protein and gastric cancer was reported, but a correlation between HopZ and chronic atrophic gastritis has yet to be found [
72,
156].
2.2.7. Hop V, Hop W and Other OMPs
These porin members belong to the Hop A/E family. This is due to homologous N-terminal and C-terminus regions. In terms of their pore size, HopV and HopW are similar to
E. coli OmpF porin. Among
H. pylori isolates, their expression is relatively less. Hop X/Y have been identified as porins similar to Hop A-D [
37,
159,
160,
161]. Colonization attributed to OMPs is mediated by
H. pylori OMP 18 [
162].
In the
Helicobacter outer membrane (Hom) family, four members (HomA, B, C and D) are recognized, of which HomB is the most studied. Hom A/B exhibit variation in regard to genes copies and genomic localization in different geographical areas. The rate of homology between
homA/
B genomes is estimated at 90%, with only a 300 bp difference. Similar to other OMPs, recombination and phase variation are involved in gene duplication [
37]. HomA/B are known for their significant roles in adherence, antibiotic resistance, biofilm formation and gastric malignancies [
163]. Two important functions ascribed to HomB are IL-8 secretion and adherence [
164]. While no specific association with clinical outcomes has been found for either of these proteins, they are likely to be involved in
H. pylori persistence [
165].
Another group, defined as Hof proteins, includes eight members, namely Hof A-H. Each of these contains a hydrophobic C-terminal motif, similar to the Hom family. Recently, a study of
Helicobacter heilmannii showed that Hof E and Hof F act as adhesins in the same way as other OMPs [
37,
166].