1. Introduction
With the role of the fifth-ranked worldwide diagnosed malignancy, gastric cancer (GC) induces over 1 million new cases every year, which is a heavy global health burden [
1]. As the predominant histological subtype of gastric cancer, stomach adenocarcinoma (STAD) is characterized as the second most prevalent cancer around the world and also ranks third in deadliest tumors [
2]. Despite several therapies, which include surgical techniques, chemotherapy, and immunotherapy, having advanced, the overall prognosis of GC patients still can’t reach our satisfaction, which is largely due to the disease’s extensive heterogeneity [
3,
4,
5,
6].
For further researching the heterogeneity of the disease, many studies have focused on the molecular subtypes of STAD, such as: the Epstein–Barr virus (EBV) subtype, which is characterized by DNA hypermethylation, and the genomically stable (GS) subtype that has fewer specific genomic alterations [
7,
8,
9]. Though these current molecular classifications, which have been wildly used in multiple projects including The Cancer Genome Atlas (TCGA) and the Asian Cancer Research Group (ACRG), have improved our understanding of GC subtypes at the transcriptomic level, the development of the RNA sequencing technology, such as the single-cell RNA sequencing (scRNA), provides more possibility for the advanced identification, which have revealed that GC patients could be classified into more molecular subtypes with different molecular mechanism characters [
10,
11,
12,
13]. Therefore, it’s necessary for us to have a more refined understanding of the heterogeneity based on the molecular subtypes of GC at the single-cell level.
As a technology of next-generation sequencing, scRNA sequencing enables the profiling of gene expression in individual cells, which includes not only rare tumor cells obscured by bulk RNA-seq but also multiple non-tumor cells of the tumor microenvironment (TME) [
13,
14,
15,
16]. In gastric cancer, multiple scRNA-Seq studies have uncovered substantial intratumoral heterogeneity and highlighted their functional diversity in TME. For example, the cellular communication between cancer-associated fibroblasts (CAFs) and GC cells was reported to have the function of maintaining GC stemness [
8,
17]. And it was revealed that the tumorigenesis and metastasis of MET-unamplified GC could be induced by the CAF-derived HGF [
18]. Nevertheless, the heterogeneity of malignant epithelial cells in gastric cancer, which usually dominates the tumor progression and metastasis, still remains incompletely characterized and needs further research. Specifically, the molecular differences among different tumor subtypes and the corresponding underlying regulatory system have not been systematically investigated.
Olfactomedin 4 (OLFM4), which is also known as hGC-1 and hOlfD, originates from human hematopoietic myeloid cells, and is identified as GW112, which is a novel antiapoptotic molecule [
19]. It is reported that OLFM4 is correlated not only with apoptosis but also with tumor growth [
20]. Besides, overexpression of OLFM4 has been shown to modulate the cellular proliferation of prostate cancer and presents a promotion effect of apoptosis in human myeloid leukemia [
20,
21,
22]. Meanwhile, as it’s said that OLFM4 has the potential as a biomarker for gastric cancer, we realize the importance of OLFM4 in GC and tend to focus on further investigation of the underlying mechanism in GC [
23,
24,
25].
Metabolic reprogramming is a characteristic of malignant tumors, which plays an important role in the growth and survival of cancer [
26]. Multiple biological mechanisms have been reported to participate in the process. For example, as the key factor of glycolysis, the Hexokinase 2 (HK2) could combine with voltage-dependent anion channels (VDACs), which are located on the outer membrane of mitochondria, to obtain ATP, whose activity is available for regulating the metabolic reprogramming in GC [
27,
28]. Besides, when GC cells lack glucose, nonessential amino acids could maintain the mitochondrial homeostasis to suppress the activity of the mitochondrial apoptosis pathway [
29,
30].
Among these diverse metabolic alterations, we find that the palmitoylation of protein, which is a reversible modification of lipid combining a palmitate group with cysteine residues, could play an important role in the post-translational mechanism that regulates not only metabolic enzymes but also signaling molecules, which is regarded as the modulator of anti-cancer treatment efficacy [
31,
32]. In gastric cancer, it’s said that the ZDHHC family, such as ZDHHC2 and ZDHHC14, affects the development of the tumor [
33,
34].
In this study, we integrated bulk and single-cell RNA sequencing data for the systematic investigation of the human GC heterogeneity at the transcriptional level. With the primary tumor and normal samples of the GSE163558 cohort, we identified seven malignant tumor subtypes in epithelial cells. Among these clusters, the C3 subtype, which was characterized by specifically high OLFM4 expression, presented enhancement of the palmitoylation and several energy metabolism pathways. For further confirmation, we upregulated the expression of OLFM4 in GC cell lines, and observed that the expression of ZDHHC2, which was identified as a palmitoyltransferase, as well as GLUT1, which was a key glycolytic transporter, was indeed increased, accompanied by increased ATP production. Furthermore, according to these C3-specific genes, we established a prognostic risk model with significant prognosis-related genes in TCGA-STAD (MUC16, RALA, and PCBD1), which predicted patient survival effectively and also demonstrated the correlation with the immune checkpoint expression.
3. Discussion
In this study, we comprehensively investigated the cellular heterogeneity of gastric cancer in not only bulk but also single-cell RNA sequencing data. Seven distinct tumor subtypes had been identified, which revealed their diversity of biological and transcriptional features. Among these seven clusters of scRNA data, the OLFM4-positive C3 subtype demonstrated significantly upregulated activity of both palmitoylation and glycolysis, which indicated a unique metabolic phenotype.
It was reported that OLFM4 was widely expressed among several types of tumors, which included gastric cancer, and many researchers held the view that OLFM4 acted as a secreted glycoprotein involved in the molecular mechanism regulation and played an important role in the development of gastric carcinogenesis [
19,
25,
35,
36,
37]. For example, Tsubosaka et al. hold the view that OLFM4 may play a crucial role in the early stage of intestinal metaplasia (IM), which could be identified as a biomarker for cells in the intermediate stage between gastric and intestinal cell types [
38]. And OLFM4 also presents the potential as a biomarker for the early detection of high-risk gastric lesions [
23]. Besides, it’s also reported that high OLFM4 expression could be found in the intestinal stem cells of the IM region, which is correlated with the upregulation of the activity in not only oxidative phosphorylation but also myelocytomatosis (MYC) pathways [
39]. However, very limited publications have defined the correlation between OLFM4 and not only palmitoylation but also energetic metabolism in gastric cancer. Therefore, we further verified their underlying relationship.
Functional experiments confirmed that upregulated expression of OLFM4 in GC cell lines with limited expression promoted the expression of specific biomarkers of palmitoylation and glycolysis, which included ZDHHC2 and GLUT1, accompanied by enhanced production of ATP. Furthermore, we established a risk-score-related model based on three C3-specific genes (MUC16, RALA, and PCBD1) for predicting outcomes of GC patients, which was also correlated with immune checkpoint expression. These findings novelly highlight the association between OLFM4 and not only palmitoylation but also energy metabolism in gastric cancer progression.
For the lipid modification of protein palmitoylation, the zinc finger DHHC-type (ZDHHC) enzyme family plays a key regulatory role in multiple cancer types, including gastric cancer (GC). By inducing the covalent attachment of palmitic acid to cysteine residues, the palmitoylation influences not only protein stability, but also membrane localization and signaling dynamics. Mechanistically, dysregulation of specific ZDHHC enzymes, which notably include ZDHHC2, ZDHHC3, and ZDHHC9, has been proven to have an association with aberrant activation of multiple oncogenic signaling cascades such as Ras, Wnt/β-catenin, and PI3K/Akt [
32]. For example, in the research on the molecular mechanism, it’s said that antiproliferative signaling could be mediated by palmitoylation of CKAP4 induced by ZDHHC2 [
40]. Besides, DHHC2-induced palmitoylation of Nrf2 could inhibit ubiquitin-dependent degradation to stabilize the transcription factor. It not only induces the enhancement of transcriptional activity of Nrf2 but also drives redox homeostasis, metabolic reprogramming, and chemoresistance in GC [
34]. Both 2-bromopalmitate (2-BP) treatment and DHHC2 silencing could block this modification, which further reduces tumor growth and reverses immune suppression, positioning palmitoylation as a therapeutic target. Moreover, it’s also reported that the membrane localization and stability of PD-L1 could be enhanced in AGS after the palmitoylation by ZDHHC3, which further inhibits the sustenance of T-cells to promote immune evasion [
41].
In the ZDHHC family, we realized that both ZDHHC2 and ZDHHC14 have been investigated in GC by several studies. For ZDHHC14, it has been reported that high ZDHHC14 expression could primarily promote tumor cell migration and invasion of GC by regulating integrin α5 and β1 subunit mRNA and protein, which indicates ZDHHC14 as an oncogene [
33]. Interestingly, some articles announced that ZDHHC2 may serve as a tumor suppressor, as high ZDHHC2 expression is associated with worse clinical phenotypes, which include heavier lymph node metastasis, worse prognosis, and higher histological grade [
42,
43]. However, some other publications also indicate that the upregulation of ZDHHC2 could induce the promotion of GC growth and reduce the reactive oxygen species (ROS) levels via the palmitoylation of NRF2 [
34].
Such differences of ZDHHC2 among different studies may be attributed to multiple factors, including the cellular context, the target substrates, and the oxidative stress microenvironment in different gastric cancer models. Since the process of palmitoylation is reversible for regulating protein localization and stability, the final biological effect of ZDHHC2 activation could largely be attributed to the regulation of its underlying molecular mechanism. In particular, the stabilization of NRF2 protein, which is induced by the ZDHHC2-mediated palmitoylation, thereby enhances the antioxidant defense system for reducing intracellular ROS levels, which indicates a feasible reason for tumor cells to survive under oxidative stress and suggests that ZDHHC2 can act as an oncogenic factor under certain conditions. Conversely, loss of ZDHHC2-mediated palmitoylation on other substrates may also lead to the opposite result. Therefore, we hold the view that ZDHHC2 may act as a suppressor or promoter in the process of tumor development, which depends on its specific interacting partners and the surrounding microenvironment.
Then, as for metabolic reprogramming, it is widely recognized as a hallmark of cancer, including the GC [
44]. Several factors could lead to the occurrence of metabolic reprogramming, such as: interactions between gene and protein levels, and communications between tumor cells and host cells, which could further enhance the absorption of glucose in cancer [
45]. For example, the p53 could regulate fructose-2,6-bisphosphatase to inhibit glycolysis, promote antioxidant activity, which further induces a reduction in intracellular reactive oxygen species (ROS) [
46]. Besides, several biological processes induced by the abnormal activation of c-MYC, which include not only chromosomal translocation but also gene amplification and transcriptional enhancement, are significantly related to the occurrence and development of metabolic reprogramming [
47]. Therefore, we tend to perform some novel molecular modulators of energetic metabolism, which induce us to have the scRNA analysis. According to our single-cell analysis data, we revealed that the diversity of metabolism occurred not only between tumor and normal epithelial cells but also among malignant subtypes themselves. The enrichment results of the C3 subtype exhibited multiple upregulated pathways of metabolism, including glycolysis, ATP synthesis coupled to electron transport, and oxidative phosphorylation, which indicated enhanced metabolic plasticity. This observation corresponds to previous studies reporting that OLFM4 promotes cell proliferation [
34]. Thus, we tend to observe the relationship between OLFM4 and the underlying modulators of energetic metabolism. It’s widely known that tumor cells could reverse their metabolic mode in different conditions, including not only the concentration of external nutrients but also stress conditions, and prefer to produce adenosine triphosphate (ATP) via glycolysis even with enough oxygen, which makes us select glycolysis for further verification [
48,
49]. With the function of transporting glucose into cells, the glucose transporters 1 (GLUT1) play a crucial role in glucose metabolism as a key glycolytic enzyme [
28,
50]. Then, considering the definite biological impact on the energetic metabolism in GC, such as reversed metabolic reprogramming and induced apoptosis due to the downregulation of GLUT1, we finally select GLUT1 for verifying the association between the OLFM4 and glycolysis [
51].
As a prognosis predicting tool, the risk model we established, which is composed of MUC16, RALA, and PCBD1, provides a good effect in predicting outcomes of STAD patients. MUC16, which is also called CA125, is a membrane-bound mucin. Besides, MUC16 also plays an effective role in risk stratification of patients for evaluating the therapeutic efficacy, as its combination with human epididymis 4 (HE4) can give patients with endometrioid adenocarcinoma before surgery an effective prediction of the high lymph node metastasis (LNM) risk [
52]. And in GC, MUC16 has been reported for its association with the tumor mutation burden (TMB) [
53]. Then, as a small GTPase, RALA encodes the RAS-like (RAL) GTPases with its family member RALB together, which is also associated with immune infiltration in the TME. There is a positive correlation between the RALA expression and the Macrophage M1 infiltration in STAD [
54]. Furthermore, however, as the most abundant organs of PCBD1 are the kidney and liver, it’s difficult to discover enough significant results about its association with the development and progression of STAD [
55]. Anyway, after combining these three genes together for stratification, TCGA-STAD patients are divided into high- and low-risk groups, whose efficacy and clinical value have been validated by not only ROC but also Kaplan–Meier analyses. Importantly, the risk score is positively correlated with multiple immune checkpoints, which include PD-L1 (CD274) and IDO1, indicating that metabolic reprogramming might be related to immunosuppressive TME, which implies that it may be possible for those patients with high OLFM4 expression or high-risk scores to benefit from some therapies targeting both energetic-metabolism-related pathways and immune checkpoints.
However, there are still many limitations to consider. First, although we got some valuable insights from our single-cell analysis, the number of selected gastric cancer samples was relatively limited, which may not capture the entire spectrum of tumor heterogeneity in GC. Second, though we had confirmed the regulations of OLFM4 towards ZDHHC2 as well as GLUT1 in vitro experiments, further investigation in vivo should also be required for validation and to explore the underlying signaling pathways with more details. Third, as the risk-score-related model was derived from the TCGA-STAD cohort, an external verification in at least another independent clinical dataset should be necessary for ensuring its general applicability. Despite these limitations, we still hold the view that we establish an integrative approach, which combines computational analysis with experimental validation, and provides a robust framework for exploring heterogeneity in gastric cancer.
4. Materials and Methods
4.1. Bulk-Seq Data Analyzation
The RNA-Seq expression matrix in log2-transformed TPM of human gastric cancer samples with the corresponding clinic data (n=403) was obtained from the TCGA-STAD cohort to establish the model for further analysis.
4.2. scRNA-Seq Data Processing
The scRNA Seq data of 4 human gastric cancer samples (3 primary gastric cancer and 1 adjacent non-tumoral sample) of the GSE163558 cohort in the GEO database were selected for further analysis. The whole process was achieved via the Seurat R package. For the quality control, cells were excluded with the mitochondrial percentage more than 20%, and the cutoffs of both UMI counts and gene counts were set as 200-1500 and 200-5000. Then the filtered data was normalized by R function NormalizedData(). Afterwards, FindVariableFeatures() function was performed to select 2000 highly variable genes for further PCA reduction. Furthermore, to reduce the impact of the cell cycle, we used the ScaleData() function by regressing out both “S.Score” and “G2M.Score”. Besides, the R package harmony was applied to reduce the batch effect of multiple samples. The dimensionality reduction included both UMAP and TSNE, and cell clusters were annotated via the method of Louvian with different specific signatures. As the FindAllMarkers() function was used to discover the differentially expressed genes (DEGs) among different cell clusters, we set the threshold for identifying significant DEGs as: p-value<0.05, log2FC>0.25 and the expression proportion more than 0.1.
4.3. Cell Clusters Identification
Multiple groups of genes were used to identify different human cell clusters in scRNA Seq. For the epithelial cells, AGR2, PIGR, KRT19, KRT8, KRT18 were regarded as specific markers, and SPARC, CALD1, COL1A2, COL3A1, COL1A1 were identified as the specific markers of fibroblast. Besides, markers for identifying endothelial cells included IGFBP5, PLVAP, STC1, COL4A1, IGFBP7, SPARCL1. T cells were divided into 3 types, which included CD4+ T cells (CXCL13, DUSP4, ICOS, TNFRSF18, MAF, CD2, CTLA4), CD8+ T cells (GZMB, GZMA, NKG7, CCL5, GNLY) and Naïve T cells (INTS6, TSPYL2, TCF7, BCL11B, HEXIM1, IL7R). Meanwhile, we selected several markers (G0S2, FCGR3B, S100A8, CXCL8) as the specific markers for identifying neutrophil cells. Furthermore, markers of monocytes consisted of MMP12, EREG, CTSB, SPP1, APOE. As for the plasma cells and B cells, we used two different groups of markers to distinguish them. The markers of plasma cells were IGLC3, JCHAIN, IGHA1, IGKC, and B cells specific markers were CD83, LY9, MEF2C, BANK1, MS4A1. For the classification of tumor cells, seven specific markers of gastric cancer (LGALS3, CD74, PLCG2, OLFM4, HELLS, TOP2A, GNLY) were selected for the distinguishment according to their expression in these cells. Then, seven clusters of malignant epithelial cells were identified.
4.4. CNV Analysis of scRNA-Seq
With the identification of different clusters, CNV analysis was further performed in the scRNA data to estimate the malignancy level of all cells through the inferCNV. As fibroblasts and endothelial cells were annotated into reference groups, CNV scores of each epithelial cell were acquired, and cells with a score higher than the median CNV score were identified as malignant tumor cells.
4.5. Pseudotime Analysis of scRNA-Seq
To analyze the pseudotime of the identified malignant tumor cells, the R package Monocle2 was used to investigate their differentiation simulation. First, the Seurat object of tumor cells was transferred into the CellDataSet (CDS) object. Then, after the gene ordering had been identified, the DDRTree method was applied for both the dimensionality reduction and the trajectory inference. All parameters in the analysis were set default.
4.6. Transcription Factor Analysis
The transcription factor (TF) analysis in malignant tumor cells was achieved through the Single-Cell Regulatory Network Inference and Clustering (SCENIC) to calculate the regulon activity score (RAS) in default characters. First, by using the GRNBoost software, we built the TF-genes co-expression network. Then, with the motif database, we could verify the combination between those genes and TFs via the RcisTarget. Furthermore, the AUCell R package was used to quantify the activity of each regulatory subset, which was regarded as the Regulon score.
4.7. Cell-Cell Communication Analysis
The analysis of cell-cell communication was achieved by CellChat R package. First, we set the normalized gene expression matrix as the input to create the CellChat object. After identifying both over-expressed signaling genes and over-expressed ligand-receptor interactions, communication probability between any cell groups with interaction had been calculated via the R function computeCommunProb(). Furthermore, we also computed the probability of communication at the signaling pathways level with filtered communications, which was achieved by R functions including filterCommunication() and computeCommunProbPathway(). Finally, the network of cell communications had been established through the R function aggregateNet(), which revealed exact communications between any different cell groups.
4.8. The Signature Score of Efferocytosis
We first select the geneset of efferocytosis, whose term was named as GOBP_PEPTIDYL_L_CYSTEINE_S_PALMITOYLATION (gsea-msigdb.org), from the Msigdb database. Then, we used the geneset for calculating the signature score, which represented the activity of the efferocytosis degree, in the scRNA data via the ssGSEA method in the GSVA R package.
4.9. Immune Infiltration Analysis
For estimating the infiltration of different immune cells in human bulk sequence, three methods including ESTIMATE, CIBERSORT, and xCell were applied by using the IOBR R package, which finally provided a numeric immune infiltration score to each patient sample from TCGA-STAD.
4.10. Enrichment Analysis
With the method of hypergeometric distribution, enrichment analysis based on the KEGG and GO databases was applied by using the clusterProfiler R package. Significant signaling pathways were identified as the p-value, which was adjusted by BH, was lower than 0.05. GSVA analysis was achieved by using fgsea R package, and the GSVA scores of each pathway in different tumor subtypes were compared to find a statistically significant result, whose significance was further quantified by the t-value.
4.11. The Prognosis-Based Model for Gene Screening
With the different expressed genes (DEGs) from the C3 tumor cell cluster, we chose several models based on the patients’ prognosis to perform further gene screening, which included both the univariate and multivariate Cox regression model (R function coxph() from the survival R package). As for the threshold of the filter, the p-value for univariate Cox regression was set as p<0.1 and for multivariate regression was set as p<0.05. Then, with the timeROC R package, the model’s prediction capabilities were quantified and estimated through the area under the curve (AUC) of the time-varying receiver operating characteristic (ROC) curve.
4.12. The Statistical Analysis
All data was processed, analyzed, and visualized through R 4.1.3. In these steps, Pearson’s analysis was applied to estimate the correlation between any two continuous variables, and the Chi-squared test was used to assess the correlation of categorical variables. The Wilcoxon rank-sum test or the T-test was performed in the comparison between two groups, and one-way ANOVA was applied to the comparison with groups of more than three. In the KM survival analysis, the best cut-off for separating the continuous variable was achieved through the survminer R package.
4.13. Cell Culture
All human gastric cancer cell lines were originated from the American Type Culture Collection (ATCC), which included GES-1, HGC-27, AGS, MKN45, SGC-7901, and were cultured with the 10% fetal bovine serum (FBS) medium (DMEM or RPMI-1640) at an atmosphere of 5% CO2 and 37 °C. Selected cell lines were further incubated with the recombinant protein of OLFM4 (10261-OL, R&D Systems) for 24h.
4.14. Western Blotting
Lysates of multiple cell lines seeded in 10cm wells were collected after cell lysis. The extracted proteins of different cell lines were separated by sodium dodecyl sulfate - polyacrylamide gel electrophoresis (SDS-PAGE) and transferred onto PVDF membranes, which were further blocked in 5% milk for 1h at room temperature. After incubating with primary antibodies at 4 °C overnight, secondary antibodies were used to incubate membranes for 1h at room temperature. Finally, we used the Odyssey CLx Imager for visualizing signals.
Antibodies are listed: OLFM4 (ab85046, Abcam), GLUT1(ab115730, Abcam), ZDHCC2 (sc-515204, Santa Cruz), β-actin (sc-69879, Santa Cruz).
4.15. ATP Measurement
The treated human gastric cancer cell lines in 6-well plates were lysed and centrifuged to obtain the lysates, which were further used for measuring the ATP level according to the instructions of the ATP assay kit (HY-K0314, MCE). The expression of ATP in different samples was quantified by the strength of luminescence in the microplate reader.
Figure 1.
(A) The dot plot demonstrated specific markers for identifying different cell clusters. (B) The UMAP plot visualized the classification of different cell clusters. (C) The bar plot represented the ratios of different cells in both the primary tumor and the normal tissue.
Figure 1.
(A) The dot plot demonstrated specific markers for identifying different cell clusters. (B) The UMAP plot visualized the classification of different cell clusters. (C) The bar plot represented the ratios of different cells in both the primary tumor and the normal tissue.
Figure 3.
(A) tSNE plot demonstrated seven subtypes of malignant epithelial cells. (B-C) Bar plots presented the ratio of each subtype in different cell cycle phases and tissue types. (D) Heatmap of Ro/e analysis showed the enrichment of different tumor subtypes in different tissue groups. (E-H) tSNE plots and violin plots revealed the diversity of multiple tumor subtypes in different dimensions.
Figure 3.
(A) tSNE plot demonstrated seven subtypes of malignant epithelial cells. (B-C) Bar plots presented the ratio of each subtype in different cell cycle phases and tissue types. (D) Heatmap of Ro/e analysis showed the enrichment of different tumor subtypes in different tissue groups. (E-H) tSNE plots and violin plots revealed the diversity of multiple tumor subtypes in different dimensions.
Figure 4.
(A-B) The chord graph and the dot plot demonstrated the incoming interaction of tumor cells with other cells and the corresponding ligand-receptor pairs. (C-D) The chord graph and the dot plot demonstrated the outgoing interaction of tumor cells with other cells and the corresponding ligand-receptor pairs. (E) The dot plot presented both the incoming and outgoing cell-cell interaction strength of all cell types.
Figure 4.
(A-B) The chord graph and the dot plot demonstrated the incoming interaction of tumor cells with other cells and the corresponding ligand-receptor pairs. (C-D) The chord graph and the dot plot demonstrated the outgoing interaction of tumor cells with other cells and the corresponding ligand-receptor pairs. (E) The dot plot presented both the incoming and outgoing cell-cell interaction strength of all cell types.
Figure 5.
(A-B) Box plot and tSNE plot visualized and compared the CytoTRACE score of each tumor subtype for presenting their cellular plasticity. (C) Plots of the trajectory analysis demonstrated different states and the differentiation trajectory according to the pseudotime analysis. (D) Bar plot showed the distribution of seven tumor subtype cells in 5 differentiation states. (E-F) Expression of differentiation-related genes had been visualized based on the pseudotime trajectory.
Figure 5.
(A-B) Box plot and tSNE plot visualized and compared the CytoTRACE score of each tumor subtype for presenting their cellular plasticity. (C) Plots of the trajectory analysis demonstrated different states and the differentiation trajectory according to the pseudotime analysis. (D) Bar plot showed the distribution of seven tumor subtype cells in 5 differentiation states. (E-F) Expression of differentiation-related genes had been visualized based on the pseudotime trajectory.
Figure 6.
(A) Feature plots and the violin plot indicated the different activity of the palmitoyl among the seven tumor subtypes. (B-C) Bar plots showed the enriched pathways of both KEGG (B) and GO (C), as pathways in KEGG were ranked by the -log10(pvalue) and pathways in GO were ranked by the gene counts. (D) The bar plot demonstrated the t-value of GSVA scores of multiple pathways in comparing C3 tumor subtypes with other tumor subtypes. (E) The heatmap visualized the GSVA scores of multiple pathways among different tumor subtypes. (F) Western blot showed the different expression of OLFM4 in 5 human gastric cancer cell lines (GES-1, AGS, HGC-27, MKN45, SGC-7901). (G) Western blot demonstrated higher OLFM4 expression induced upregulation of both the palmitoylation marker ZDHHC2 and the glycolysis marker GLUT1 in both AGS and HGC-27. (H) Bar plots showed that the OLFM4 upregulated group had significantly higher ATP expression in both AGS and HGC-27.
Figure 6.
(A) Feature plots and the violin plot indicated the different activity of the palmitoyl among the seven tumor subtypes. (B-C) Bar plots showed the enriched pathways of both KEGG (B) and GO (C), as pathways in KEGG were ranked by the -log10(pvalue) and pathways in GO were ranked by the gene counts. (D) The bar plot demonstrated the t-value of GSVA scores of multiple pathways in comparing C3 tumor subtypes with other tumor subtypes. (E) The heatmap visualized the GSVA scores of multiple pathways among different tumor subtypes. (F) Western blot showed the different expression of OLFM4 in 5 human gastric cancer cell lines (GES-1, AGS, HGC-27, MKN45, SGC-7901). (G) Western blot demonstrated higher OLFM4 expression induced upregulation of both the palmitoylation marker ZDHHC2 and the glycolysis marker GLUT1 in both AGS and HGC-27. (H) Bar plots showed that the OLFM4 upregulated group had significantly higher ATP expression in both AGS and HGC-27.
Figure 7.
(A) Heatmap of Regulon CSI matrix demonstrated the classification of TFs, which classified tumor cells into 3 groups (M1, M2, M3). (B) Dot plots showed average regulon activity scores of different tumor subtypes in each module and labeled the significantly related tumor subtypes. (C-E) Dot plots (left) presented the rank for regulons according to the regulon specificity score (RSS) in each tumor subtype, and the tSNE plots revealed the specific tumor subtype (middle, red dots) and the top-ranked regulon (right, blue dots) with the highest specificity scores in corresponding tumor subtypes.
Figure 7.
(A) Heatmap of Regulon CSI matrix demonstrated the classification of TFs, which classified tumor cells into 3 groups (M1, M2, M3). (B) Dot plots showed average regulon activity scores of different tumor subtypes in each module and labeled the significantly related tumor subtypes. (C-E) Dot plots (left) presented the rank for regulons according to the regulon specificity score (RSS) in each tumor subtype, and the tSNE plots revealed the specific tumor subtype (middle, red dots) and the top-ranked regulon (right, blue dots) with the highest specificity scores in corresponding tumor subtypes.
Figure 8.
(A-B) Forest plots presented the results of p-value, Hazard ratio, and 95% confidence interval (CI) about genes in univariate and multivariate Cox analysis in the TCGA-STAD cohort. The p-value cut-off was p < 0.1 and p < 0.05 in the univariate and multivariate Cox analysis, respectively. The hazard ratio (HR) >1 was identified as the reverse effect on survival and <1 was regarded as the protective effect on survival. (C) Heatmap demonstrated the expression of 3 genes in high/low risk groups of TCGA-STAD, with the median of the risk score set as the cut-off. (D) ROC curves of survival outcomes indicated the predictive performance of the risk-score model. (E-F) KM survival plots showed the significant relationship between the overall survival (OS) and not only risk groups (E) but also the risk-score related genes (MUC16, RALA, PCBD1) (F). p < 0.05 was identified as statistically significant.
Figure 8.
(A-B) Forest plots presented the results of p-value, Hazard ratio, and 95% confidence interval (CI) about genes in univariate and multivariate Cox analysis in the TCGA-STAD cohort. The p-value cut-off was p < 0.1 and p < 0.05 in the univariate and multivariate Cox analysis, respectively. The hazard ratio (HR) >1 was identified as the reverse effect on survival and <1 was regarded as the protective effect on survival. (C) Heatmap demonstrated the expression of 3 genes in high/low risk groups of TCGA-STAD, with the median of the risk score set as the cut-off. (D) ROC curves of survival outcomes indicated the predictive performance of the risk-score model. (E-F) KM survival plots showed the significant relationship between the overall survival (OS) and not only risk groups (E) but also the risk-score related genes (MUC16, RALA, PCBD1) (F). p < 0.05 was identified as statistically significant.
Figure 9.
(A) The Heatmap presented the quantified immune infiltration of the TCGA-STAD cohort based on multiple immune-related genesets. (B-C) Heatmap and boxplots visualized the risk-score associated immunocheckpoints. p < 0.05 was identified as statistically significant. (D) The bar plot demonstrated the relative percentage of different immune cells according to the CIBERSORT analysis. (E) Boxplots indicated immune cells with different infiltration between high and low risk groups. (F-G) The bar plot and dot plots showed immune cells that were significantly correlated with risk scores (p < 0.05).
Figure 9.
(A) The Heatmap presented the quantified immune infiltration of the TCGA-STAD cohort based on multiple immune-related genesets. (B-C) Heatmap and boxplots visualized the risk-score associated immunocheckpoints. p < 0.05 was identified as statistically significant. (D) The bar plot demonstrated the relative percentage of different immune cells according to the CIBERSORT analysis. (E) Boxplots indicated immune cells with different infiltration between high and low risk groups. (F-G) The bar plot and dot plots showed immune cells that were significantly correlated with risk scores (p < 0.05).