Introduction
Lipids are macromolecules critical for the normal functioning of human tissue. In brain tissue, lipids account for more than half of the dry weight of the human brain. The building blocks of lipids—fatty acids—participate in multiple processes of brain tissue
1. While the role of lipid metabolism in normal brain function and neurological disease is well-known, emerging evidence suggests that altered lipid metabolism, including fatty acid synthesis (FAS), may contribute to carcinogenesis in the brain [
1,
2].
Glioblastoma (GBM), also known as the World Health Organization grade IV glioma, is the most common and deadly form of brain cancer [
1,
2,
3,
4,
5]. Most GBM patients receive maximum surgical removal of the tumor with concurrent radiation and temozolomide-based chemotherapy, both of which are invasive treatments with severe side effects
2,4. Despite rigorous therapeutics, the prognosis of GBM is dismal. The median survival of GBM is slightly over a year, and fewer than 1 in 10 patients would survive 5 years post-diagnosis [
2,
6].
GBM hosts a substantial amount of cancer stem cells with self-renewing and proliferative ability
4. Stemness, a phenomenon involving loss of the differentiated cellular features, makes a cell resemble its stem-cell progenitor
7. In cancer, such de-differentiated cells tend to be more aggressive, exhibiting a greater potential for proliferation and invasion of other tissues
7. Stem cells are defined by their ability to give rise to other cell types through the process of differentiation. Consequently, higher stemness in tumors introduces a more diverse reservoir of cell types, contributing to elevated heterogeneity, which makes the disease harder to treat [
4,
6,
7]. The progenitor capability of GBM stem cells also enables the persistence of the treatment-resistant cells
4. Clinically, stemness facilitates cancer progression and manifests in worse prognosis as well as therapeutic resistance, evidenced by most metastatic tumors displaying a de-differentiated state [
7].
Emerging evidence links FAS to brain tumor onset and progression
1. In GBM, a substantial amount of lipid is required to fuel the growth and proliferation of tumor cells
2,6. Recent evidence has pointed to altered FAS machinery as a candidate driver of GBM stemness. Experiments with a high-fat diet promoted proliferative and anti-apoptotic properties in brain tumor stem cells, while RNA interference targeting of FAS machinery repressed stem-cell features in GBM
1. Metabolomic profiling revealed that lipid metabolism levels could distinguish GBM stem and non-stem cell populations
6. Targeting FAS has since been proposed as a therapeutic strategy
1,2. Potential therapeutic targets include fatty acid synthase, a FAS workhorse, as well as sterol regulatory element binding protein 1 (
SREBF1), a major FAS regulator [
1,
2,
6].
Methylation of the fifth carbon position of DNA cytosine in the cytosine-phosphate-guanine (CpG) dinucleotide context, hereafter referred to as “DNA methylation”, plays a crucial role in regulating gene expression and many biological processes, including cellular differentiation and cell-identity maintenance8–10. The methylation landscape at the SREBF1 gene has been implicated in lipid metabolism alterations in diseases11. In type-2 diabetes, SREBF1 methylation is linked to insulin-resistance risk, thus permitting clinically relevant patient stratification12. However, the understanding of SREBF1 methylation in cancer remains limited. In GBM specifically, the role of SREBF1 methylation has not been reported. Given the role of FAS in brain carcinogenesis and the regulatory effect of SREBF1 in FAS, the epigenetics of SREBF1 deserves in-depth investigation.
The goal of this study is to investigate the biological and prognostic relevance of SREBF1 methylation in GBM tumors. In two independent, population-scale cohorts, SREBF1 methylation helps identify a GBM cluster exhibiting significantly higher cancer stemness and younger epigenetic age. The study subsequently reveals a link between SREBF1 methylation and genome-wide epigenetic landscapes at both global and single-nucleotide resolution. The more stem-cell-like cluster is enriched for distinct biological pathways and shows worse survival. Taken together, this study warrants further research into the biological and translational effects driven by the epigenetics of FAS machinery.
Materials and Methods
Study cohorts and samples. This study leveraged two independent, population-scale GBM cohorts with publicly available Illumina’s Infinium HumanMethylation450 BeadChip (450K) measured on biopsies or surgical specimens and relevant clinical profiles including survival: n=153 The Cancer Genome Atlas (TCGA)3 and n=157 Deutsches Krebsforschungszentrum (DKFZ), excluding n=24 non-GBM5. The methylation data for all cohorts were available as matrices of beta-values, which represent the proportion of methylated alleles in a bulk sample and follow a bimodal beta-distribution bound between 0 (unmethylated) and 1 (methylated)8,9.
O-6-Methylguanine-DNA Methyltransferase (MGMT) subtype estimation. MGMT promoter methylation status is a well-established prognostic biomarker and a known correlate to isocitrate dehydrogenase (IDH) mutation and the CpG island methylator phenotype (CIMP) in brain tumors2–5. Here, MGMT subtypes were determined by the mgmtstp27 machine-learning algorithm (R package v0.8). The input was the methylation M-values of CpG probes cg12434587 and cg1298113713 obtained by logit transformation of their beta-values8.
Biological sex inference for DKFZ. Sex can be reliably determined from the median intensities of sex-chromosome probes of methylation arrays [
8]. Here, the biological sex of the DKFZ samples, which were not available in the public data accession, was determined by applying the
minfi (R/Bioconductor packagev1.46.0)[
8] procedures to the intensity data files.
Additional filtering of DNA methylation data. CpGs tracking to sex chromosomes and annotated as non-specific or cross-reactive (curated list available at
https://github.com/sirselim/illumina450k_filtering)
14 were excluded. The CpGs located within
SREBF1 and
MGMT were also excluded. After filtering, 361,281 CpGs shared between the TCGA and DKFZ cohorts were kept for downstream analyses. Missing values were imputed by R/Bioconductor
impute v1.74.1 function
impute.knn with default
k=10.
SREBF1 methylation-based cohort stratification. Of the 42 SREBF1 CpG probes on the 450K array, 38 (90.5%) were present in the filtered datasets and used for GBM stratification. Each sample was assigned a cluster label L or R, the default clustering solution at the outermost tree-node split, by applying Recursively Partitioned Mixture Model (R package RPMM v1.25 function blcTree), an iterative unsupervised clustering algorithm designed for the beta-distributed methylation data15, independently for each cohort. The algorithm also returned a terminal solution with labels LL, LR, RL, and RR (Figure S1). Cluster assignment and sample ordering were determined with seriation (R package v1.5.7)16. Heat maps with tracking bars labeling cluster membership and CpG annotation were plotted with pheatmap v1.0.12.
Cancer stemness scores. Malta et al.7 developed mDNAsi, a methylation-based cancer stemness index using a 219-CpG signature panel. In the filtered cohorts, 212 (96.8%) CpGs were available and none were located within SREBF1 or MGMT genes. The raw mDNAsi values of all samples within each GBM cohort, m, were calculated as m = wT x, where vectors x and w are the beta-values and the corresponding signature weights, respectively. The final mDNAsi value in the k-th sample (mk’) was calculated by intra-cohort min-max transformation7.
Epigenetic aging. Epigenetic age is an indicator of the biological aging process and cellular differentiation potential and can be calculated by the Horvath method using a 353-CpG panel17. Here, epigenetic age was calculated by applying R/Bioconductor package wateRmelon v2.6.0 function agep9 to the 323 (91.5%) CpGs available, which were not located within SREBF1 or MGMT.
Deconvolution of tumor microenvironment. MDBrainT (R package v0.1.0) is a novel method that dissects the tumor microenvironment using a reference library of 25,913 CpGs from the 450K array. The approach is brain tumor-specific and outperforms several existing approaches18. In the filtered cohorts, 20,347 CpGs (78.5%) were present, excluding cg03641529 located within SREBF1. The proportions of tumor cells (i.e. tumor purity), pre-defined immune cells, and pre-defined neuronal-stromal cells were determined by applying the function TMEdeconvolute with reference annotations available as built-in R objects18.
Global epigenetic summaries. A background set of 47,224 CpGs was obtained by keeping only the sites with the greatest inter-sample variability in each cohort and excluding sites lacking a known-gene annotation. Global methylation was then summarized by the topmost dimensions from a principal component analysis (PCA), PC1 and PC2 (base R function prcomp). The raw components were then standard-normalized within each GBM cohort. The PCA groups, A and B, were assigned based on whether a sample was above or below an unbiased decision line, PC1 = PC2, as described previously19.
Differential methylation analysis at nucleotide resolution and pathway analysis. The aforementioned background set of 47,224 CpGs was subsequently analyzed for differential methylation adjusting for age, sex, and
MGMT subtypes with
limma (R/Bioconductor package v3.56.2):
where
Mk is the M-value of
j-th CpG in
k-th sample. All significant CpGs had a Bonferroni-corrected
P < 0.05, corresponding to a raw
P < 1.06e-6. Volcano plots summarizing differential methylation were implemented in
ggrepel (R package v0.9.6).
The gene set-based Gene Ontology (GO)
20 and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses
21 were implemented in
WebGestalt2024 (
https://www.webgestalt.org), which accepted a target and a background gene set as separate text files
22. Here, the former was the list of 904 genes associated with the 1,182 hypo-methylated CpGs detected in both cohorts, and the latter was the 12,761-gene list associated with the 47,224-CpG background set. The raw
P-values of the GO and KEGG terms were subject to Bonferroni correction.
Survival analysis. The overall survival models were implemented in survival (R package v3.5-5). The time-to-event profiles for overall survival were censored at the 5-year mark. Kaplan-Meier curves were drawn with wrapper function ggsurvplot (R package survminer v0.4.9). Multivariate Cox proportional hazard regression adjusting for age, sex, and MGMT subtypes was also implemented in survival. Hazard ratio (HR) estimates were determined by exponentiating the regression coefficients.
Statistical analyses. All analyses were implemented in R v4.3.1. A continuous variable between the SREBF1 clusters was tested in a univariate manner with a two-sided Welch’s t-test (base R function t.test). A follow-up test adjusting for potential confounders was performed with a Gaussian-linked generalized linear model (base R glm). The association between any two binary variables was tested with a two-sided Fisher’s exact test (base R fisher.test) reporting odds ratio (OR) estimates. Similarly, generalized linear models were used for follow-up analyses adjusting for potential confounders, except that the binomial link was used and the adjusted OR estimates were obtained by exponentiating the model coefficients. Visualizations for statistical graphics were implemented in ggplot2 v3.5.223.
Results
SREBF1 Methylation Clusters Display Distinct Genome-Wide Epigenetic Landscapes at the Global-Summary and Single-Nucleotide Resolution
Epigenetic landscapes contribute to the establishment and maintenance of cell identity by regulating the genes involved in determining cell fate10. Given the mounting evidence of differential stemness between SREBF1 methylation clusters, the genome-wide epigenetic landscapes were investigated next. First, the most variable 47,224 CpGs were summarized into a single data point on a two-dimensional feature space with PCA. Each tumor was then assigned to group “A” (above) or “B” (below) an unbiased linear boundary, PC1 = PC2 (Figure 3A). The PCA-based group assignment and SREBF1 cluster membership were strongly and significantly related (both cohorts’ OR > 7 and Fisher’s exact P < 5e-8; Figure 3A). This association remained strong and significant after adjusting for the potential confounders (both cohorts’ adjusted OR > 9 and P < 5e-7; Table S1D).
The finding in global epigenetics led to the hypothesis that each SREBF1 cluster would be undergoing distinct biological processes. To test this, individual differentially methylated CpGs were first identified from the 47,224-CpG background adjusting for age, sex, and MGMT subtypes. Nearly all significant CpGs (all Bonferroni P < 0.05) were hypo-methylated in Cluster R (Figure 3B and Table S2A-B). GO and KEGG enrichment analyses were then applied to the genes associated with the hypomethylated CpGs consistently detected in both cohorts. Table 2 lists the significant terms (all Bonferroni P < 0.05), which were predominantly neuron-specific biological processes, cellular components, molecular functions, and pathways, linked to the hypo-methylated gene sets in Cluster R.
SREBF1 Methylation May be a Biomarker for Patient Survival
Figure 4 shows the univariate Kaplan-Meier analysis of 5-year overall survival with respect to the SREBF1 clusters. Cluster R exhibited reduced median overall survival, although the reduction was statistically significant only in TCGA (log-rank P < 0.01). The Kaplan-Meier curves also suggested that the DKFZ survival data might be noisy. Table 3 lists the findings of the multivariate Cox proportional hazards regression. Adjusting for age, sex, and MGMT subtype, Cluster R cases were at a greater risk of death in both TCGA (HR = 1.7, 95% CI = 1.1-2.6) and DKFZ (HR = 1.3, 95% CI = 0.8-1.9).
Discussion
FAS is an emerging therapeutic area of research in the aggressive GBM tumor context. While the principal regulator of FAS, SREBF1, has been targeted experimentally, the understanding of the epigenetic mechanisms of SREBF1 remains limited. This study investigates the biological and clinical value of SREBF1 DNA methylation, the most well-studied epigenetic modification, offering evidence to support further investment into researching epigenetic modification of the FAS machinery. Importantly, the analyses of this study address the known confounders age and sex, as well as the MGMT subtype relevant in brain tumors.
The Recursively Partitioned Mixture Model applied to the methylation landscape of SREBF1 revealed two predominant clusters in the GBM cohorts. One cluster showed significantly higher stemness, a characteristic involving increased proliferative capacity, cellular heterogeneity, and aggressive cancers7. Supporting this observation, a younger epigenetic age, indicative of greater cellular differentiation potential17, coincided with elevated stemness. These findings are in line with an earlier study reporting the link between lipid metabolism alteration, which delineated stem cells from non-stem cells in GBM6.
Epigenetic landscapes help establish and maintain cell identity by modulating cellular differentiation programs10. This concept motivated an interrogation of the genome-wide epigenetic landscapes in relation to the SREBF1 clusters. At the global-summary level, epigenetic landscapes were strongly and significantly associated with SREBF1 cluster membership. At the single-nucleotide resolution, a substantial number of hypo-methylated CpGs were detected in the stemness-high cluster, hinting at fundamentally distinct molecular programs underlying the tumor clusters.
Tumor microenvironments are typically more complex in brain tumors of higher grades and poorer differentiation status4. The results here showed the high-stemness cluster exhibited increased proportions of neuronal and stromal cell types, consistent with existing knowledge. Biological processes and pathways, as well as molecular functions and cellular components, were linked to the consistently hypo-methylated genes. Since hypo-methylation is generally linked to increased expression, this finding implies that the tumor-specific genes might be overexpressed in the more stem cell-like GBM, which in turn, could harbor more complex tumor microenvironments consisting of heterogeneous neuronal cell types.
This study has limitations and observations that it cannot fully explain. First, the differences across the SREBF1 clusters appeared rather modest in the absence of a systematic clustering algorithm. This study overcame this by leveraging population-scale sample sizes and applying the Recursively Partitioned Mixture Model most well-suited to the beta-distributed methylation data. It is important to note that the clustering solution here might be challenging to replicate in less-empowered cohorts and susceptible to unstable output due to sample-size variations. Similarly, while the difference in tumor microenvironment composition was statistically significant in the two independent cohorts, whether the magnitude was biologically or clinically relevant remained unclear.
Second, prior research suggested that elevated heterogeneity in GBM tumors includes contributions from infiltrating immune cells7. Immune-related processes also play a key role in human tissues involved in lipid metabolism and storage14,25, and epigenetics-based immune repertoire is closely linked to metabolism-related phenotypes. Here, immune cell-type proportions showed few differences between the SREBF1 clusters despite varying neuronal-stromal composition detected. This discrepancy from the literature could be due to technical artifacts in the tumor deconvolution procedure, as evidenced by the heavily zero-inflated output of the inference algorithm. Currently, deconvolution methodologies in non-blood tissue, let alone in brain tumors, are still in their infancy. Developing robust deconvolution approaches should remain an ongoing research focus.
Although the potential confounders important in the GBM context were addressed, the findings here might be weakened by unmeasured and unknown confounders. Since metabolism was being investigated, body mass index and prior obesity diagnosis are examples that could affect the observed associations25. Similarly, chemotherapy and radiation treatment, which could impact epigenetic landscapes and survival outcomes, should also be considered in future studies, ideally through a prospective design with large sample sizes and complete data collection on survival and treatment19.
Another intriguing future direction is the landscape hydroxy-methylation, the epigenetic modification downstream of methylation and occasionally referred to as “de-methylation”, of the FAS machinery. Notably, hydroxy-methylation is preferentially enriched in the brain compared to other tissue types, and 5hmC exhibits pronounced alterations in brain tumors and subtype-driven heterogeneity26. Interrogating the relationship between FAS and hydroxy-methylation, a lesser-studied epigenetic mark, will help elucidate the bio-molecular process while identifying new opportunities for novel brain tumor therapeutics.
In summary, this study demonstrated consistent stratification of GBM tumors based on DNA methylation of SREBF1, a principal regulator of FAS. The SREBF1 methylation-based GBM stratification is characterized by varying levels of cancer stemness, tumor microenvironment composition, and genome-wide epigenetic landscapes at global and nucleotide resolution. The findings from this study support the potential clinical utility of SREBF1 methylation and warrant ongoing research in the epigenetics of FAS machinery.
Author Contributions
Y.C. curated the publicly available datasets, conducted data analysis, and wrote the manuscript.
Institutional Review Board (IRB) and Informed Consent
All samples and patients included in this study were previously published. The details regarding IRB, ethics approval, and informed consent can be found in each dataset’s respective publication.
Data Availability Statement
All datasets used in this study were open-access and freely available without special approval. The TCGA methylation and sample-level metadata were publicly available under Synapse (https://www.synapse.org) accession syn1910197
3. TCGA survival data was from the Pan-cancer Clinical Data Resource available in Liu et al
24. The methylation and clinical data of DKFZ were available under Gene Expression Omnibus (
https://www.ncbi.nlm.nih.gov/geo) accessions GSE103659
5. R code used in this study was deposited in GitHub at
https://github.com/ydavidchen/gbm_efas.
Acknowledgments
The author would like to thank the editorial team of MDPI Lipidology for providing guidance throughout the publication process and the anonymous peer reviewers for their scientific contributions.
Conflicts of Interest
The author declares no competing financial interests.
Abbreviations
450K, Illumina HumanMethylation450 BeadChip; CI, confidence interval; CpG, cytosine-phosphate-guanine; DKFZ, Deutsches Krebsforschungszentrum (cohort); FAS, fatty acid synthesis; GBM, glioblastoma; HR, hazard ratio; OR, odds ratio; SREBF1, gene encoding the steroid response element binding factor protein 1; TCGA, The Cancer Genome Atlas (cohort)
References
- Miska, J. & Chandel, N. S. Targeting fatty acid metabolism in glioblastoma. Journal of Clinical Investigation 133, (2023).
- Kou, Y., Geng, F. & Guo, D. Lipid Metabolism in Glioblastoma: from de novo Synthesis to Storage. Biomedicines 10, 1943 (2022).
- Brennan, C. W. et al. The Somatic Genomic Landscape of Glioblastoma. Cell 155, 462–477 (2013).
- Lathia, J. D., Mack, S. C., Mulkearns-Hubert, E. E., Valentim, C. L. L. & Rich, J. N. Cancer stem cells in glioblastoma. Genes Dev. 29, 1203–1217 (2015).
- Kickingereder, P. et al. Radiomic subtyping improves disease stratification beyond key molecular, clinical, and standard imaging characteristics in patients with glioblastoma. Neuro. Oncol. 20, 848–857 (2018).
- Shakya, S. et al. Altered lipid metabolism marks glioblastoma stem and non-stem cells in separate tumor niches. Acta Neuropathol. Commun. 9, 101 (2021).
- Malta, T. M. et al. Machine Learning Identifies Stemness Features Associated with Oncogenic Dedifferentiation. Cell 173, 338-354.e15 (2018).
- Aryee, M. J. et al. Minfi: A flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics 30, 1363–1369 (2014).
- Pidsley, R. et al. A data-driven approach to preprocessing Illumina 450K methylation array data. BMC Genomics 14, 293 (2013).
- Suelves, M., Carrió, E., Núñez-Álvarez, Y. & Peinado, M. A. DNA methylation dynamics in cellular commitment and differentiation. Brief. Funct. Genomics 15, 443–453 (2016).
- Moody, L. et al. Tissue-specific changes in Srebf1 and Srebf2 expression and DNA methylation with perinatal phthalate exposure. Environ. Epigenet. 5, dvz009 (2019).
- Krause, C. et al. Critical Evaluation of the DNA-Methylation Markers ABCG1 and SREBF1 for Type 2 Diabetes Stratification. Epigenomics 11, 885–897 (2019).
- Bady, P. et al. MGMT methylation analysis of glioblastoma on the Infinium methylation BeadChip identifies two distinct CpG regions associated with gene silencing and outcome, yielding a prediction model for comparisons across datasets, tumor grades, and CIMP-status. Acta Neuropathol. 124, 547–560 (2012).
- Benton, M. C. et al. An analysis of DNA methylation in human adipose tissue reveals differential modification of obesity genes before and after gastric bypass and weight loss. Genome Biol. 16, 8 (2015).
- Houseman, E. A. et al. DNA methylation arrays as surrogate measures of cell mixture distribution. BMC Bioinformatics 13, 86 (2012).
- Hahsler, M., Hornik, K. & Buchta, C. Getting Things in Order: An Introduction to the R Package seriation. J. Stat. Softw. 25, (2008).
- Horvath, S. DNA methylation age of human tissues and cell types. Genome Biol. 14, R115 (2013).
- Liu, F., Qian, J. & Ma, C. DNA Methylation-Based Cell Type Deconvolution Reveals the Distinct Cell Composition in Brain Tumor Microenvironment. arXiv (preprint) Preprint at https://doi.org/10.1101/2025.01.19.633794 (2025).
- Chen, Y. Reproducible and Generalizable Framework for Multi-class Hierarchical Classification Demonstrated by DNA Methylation-based Glioma Subtyping. International Journal for Cross-Disciplinary Subjects in Education 15, 4946–4950 (2024).
- Gene Ontology Consortium. The Gene Ontology Resource: 20 years and still GOing strong. Nucleic Acids Res. 47, D330–D338 (2019).
- Kanehisa, M., Furumichi, M., Tanabe, M., Sato, Y. & Morishima, K. KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. 45, D353–D361 (2017).
- Elizarraras, J. M. et al. WebGestalt 2024: faster gene set analysis and new support for metabolomics and multi-omics. Nucleic Acids Res. 52, W415–W421 (2024).
- Wickham H. GGPLOT2: Elegant Graphics for Data Analysis. (Springer-Verlag, New York, 2016).
- Liu, J. et al. An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics. Cell 173, 400-416.e11 (2018).
- Chen, Y. Transcriptomic profiling of subcutaneous adipose tissue in relation to bariatric surgery: a retrospective, pooled re-analysis. J. Obes. Metab. Syndr. 32, 98–02 (2023).
- Azizgolshani, N. et al. DNA 5-hydroxymethylcytosine in pediatric central nervous system tumors may impact tumor classification and is a positive prognostic marker. Clin. Epigenetics 13, 176 (2021).
|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).