4. Discussion
The genome mining of Brevibacillus laterosporus SA14 reveals a remarkably rich biosynthetic landscape, characterized by 20 biosynthetic regions comprising 29 predicted protoclusters, indicating extensive metabolic potential for secondary metabolite production. This level of biosynthetic density is consistent with the emerging view that environmental bacteria often encode substantially more biosynthetic capacity than is expressed under standard laboratory conditions (Medema & Fischbach, 2015; Blin et al., 2021). The dominance of NRPS-containing clusters further supports the established role of non-ribosomal peptide systems as major contributors to bioactive chemical diversity (Fischbach & Walsh, 2006).
Beyond overall abundance, SA14 exhibits a marked diversification in biosynthetic architecture. In addition to canonical single-class clusters, the genome contains multiple hybrid NRPS–PKS and NRPS–T1PKS–terpene systems. This indicates a shift from simple modular biosynthesis toward more integrated enzymatic architectures, distinguishing SA14 from many previously characterized Brevibacillus strains that typically display less frequent hybridization of biosynthetic systems.
Importantly, this architectural complexity suggests that SA14 does not merely expand biosynthetic quantity, but reorganizes biosynthetic logic at the genomic level. Such a pattern implies increased potential for chemical diversification through the coupling of peptide, polyketide, and terpene biosynthetic pathways within shared genetic loci.
Beyond the overall abundance of biosynthetic gene clusters, the genome of Brevibacillus laterosporus SA14 exhibits substantial diversity in biosynthetic architecture. In addition to canonical NRPS, PKS, and terpene-associated clusters, multiple hybrid biosynthetic regions were identified, including NRPS–PKS and NRPS–T1PKS–terpene-associated systems. The coexistence of multiple biosynthetic classes within single genomic loci suggests a complex organization of secondary metabolism and highlights the potential for the generation of structurally diverse metabolites.
Among the identified biosynthetic regions, Region 8 represents the largest predicted biosynthetic locus, spanning approximately 133.5 kb and containing more than 60 coding sequences. AntiSMASH classified this region as an interleaved hybrid biosynthetic gene cluster containing NRPS-, T1PKS-, and terpene-associated biosynthetic features. Such hybrid architectures have increasingly been recognized as important contributors to microbial biosynthetic diversity because they combine distinct biosynthetic logics within a single genetic framework (Fischbach & Walsh, 2006; Rutledge & Challis, 2015; Chevrette et al., 2020).
Comparison against the MIBiG database did not identify a closely matching characterized biosynthetic gene cluster, suggesting that Region 8 may represent a previously uncharacterized biosynthetic architecture. However, genomic predictions alone are insufficient to determine metabolite structures, biological activities, or pathway functionality. Consequently, experimental validation will be necessary to establish the biochemical products encoded by this region.
Within the species context, this positions SA14 as a distinct biosynthetic outlier characterized by increased modular integration. Rather than existing as isolated biosynthetic units, NRPS, PKS, and terpene-associated pathways appear to be more frequently co-localized and interwoven, suggesting an evolutionary trajectory favoring biosynthetic connectivity rather than independent pathway expansion.
Such genomic organization is consistent with evolutionary mechanisms including horizontal gene transfer, domain shuffling, and recombination-driven assembly of biosynthetic modules, all of which are recognized drivers of microbial secondary metabolite diversification (Fischbach et al., 2008; Chevrette et al., 2020).
In addition to this structural integration, several biosynthetic gene clusters identified in SA14 exceed 80–100 kb in length and show low similarity to entries in the MIBiG repository. This suggests that a substantial fraction of the biosynthetic repertoire may represent uncharacterized or novel natural product scaffolds, expanding the chemical space associated with the genus Brevibacillus (Kautsar et al., 2020; Navarro-Muñoz et al., 2020).
Among all identified loci, Region 8 represents the most prominent biosynthetic feature. This 133.5 kb cluster contains more than 60 predicted coding sequences and integrates NRPS, T1PKS, and terpene-associated genes within a single contiguous genomic region. Unlike typical single-pathway clusters, Region 8 exhibits a composite architecture in which multiple biosynthetic systems are physically co-localized, suggesting coordinated metabolic potential.
The internal structure of Region 8 further supports this interpretation. Core biosynthetic genes are accompanied by regulatory elements, transport systems, and accessory enzymes, indicating that the cluster functions as an integrated biosynthetic unit rather than a fragmented assembly of genes. The presence of multiple NRPS modules alongside a type I PKS and terpene cyclase supports the possibility of coordinated biosynthetic processing across chemically distinct pathways.
A notable feature of this locus is the integration of terpene biosynthetic capacity within a predominantly NRPS–PKS framework. Although bacterial terpene biosynthesis was historically underappreciated, recent genomic studies have demonstrated that terpene synthases are widely distributed and functionally diverse in bacterial systems (Yamada et al., 2015). In this context, terpene-derived moieties may contribute structural rigidity, hydrophobic interactions, or ecological signaling functions that enhance the overall chemical performance of hybrid metabolites.
From a functional perspective, Region 8 appears to represent a tightly coordinated biosynthetic system. The presence of regulatory proteins, ABC/MFS transporters, and multiple tailoring enzymes suggests that biosynthesis is coupled with transcriptional regulation and metabolite export, ensuring metabolic efficiency and cellular protection (Medema et al., 2011; Kautsar et al., 2020).
Nevertheless, it is important to emphasize that all biosynthetic predictions remain computational. While antiSMASH provides high-confidence identification of biosynthetic gene clusters, it does not confirm metabolite production or chemical structures (Blin et al., 2021). Furthermore, many biosynthetic gene clusters remain transcriptionally silent under standard laboratory conditions, limiting direct correlation between genomic potential and metabolomic output (Rutledge & Challis, 2015; Covington et al., 2021). Therefore, Region 8 should be regarded as a high-confidence candidate locus requiring experimental validation.
To bridge this gap between prediction and function, integrative experimental strategies such as transcriptomics and metabolomics will be essential. These approaches may reveal environmental conditions that activate silent biosynthetic clusters and enable detection of corresponding metabolites. In addition, structural prediction tools such as AlphaFold provide opportunities to refine enzymatic function hypotheses at near-atomic resolution, particularly for modular NRPS and PKS systems (Jumper et al., 2021; Varadi et al., 2022).
From an evolutionary perspective, the biosynthetic architecture observed in SA14 is consistent with models of microbial secondary metabolite diversification driven by gene duplication, horizontal gene transfer, and domain recombination. These processes collectively enable the continuous generation of novel biosynthetic configurations, expanding microbial chemical space (Fischbach et al., 2008; Chevrette et al., 2020). In this framework, Region 8 may represent a long-term evolutionary fusion event in which distinct biosynthetic systems have become integrated into a unified functional platform.
Overall, B. laterosporus SA14 represents a metabolically versatile and biosynthetically enriched organism with substantial potential for natural product discovery. The presence of a large hybrid NRPS–T1PKS–terpene cluster expands the known biosynthetic repertoire of the genus and highlights the utility of genome mining for uncovering previously inaccessible chemical space. While functional and structural validation remains necessary, the genomic evidence strongly supports SA14 as a promising source of structurally novel and biologically relevant secondary metabolites.
4.1. Hypothetical Biosynthetic Model of the Hybrid NRPS–T1PKS–Terpene Cluster
Region 8 of B. laterosporus SA14 spans 133.5 kb and contains more than 60 predicted coding sequences. Based on antiSMASH annotation, the region is predicted to contain NRPS, type I PKS (T1PKS), and terpene-associated biosynthetic features. The genomic organization of these biosynthetic genes suggests potential interactions among multiple biosynthetic systems within a single contiguous locus. However, the proposed biosynthetic model should be regarded as a computational prediction and requires experimental validation through transcriptomic, proteomic, and metabolomic analyses.
Candidate NRPS genes (e.g., BRLA_RS11395, BRLA_RS11410, BRLA_RS11525–11535) encode canonical adenylation (A), condensation (C), and thiolation (T) domains that are predicted to initiate biosynthesis through sequential amino acid activation and peptide chain assembly. This early peptide scaffold likely serves as the structural backbone onto which additional chemical complexity is introduced. The interspersed organization of NRPS modules further supports a modular, iterative assembly-line mechanism consistent with classical non-ribosomal peptide biosynthesis (Fischbach & Walsh, 2006; Medema et al., 2011).
Downstream of the NRPS core, the T1PKS gene BRLA_RS11400 encodes a multidomain enzyme containing ketosynthase (KS), acyltransferase (AT), and ketoreductase (KR) domains. We hypothesize that polyketide chain extension occurs either through direct interaction with the NRPS-derived intermediate or via a hybrid NRPS–PKS interface, supported by the presence of NRPS/PKS fusion features in BRLA_RS11475. This suggests a metabolically coupled elongation process in which peptide and polyketide chemistries are integrated rather than sequentially separated (Cane et al., 1998; Fischbach & Walsh, 2006).
In parallel, the terpene-associated gene BRLA_RS11365, encoding a squalene–hopene cyclase, is predicted to generate cyclic terpenoid scaffolds. Although traditionally considered an independent pathway, in the context of Region 8 it is plausible that terpene-derived moieties function as structural appendages or hydrophobic anchors, either introduced during early prenylation events or appended during late-stage tailoring reactions (Yamada et al., 2015). Collectively, these observations suggest that peptide, polyketide, and terpene biosynthesis may converge into a single hybrid molecular architecture.
NRPS enzymes are predicted to initiate biosynthesis by activating amino acid substrates through adenylation domains and assembling peptide intermediates through condensation and thiolation cycles (Fischbach & Walsh, 2006).
The T1PKS enzyme BRLA_RS11400 is predicted to participate in chain extension or modification reactions through KS-, AT-, and KR-associated catalytic activities. The presence of a putative NRPS/PKS hybrid enzyme (BRLA_RS11475) suggests possible interactions between peptide- and polyketide-associated biosynthetic modules. The presence of a hybrid NRPS/PKS enzyme (BRLA_RS11475) suggests direct crosstalk between peptide and polyketide modules, enabling formation of chimeric peptide–polyketide backbones (Cane et al., 1998).
BRLA_RS11365 is annotated as a terpene-associated cyclase and may participate in the biosynthesis of terpene-derived structural components within the cluster. The precise relationship between terpene biosynthesis and the NRPS–PKS-associated modules remains to be experimentally determined. This step introduces hydrophobic or rigid ring structures, potentially enhancing membrane interaction or biological activity (Yamada et al., 2015).
Accessory enzymes including oxidoreductases, methyltransferases, and cyclases (
Table 3) are expected to introduce post-assembly modifications such as oxidation, methylation, and cyclization. These reactions likely diversify the final metabolite pool and enhance structural complexity (Rutledge & Challis, 2015).
The presence of MFS and ABC transporters, together with Crp/Fnr-family regulators, indicates that biosynthesis is coupled to export and transcriptional regulation. This ensures metabolite secretion and prevents intracellular toxicity, consistent with fully functional BGC architectures (Kautsar et al., 2020).
The integrated activity of NRPS, T1PKS, and terpene modules, followed by tailoring reactions, is hypothesized to produce a structurally complex hybrid metabolite featuring peptide, polyketide, and terpenoid elements. Such molecular architectures are typically associated with enhanced chemical novelty and multifunctional bioactivity (Chevrette et al., 2020; Rudolf et al., 2021).
Although this mechanistic model is supported by genomic evidence derived from cluster organization, functional gene annotation, and domain architecture analyses (
Table 2,
Table 3 and
Table 4 and
Figure 2,
Figure 3 and
Figure 4), it remains a predictive framework requiring experimental validation, as genome mining alone cannot establish metabolite production, pathway activity, or molecular structure (Rutledge & Challis, 2015; Blin et al., 2021). Integrative approaches combining transcriptomics, proteomics, and metabolomics will therefore be necessary to verify pathway expression and identify the associated metabolites. Furthermore, structural prediction tools such as AlphaFold may improve our understanding of enzyme architecture and substrate specificity, thereby facilitating rational pathway engineering and heterologous expression studies (Jumper et al., 2021; Varadi et al., 2022).
4.2. Biosynthetic Significance of Region 8
The identification of Region 8 as a 133.5-kb hybrid NRPS–T1PKS–terpene biosynthetic gene cluster highlights the remarkable biosynthetic complexity encoded within the genome of Brevibacillus laterosporus SA14. Among the 20 biosynthetic regions identified, Region 8 represents the largest predicted biosynthetic locus and is distinguished by the coexistence of NRPS-, T1PKS-, and terpene-associated biosynthetic features within a single contiguous genomic region. Such integrated architectures are increasingly recognized as important drivers of natural-product diversification because they combine distinct biosynthetic mechanisms capable of generating structurally complex metabolites (Chevrette et al., 2020; Rudolf et al., 2021).
The organization of Region 8 further suggests a high degree of biosynthetic integration. The presence of multiple core biosynthetic genes together with regulatory elements, transport-associated proteins, and accessory enzymes indicates that the cluster may function as a coordinated biosynthetic unit rather than a collection of independent pathways. Similar large hybrid biosynthetic systems have been associated with expanded chemical diversity and increased opportunities for evolutionary innovation through the recombination of peptide, polyketide, and terpenoid biosynthetic strategies (Fischbach & Walsh, 2006; Chevrette et al., 2020).
Notably, comparison against the MIBiG database did not reveal a closely matching characterized biosynthetic gene cluster, suggesting that Region 8 may represent a previously uncharacterized biosynthetic architecture. Although the metabolite(s) encoded by this region remain unknown, its genomic organization, large size, and hybrid composition collectively identify Region 8 as a promising target for future transcriptomic, metabolomic, and functional investigations aimed at discovering novel secondary metabolites and elucidating their biosynthetic pathways (Kautsar et al., 2020; Blin et al., 2021).
4.3. Ecological and Functional Implications
The extensive repertoire of biosynthetic gene clusters identified in B. laterosporus SA14 may reflect adaptation to complex microbial environments where competition, resource acquisition, and ecological interactions influence microbial fitness. The predominance of NRPS-containing clusters together with multiple hybrid biosynthetic systems suggests substantial genetic capacity for the production of structurally diverse secondary metabolites, many of which may contribute to ecological functions such as microbial interactions and environmental adaptation (Fischbach & Walsh, 2006; Chevrette et al., 2020).
Region 8 is of particular interest because it integrates NRPS-, T1PKS-, and terpene-associated biosynthetic features within a single genomic locus. The coexistence of these biosynthetic systems may provide opportunities for the generation of structurally complex metabolites through coordinated biosynthetic processes. Similar hybrid biosynthetic architectures have been proposed to contribute to chemical diversification and ecological versatility in microorganisms (Rudolf et al., 2021; Chevrette et al., 2020).
The presence of transporter genes, regulatory elements, and accessory enzymes within Region 8 further suggests that the cluster may function as an integrated biosynthetic unit. Such auxiliary components are commonly associated with pathway regulation, metabolite export, and cellular protection mechanisms in secondary metabolite biosynthetic gene clusters (Kautsar et al., 2020; Medema et al., 2011). However, the biological roles and ecological significance of metabolites potentially encoded by Region 8 remain speculative and require experimental verification.
Taken together, these observations suggest that the hybrid biosynthetic organization observed in SA14 may represent an evolutionary strategy for expanding biosynthetic diversity and enhancing ecological adaptability. Future transcriptomic, metabolomic, and functional studies will be necessary to determine the ecological roles and biological activities of metabolites associated with this biosynthetic region.
4.4. Biotechnological and Drug Discovery Potential
From an applied perspective, the identification of the hybrid NRPS–T1PKS–terpene biosynthetic gene cluster in B. laterosporus SA14 highlights the potential value of this strain as a source of biosynthetic diversity for natural product research. Hybrid biosynthetic systems are increasingly recognized as important reservoirs of chemical innovation because they combine distinct biosynthetic mechanisms capable of generating structurally diverse secondary metabolites (Rutledge & Challis, 2015; Rudolf et al., 2021). The integration of multiple biosynthetic features within Region 8, together with associated regulatory and transporter elements, identifies this locus as a promising target for future functional characterization, heterologous expression, and synthetic biology-based investigations (Medema & Fischbach, 2015).
In addition, the combination of genome mining, biosynthetic pathway reconstruction, and structural prediction approaches provides a useful framework for exploring enzyme function and biosynthetic organization. Structural modeling tools such as AlphaFold may contribute to improved understanding of enzyme architecture and domain organization, thereby supporting future studies of pathway engineering and heterologous production systems (Jumper et al., 2021; Varadi et al., 2022).
The apparent novelty of Region 8, reflected by the absence of a closely matching characterized biosynthetic gene cluster in the MIBiG database, further increases its relevance for natural product discovery. Although the chemical products encoded by this region remain unknown, its large size, hybrid organization, and extensive biosynthetic repertoire suggest that it may represent a valuable target for future metabolomic and biochemical investigations. Collectively, these findings highlight the utility of genome mining as a strategy for identifying previously unexplored biosynthetic resources and prioritizing candidate gene clusters for downstream experimental validation (Medema & Fischbach, 2015; Rudolf et al., 2021).
This study was based exclusively on computational genome mining and biosynthetic gene cluster prediction. Consequently, the presence of a biosynthetic gene cluster does not necessarily indicate active expression, metabolite production, or biological activity under laboratory conditions. Furthermore, biosynthetic functions were inferred from domain architecture and sequence-based annotations generated by antiSMASH and therefore remain predictive. Experimental validation using transcriptomic, proteomic, metabolomic, biochemical, and genetic approaches will be necessary to confirm pathway activity and characterize the metabolites associated with Region 8.