Preprint
Article

This version is not peer-reviewed.

Bacterial Diversity, Metabolic Profiling and Application Potential of Antarctic Soil Metagenomes

A peer-reviewed article of this preprint also exists.

Submitted:

17 July 2024

Posted:

19 July 2024

You are already at the latest version

Abstract
Antarctica has attracted increasing interest in understanding its microbial communities, metabolic potential, and as a source of microbial hydrolytic enzymes with industrial applications, for which advances in next-generation sequencing technologies have greatly facilitated the study of unculturable microorganisms. In this work, soils from seven sub-Antarctic islands and Union Glacier were studied using a whole-genome shotgun metagenomic approach. The main findings were that the microbial community at all sites was predominantly composed of the bacterial phyla Actinobacteria and Cyanobacteria, and the families Streptomycetaceae and Pseudonocardiaceae. Regarding the xenobiotic biodegradation and metabolism pathway, genes associated with benzoate, chloroalkane, chloroalkene, and styrene degradation were predominant. In addition, putative genes encoding industrial enzymes with predicted structural properties associated with improved activity at low temperatures were found, with catalases and malto-oligosyltrehalose trehalohydrolase being the most abundant. Overall, our results show similarities between soils from different Antarctic sites with respect to more abundant bacteria and metabolic pathways, especially at higher classification levels, regardless of their geographic location. Furthermore, our results strengthen the potential of Antarctic soils as a source of industrially relevant enzymes.
Keywords: 
;  ;  ;  ;  

1. Introduction

The Antarctic continent has long been of interest to the scientific community due to its extreme environmental and geographic conditions, which make it one of the most inhospitable and isolated places on the planet. Most of the continent is covered by a permanent layer of ice and snow, and the 2% of ice-free regions encompass a variety of marine and terrestrial environments that support the vast majority of the continent’s biodiversity [1,2,3], with biomes with different biotic and abiotic factors hosting diverse microbial communities in their soils [3,4]. Initially, knowledge of microbial communities was based on culture-dependent studies in which most of the described bacteria belonged to the phyla Actinobacteria and Firmicutes and were represented by more than 70 genera, such as Arthrobacter, Micrococcus, Cellulomonas, Rhodococcus, and Flavobacterium [5,6]. In the case of yeasts, the top five genera described from these regions are Mrakia, Naganishia, Rhodotorula, Candida, and Leucosporidium [7]. It is well known that only a small fraction of environmental microorganisms can be cultured using current methods, especially those from extreme environments [8], and metagenomic approaches, driven by advances in next-generation sequencing (NGS) technologies, has emerged as a valuable tool for uncovering the intricate microbial diversity of the many microbiomes of Antarctica and other extreme environments [9,10,11]. Using these approaches, thousands of species belonging to the phyla Firmicutes, Proteobacteria, Flavobacteria, Sphingobacteria, Cytophaga, and Actinobacteria have been identified in different sites in Antarctica [12,13,14,15]. In general, fungal sequences are less abundant than bacterial sequences and have been detected more effectively when targeted by amplicon metagenomic sequencing via DNA metabarcoding in various studies [16,17,18,19,20,21]. In the case of shotgun metagenomics, in addition to taxonomic description, allows the the functional characterization of samples by identifying genes and mapping them to metabolic pathways, as the identification of genes related to the degradation and metabolism of aromatic and aliphatic hydrocarbons have been identified in soils from Deception and Livingston Islands [22].
Environmental microorganisms produce and secrete hydrolytic enzymes to utilize the carbon sources available in their environment. In the case of cold environments, these "cold-active" or "psychrophilic" enzymes have significant biotechnological potential in several industrial processes due to their ability to catalyze reactions at low temperatures [10,23,24,25,26]. Therefore, gene-mining of soil metagenomes provides an opportunity to explore and exploit the vast genetic reservoir and metabolic capabilities inherent in soil microbiomes and infer functional potential and metabolic networks from genome-encoded enzymes [27,28,29].
In this work, whole-genome shotgun (WGS) metagenomic sequencing was used to explore the microbial diversity and metabolic potential in Antarctic soil samples from five islands in the South Shetland Archipelago, two islands near the Antarctic Peninsula, and Union Glacier. In addition, the presence of putative genes encoding enzymes of interest for application in productive areas and their predicted protein structural properties were investigated.

2. Materials and Methods

  • Sampling sites
Soil samples from King George, Deception, Snow, Dee, Nelson, Litchfield, and Lagotellerie Islands and Union Glacier were collected during several expeditions to Antarctica as previously described [30,31,32,33]. These samples were placed in sterile 50 ml plastic tubes, sealed, and shipped at -20°C to the laboratory of the Universidad de Chile in Santiago, where they were stored at -80°C until further processing. For convenience, the following site abbreviations were used in figures and tables: King George Island Site 1A (KG1A), King George Island Site 1B (KG1B), King George Island Site 2 (KG2), Deception Island (Dee), Snow Island (Snow), Dee Island (Dee), Nelson Island (Nelson), Litchfield Island (Litch), Lagotellerie Island (Lago), and Union Glacier (UG).
  • DNA extraction, library preparation, and sequencing
DNA was extracted from soil samples from sites collected from nine islands in the subantarctic region and Union Glacier (Table S1) using the PowerSoil® DNA Isolation Kit (MO BIO Laboratories Inc., Carlsbad, CA, United States) according to the manufacturer’s instructions, except for the disruption step, which was performed using a Mini Beadbeater-16 cell disruptor (BioSpec Bartlesville, United States) instead of vortex agitation. The extracted DNA was quantified using a Qubit 4 fluorometer (Invitrogen, USA), and those that reached the quality and concentration required for NGS were selected and pooled according to the corresponding island (Table S1) and sent to Omics2View consulting (Kiel, Germany) for processing, sequencing, and informatics data analysis. Short insert shotgun sequencing libraries were prepared by shearing the DNA by sonication and agarose gel electrophoresis to retain fragments of the desired length. P5/P7 Illumina adapters and 8-nt index sequences were ligated to the DNA fragments. Pair-end (2X150bp) Illumina HiSeq X Ten sequencing was performed. Data were submitted to the National Center for Biotechnology Information, Bioproject PRJNA1038696.
  • Bioinformatic processing and data analysis
The quality of the demultiplexed reads was evaluated using FastQC v0.11.7 [34]. Read quality trimming was performed using the BBTools v38.45 package [35], which included removal of optical duplicates, human sequences, adapter sequences, low entropy reads, and trimming of bases with quality scores < 10, as well as reads with invalid or ambiguous bases and reads <127 base pairs (bp) in length. Read quality recalibration and error correction were performed using the BBTools v38.45 package [35] and aligned to a preliminary de novo assembly using Tadpole from a subset of the quality-trimmed reads, with information used to recalibrate the base quality of all quality-trimmed reads. Sequence errors were corrected by applying BBMerge, Clumpify, and Tadpole in error correction mode to the quality-recalibrated reads, and 31-bp Kmer reads were normalized with BBNorm to a target Kmer depth of 100x, with a minimum Kmer depth of 3x (filtered reads). Cross-assemblies were constructed using MEGAHIT v1.1.4-2-gd1998a1 [36,37], run with a preset ’meta-large’ and a minimum contig length of 1000 bp, and the generated scaffolds were subjected to an additional scaffolding step using SSPACE standard v3.0 [38] with default parameters (primary assembly). Filtered reads were back-mapped to the primary assembly using BBMap from the BBTools v38.45 package [35], and overall coverage was determined from the number of unambiguously mapped reads for each primary assembly contig and contigs with length ≥ 1,000 bp and coverage ≥ 10× were considered (filtered assembly). Assembly statistics were generated using code from the ’Assemblathon 2’ project [39] (Table S2). Read counts per sample were generated from alignments to filtered contigs using SAMtools v1.9 [40]. The length distribution of the generated contigs is shown in Figure S1.
  • Taxonomic assignment
Taxonomic assignment of the contigs was performed using Kraken v2.0.8 [41] from the BLAST+ v2.9.0 package [42,43]. Hierarchical taxonomic classification of the contigs followed the NCBI taxonomy [44].
  • Coding sequence prediction and functional annotation
The contig analysis was performed using Geneious Prime version 2023.2.1 [45], hereafter referred to as Geneious Prime for simplicity. To predict coding sequences (CDS), the Geneious Prime plugins were used, including Glimmer [46,47] with genetic code 11 for bacteria and Augustus [48,49] for eukaryotes, using the general training. In both cases, only CDS of at least 300 nt were considered for functional annotation using the KAAS-KEGG Automatic Annotation Server [50], using the default parameters and selecting the appropriate gene dataset for prokaryotes or eukaryotes as needed. The length distribution of the generated CDSs is shown in Figure S1. Predicted CDSs were translated using standard genetic code and compared against a local database constructed with sequences from 34 types of enzymes [51,52] and antifreeze and ice-binding proteins downloaded from the UniProt database (https://www.uniprot.org/) (98,153 sequences in total), using BlastP with the configuration "Bin in ’has hit’ vs. ’no hit’" in the Geneious Prime software. CDSs with hits were then compared with the Swiss-Prot database and annotated with a similarity threshold of ≥ 30% using the Blosum62 cost matrix. The putative CDSs encoding potential industrial enzymes were compared with the SWISS-MODEL server (https://swissmodel.expasy.org) and models were generated if a suitable orthologous template was found (coverage ≥50%, similarity ≥30%, Global Model Quality Estimate (GMQE) ≥0.8). The quality of the generated models was evaluated using VERIFY 3D [53,54] (https://saves.mbi.ucla.edu/), and those models that passed this evaluation were used to calculate parameters associated with protein structural flexibility, such as the number of hydrogen bonds, total solvent-accessible surface area (TotSAS), apolar solvent-accessible surface area (ApoSAS), number of salt bridges, flexibility, and secondary structures content [23,24,55,56]. The Chimera and ChimeraX software [57,58] were used to calculate TotSAS, ApoSAS, and the number of hydrogen bonds and salt bridges using the default parameters. The following parameters were used to estimate TotSAS and ApoSAS: radius = 1.4 Å, peak density = 2. To estimate hydrogen bonds and salt bridges the following settings were used: radius = 0.075 Å, dashes = 6, distance tolerance = 0.4 Å, angle tolerance = 20º. Secondary structure content was extracted from the PDB files using the 2struccompare metaserver (https://2struccompare.cryst.bbk.ac.uk/index.php). Protein flexibility was calculated using MEDUSA [59] as the percentage of residues in regions predicted to be rigid (0), flexible (1), and very flexible (2).
  • Statistical analyses and data processing
The Lilliefors, Kolmogorov-Smirnov, Anderson-Darling, and D’Agostino K-squared tests rejected the normality of the data. Statistical comparisons were made using the Kruskal-Wallis test followed by Dunn’s post hoc test (p < 0.05). Statistical analysis, Shannon index, and Bray-Curtis dissimilarity calculations were conducted in Python using the pandas and SciPy packages [60,61]. Graphs were generated using the Matplotlib, NumPy, and scikit-learn packages [62,63].

3. Results

  • Analysis of bacterial taxa in soils from different Antarctic sites
Taxonomic analysis was performed on bacteria, as the reads associated with this domain represented almost all reads in each site studied (at least 99.5%). A total of 33 bacterial phyla were identified, 20 of which were detected at all 10 sites (Figure 1 A). Actinobacteria was the most abundant phylum, accounting for 66% of all data, and was the most abundant phylum at eight sites, with higher percentages at Union Glacier (27%) and Deception Island (17%). Cyanobacteria and Proteobacteria were the second and third most abundant phyla, representing 13% and 12% of all data, respectively. Cyanobacteria was the predominant phylum at King George Island Site 1B (77%) and Lichfield Island (45%). The remaining phyla represent small percentages by location (lower 3%). In terms of bacterial families, a total of 320 bacterial families were found, 126 of which were identified at all sites (Figure 1B, Table S3). Five families represented 45% of all reads associated with bacteria: Streptomycetaceae, Pseudonocardiaceae, Oscillatoriaceae, uncl. Actinobacteria, and Intrasporangiaceae. By site, the most abundant families were Streptomycetaceae at Deception, King George Island Site 1A, and Lagotellerie Islands and Union Glacier, Oscillatoriaceae at Dee, King George Island Site 1B (which was particularly high at this site, 70%), Litchfield, Nelson, and Snow Islands, and Intrasporangiaceae at King George Island Site 2.
According to the calculated Shannon index (H), the biodiversity of bacterial phyla was similar at all sites, ranging from 2.7 to 2.8, with evenness (E) values ranging from 0.8 to 0.9 (Figure 1A). Similar results were obtained for bacterial families (H values ranging from 3.1 to 3.6 and E values from 0.5 to 0.6), with the exception of King George Island Site 1B where lower indices were obtained (H= 1.6, E= 0.3) (Figure 1B). Variability among sites was assessed by calculating Bray-Curtis dissimilarity for bacterial phyla and families, which was used for hierarchical clustering of sites, generating identical dendrograms at the two taxonomic levels (Figure 1 C,D). The way in which sites were grouped according to taxonomic data differs from hierarchical clustering based on geographic distance between sites (Figure 1E). For example, Deception Island, geographically closer to Snow, Dee, and Nelson Islands, was grouped with Union Glacier, the most distant site from the others. King George Island sites were grouped separately with other islands, King George Site 1A with Snow Island, and King George Site 2 with the more distant Lagotellerie Islands.
  • Metabolic potential of soils from different Antarctic sites
A total of 29% (40,780) of all predicted CDSs in the metagenomes were annotated and classified into general, intermediate, and specific pathways according to the Kyoto Encyclopedia of Genes and Genomes. The reads were associated with twenty-four intermediate metabolic pathways across all sites. Ten of these pathways accounted for 77% of all reads, and the most represented were carbohydrate metabolism, amino acid metabolism, and energy metabolism, with 41% of the total (Figure 2A). These three pathways were more abundant at Deception, King George Site 2, Litchfield Islands, and Union Glacier. A total of 233 specific metabolic pathways were found, 186 were detected in all sites, and 30 of them account for 62 % of the assigned reads (Figure 2B and Table S4). Biosynthesis of secondary metabolites and microbial metabolism in diverse environments consistently were the first and second most abundant metabolic pathways in the studied sites, except for Dee Island, where the second most abundant was oxidative phosphorylation. 13 of the 18 specific pathways of the category "xenobiotic biodegradation and metabolism" were identified at all sites, whereas dioxin degradation was identified at only five sites (Figure 2C). An 80% of all reads were associated with seven pathways, and the “drug metabolism - other enzymes” pathway was consistently the most abundant at all sites (10% to 40%). The second most abundant pathway was benzoate degradation in six islands and Union Glacier (14 % to 19 %), nitrotoluene degradation at King George site 1A (17%) and Snow Island (18%), and metabolism of xenobiotics by cytochrome P450 at King George site 1B (9%). The calculated Shannon and evenness indices for each metabolic level were similar among sites, with H ranging from 2.7 to 2.8 and E from 0.8 to 0.9 at the intermediate level, H ranging from 3.9 to 4.1 and E from 0.7 to 0.8 at the specific level, and H ranging from 2.1 to 2.4 and E from 0.7 to 0.8 for xenobiotic biodegradation and metabolism (Figure 2 A, B, and C). Hierarchical clustering of sites based on calculated Bray-Curtis dissimilarity yielded identical dendrograms for intermediate and specific pathways (Figure 2 D and E), which differed from the dendrogram generated by "Xenobiotic Biodegradation and Metabolism" (Figure 2 F). In the first two, Dee Island and King George Island site 1B were closely grouped, and in the third, Dee was grouped with Lagotellerie and Nelson Island, and George Island site 1B was grouped with George Island site 1A and Snow Island. Similar to the taxonomic analysis, this clustering is not consistent with the geographic distances of the sites studied.
  • Gene mining for putative enzymes with potential applications and protein structure analysis
A total of 1,244 CDSs encoding 71 types of enzymes with potential industrial applications were predicted in all metagenomes (Figure 3A). Twenty-eight putative enzymes were found in all sites, and 17 of them accounted for 70% of all associated reads. The percentage of reads associated with catalase enzymes was highest when data from all sites were combined, and was highest at six islands and Union Glacier. King George Island Site 1B, Nelson and Litchfield Islands had the highest percentages of reads associated with malto-oligosyltrehalose trehalohydrolase, glucose-6-phosphate isomerase, and 1,4-alpha-glucan branching enzyme. The top 10 putative enzymes with higher percentages considering all site data were, in addition to those mentioned above, trehalose synthase/amylase TreS, Lon protease, aspartate ammonia-lyase, alpha-maltose-1-phosphate synthase, extracellular metalloprotease, and trehalose synthase. CDSs encoding the antifreeze proteins were found at all sites except King George Site 1B, with higher percentages at Nelson and Deception Islands and Union Glacier.
To evaluate the putative enzymes for structural flexibility, the CDSs were translated in silico, compared with the SWISS-MODEL database, and 3D models were constructed, and quality checked when a suitable ortholog was found. In this way, 87 protein models were generated, and the parameters proposed as key factors for improved performance of cold-adapted enzymes at low temperatures, such as ionic-electrostatic interactions, solvent-accessible surface, hydrogen bonds, salt bridges, and the proportion of large amino acids, were calculated and compared with those calculated from the mesophilic ortholog. In the box plots of Figure 3B, some tendencies can be observed between the parameter values obtained from the Antarctic putative enzymes and their corresponding mesophilic orthologs. Antarctic enzymes tended to have a lower content of secondary structures, amino acids classified as rigid (by Medusa), and hydrogen bonds than mesophilic orthologs. The solvent-accessible surface area showed interesting results with a tendency for a higher total surface area for putative Antarctic enzymes than for mesophilic ones, which was significantly (p ≤ 0.05) higher when considering the apolar surface area. Regarding the percentage of amino acids, significant differences between Antarctic and mesophilic enzymes were found for eight amino acids. A higher percentage of A, P, and R were found in the Antarctic than in mesophilic enzymes, residues that are classified as small or very small, with the exception of R. On the other hand, the Antarctic enzymes had lower percentages of F, I, K, N, and Y than the mesophilic enzymes, residues that, with the exception of N, are classified as large or very large.

4. Discussion

The present study investigated the bacterial and metabolic diversity and potential as a source of industrial enzymes in the metagenomes of soils from five islands in the Shetland South Archipelago, two from the Antarctic Peninsula, and Union Glacier, Antarctica. The most abundant bacterial phyla were Actinobacteria, Cyanobacteria, Proteobacteria, uncl. Bacteria, and Bacteroidetes in most of the study sites. These results are consistent with previous studies in soils from different Antarctic regions that reported Proteobacteria, Actinobacteria, and Bacteroidetes as the most abundant phyla in soils from King George and Livingston Islands [64], Proteobacteria and Actinobacteria as the dominant phyla in soils from Mars Oasis on Alexander Island [13], and Cyanobacteria, Actinobacteria and Proteobacteria in biological soil crusts in regions of Enderby Land and Queen Maud Land (East Antarctica) [65]. In a shotgun metagenomic study of soils from Barton Peninsula (King George Island), Proteobacteria, Bacteroidetes, Acidobacteria (10th in this work), Chloroflexi (7th in this work), Planctomycetes (8th in this work), and Actinobacteria were reported as the predominant phyla [66]. At the bacterial family level, 320 families were identified, 126 of which were ubiquitous across all sites examined. Nearly half of the reads associated with bacterial families were represented by five families, the three most abundant being Streptomycetaceae, Pseudonocardiaceae, and Oscillatoriaceae. Considering the top ten bacterial families described in this work, seven of them have been described in previous metagenomic studies but with different abundances, the closest being Pseudonocardiaceae, Oscillatoriaceae, and Nocardioidaceae [67,68]. According to the Shannon and evenness indices, bacterial diversity at the phylum and family levels was similar among the sites studied here, except for King George Island site 1B, where lower values were calculated for bacterial families. This was reflected in the hierarchical clustering of the sites according to the Bray-Curtis dissimilarity calculated considering the abundance of the bacterial taxa, in which the sites were grouped in a way that was independent of their geographical location and the distance between them. Therefore, it can be concluded that bacterial diversity among the islands of the Shetland South Archipelago, the Antarctic Peninsula, and the Union Glacier sites studied here is rather homogeneous for the more abundant taxa, with differences in the less abundant taxa.
In terms of metabolic pathways, some homogeneity was observed across sites, with 24 intermediate and 186 specific pathways found at all, with variation in their abundance in each site. The most abundant intermediate pathways were carbohydrate metabolism, amino acid metabolism, and energy metabolism, and the most abundant specific pathways were biosynthesis of secondary metabolites, microbial metabolism in diverse environments, and biosynthesis of cofactors. Although the hierarchical clustering of sites by Bray-Curtis dissimilarity differed from that obtained using taxonomic data, in both cases sites were grouped independently of their geographic location. Interestingly, reads associated with 13 metabolic pathways of xenobiotic biodegradation and metabolism were identified at all sites. The three most abundant considering all sites studied were drug metabolism - other enzymes, benzoate degradation, and aminobenzoate degradation. As observed in the taxonomic analysis, similar Shannon and evenness indices were obtained in all sites for all metabolic levels analyzed. Previously, 125 catabolic genes associated with the degradation and metabolism of aromatic and aliphatic hydrocarbons were reported from Deception and Livingston Islands, and the authors suggested that the genetic potential for hydrocarbon degradation was due to the natural occurrence of hydrocarbons [22]. Another possibility is the existence of compounds classified as "natural xenobiotics" produced by some microorganisms and plants in the environment [69,70], but to the best of our knowledge, there are no reports on this topic in Antarctic environments. However, thanks to the development of sophisticated instrumentation, a class of xenobiotics called persistent organic pollutants (POPs) has been detected in Antarctic terrestrial and marine ecosystems from King George Island to McMurdo and Victoria Land [71,72,73].
Mining metagenomes for genes encoding enzymes with high activity at low temperatures and other properties suitable for industrial applications is a promising strategy for the heterologous production of novel enzymes in productive organisms such as the yeast S. cerevisiae. Of the 71 hydrolytic enzymes with applied potential detected in this study, 28 were found at all sites. In previous studies of soil metagenomes from Barton Peninsula (King George Island), 162 putative genes encoding carbohydrate-active enzymes (CAZy), including candidates for lignocellulolytic enzymes [66], and genes potentially encoding cold shock and antifreeze proteins were predicted in Enderby Land and Queen Maud biocrusts [65]. In a metagenomic library constructed using soil samples from the Pointe Géologie archipelago (Ile des Petrels), Terre Adélie, it was found recombinant E. coli clones with lipases/esterases, amylases, proteases, and cellulases activities were obtained from [74]. In terms of the most predicted enzymes in this study, catalase is one of the most important enzymatic mechanisms of antioxidant defense [75,76] and has been widely described in microorganisms, including some from Antarctica [77,78], with applications in bioremediation as an indicator of hydrocarbon degradation in soil and removal of H2O2 from bleaching industry effluents [79]. The enzyme malto-oligosyltrehalose trehalohydrolase functions in the synthesis of trehalose, a compound involved in the response to abiotic stress [80], and is used in the biotechnological production of trehalose [81]. The 1,4-alpha-glucan-branching enzyme is ubiquitous in different types of organisms and modifies the structure of starch and glycogen with several applications, including glycogen production [82].
In addition to finding putative genes encoding enzymes of interest, predicting enzyme performance at low temperatures is desirable because several putative genes encoding a particular enzyme may be found in a metagenome. Several factors have been described to improve the performance of cold-adapted enzymes at low temperatures, related to the structural flexibility of the protein [23,24,55,56]. Among these factors, the putative Antarctic enzymes studied here tended to have a lower content of secondary structures, amino acids classified as rigid (by Medusa), hydrogen bonds, and residues classified as large or very large, and a higher content of residues classified as small or very small. The more distinctive parameter was the apolar solvent-accessible surface area, which was significantly higher for putative Antarctic enzymes than for mesophilic counterparts. Therefore, determining these parameters provides relevant information for selecting putative enzymes with greater structural flexibility.

Supplementary Materials

The following supporting information can be downloaded at the website of this paper posted on Preprints.org. Supplementary Table S1. Soil samples used for DNA extraction by each site. Supplementary Table S2. Assembly statistics. Supplementary Table S3. Reads associated with bacterial families in each site. Supplementary Table S4. Reads associated to specific metabolic pathways in each site

Author Contributions

MB, JA, and VC planned the study and discussed the results. SB performed DNA extraction and analysis. MF and MB performed bioinformatics and biostatistics analysis. FG performed protein structural modeling and characterization. MF, MB, and JA wrote the manuscript.

Funding

This research was funded by Grant INACH RG_02-18 from Instituto Antartico Chileno and ENL03/22 from Vicerrectoria de Investigacion y Desarrollo, Universidad de Chile.

Data Availability Statement

RNA-seq data: National Center for Biotechnology Information, Bioproject: PRJNA1038696

Acknowledgments

Bioinformatic and statistical consulting was provided by omics2view.consulting GbR, Kiel (Germany)

Conflicts of Interest

The authors declare no conflicts of interest

References

  1. Cowan DA, Tow LA: Endangered Antarctic Environments. Annu Rev Microbiol 2004, 58, 649–690. [CrossRef]
  2. Terauds A, Chown SL, Morgan F, J. Peat H, Watts DJ, Keys H, Convey P, Bergstrom DM: Conservation biogeography of theAntarctic. Diversity and Distributions 2012, 18:726-741. [CrossRef]
  3. Lee JR, Raymond B, Bracegirdle TJ, Chadès I, Fuller RA, Shaw JD, Terauds A: Climate change drives expansion of Antarctic ice-free habitat. Nature 2017, 547:49-54. [CrossRef]
  4. Lambrechts S, Willems A, Tahon G: Uncovering the Uncultivated Majority in Antarctic Soils: Toward a Synergistic Approach. Front Microbiol 2019, 10:242. [CrossRef]
  5. Margesin R, Miteva V: Diversity and ecology of psychrophilic microorganisms. Res Microbiol 2011, 162:346-361. [CrossRef]
  6. Stan-Lotter H, Fendrihan S: Adaption of Microbial Life to Environmental Extremes Springer; 2017.
  7. Baeza M, Flores O, Alcaíno J, Cifuentes V: Yeast Thriving in Cold Terrestrial Habitats: Biodiversity and Industrial/Biotechnological Applications. In Fungi in Extreme Environments: Ecological Role and Biotechnological Significance Edited by Cham: Springer International Publishing; 2019:253-268.
  8. Solden L, Lloyd K, Wrighton K: The bright side of microbial dark matter: lessons learned from the uncultivated majority. Curr Opin Microbiol 2016, 31, 217–226. [CrossRef]
  9. Wang NF, Zhang T, Zhang F, Wang ET, He JF, Ding H, Zhang BT, Liu J, Ran XB, Zang JY: Diversity and structure of soil bacterial communities in the Fildes Region (maritime Antarctica) as revealed by 454 pyrosequencing. Front Microbiol 2015, 6:1188. [CrossRef]
  10. Baeza M, Alcaíno J, Cifuentes V, Turchetti B, Buzzini P: Cold-Active Enzymes from Cold-Adapted Yeasts. In Biotechnology of Yeasts and Filamentous Fungi Edited by Sibirny AA. Cham: Springer International Publishing; 2017:297-324.
  11. Feeser KL, Van Horn DJ, Buelow HN, Colman DR, McHugh TA, Okie JG, Schwartz E, Takacs-Vesbach CD: Local and Regional Scale Heterogeneity Drive Bacterial Community Diversity and Composition in a Polar Desert. Front Microbiol 2018, 9:1928. [CrossRef]
  12. Rogers SO, Shtarkman YM, Koçer ZA, Edgar R, Veerapaneni R, D’Elia T: Ecology of subglacial lake vostok (antarctica), based on metagenomic/metatranscriptomic analyses of accretion ice. Biology (Basel) 2013, 2:629-650. [CrossRef]
  13. Pearce DA, Newsham KK, Thorne MA, Calvo-Bado L, Krsek M, Laskaris P, Hodson A, Wellington EM: Metagenomic analysis of a southern maritime antarctic soil. Front Microbiol 2012, 3:403. [CrossRef]
  14. Lopatina A, Medvedeva S, Shmakov S, Logacheva MD, Krylenkov V, Severinov K: Metagenomic Analysis of Bacterial Communities of Antarctic Surface Snow. Front Microbiol 2016, 7:398. [CrossRef]
  15. Alves Junior N, Meirelles PM, de Oliveira Santos E, Dutilh B, Silva GG, Paranhos R, Cabral AS, Rezende C, Iida T, de Moura RL, Kruger RH, Pereira RC, Valle R, Sawabe T, Thompson C, Thompson F: Microbial community diversity and physical-chemical features of the Southwestern Atlantic Ocean. Arch Microbiol 2015, 197:165-179. [CrossRef]
  16. Ogaki MB, Câmara PEAS, Pinto OHB, Lirio JM, Coria SH, Vieira R, Carvalho-Silva M, Convey P, Rosa CA, Rosa LH: Diversity of fungal DNA in lake sediments on Vega Island, north-east Antarctic Peninsula assessed using DNA metabarcoding. Extremophiles 2021, 25:257-265. [CrossRef]
  17. Rosa LH, Ogaki MB, Lirio JM, Vieira R, Coria SH, Pinto OHB, Carvalho-Silva M, Convey P, Rosa CA, Câmara PEAS: Fungal diversity in a sediment core from climate change impacted Boeckella Lake, Hope Bay, north-eastern Antarctic Peninsula assessed using metabarcoding. Extremophiles 2022, 26:16. [CrossRef]
  18. de Souza LMD, Ogaki MB, Câmara PEAS, Pinto OHB, Convey P, Carvalho-Silva M, Rosa CA, Rosa LH: Assessment of fungal diversity present in lakes of Maritime Antarctica using DNA metabarcoding: a temporal microcosm experiment. Extremophiles 2021, 25:77-84. [CrossRef]
  19. da Silva TH, Câmara PEAS, Pinto OHB, Carvalho-Silva M, Oliveira FS, Convey P, Rosa CA, Rosa LH: Diversity of Fungi Present in Permafrost in the South Shetland Islands, Maritime Antarctic. Microb Ecol 2022, 83:58-67. [CrossRef]
  20. Rosa LH, Pinto OHB, Convey P, Carvalho-Silva M, Rosa CA, Câmara PEAS: DNA Metabarcoding to Assess the Diversity of Airborne Fungi Present over Keller Peninsula, King George Island, Antarctica. Microb Ecol 2021, 82:165-172. [CrossRef]
  21. Pudasaini S, Wilson J, Ji M, van Dorst J, Snape I, Palmer AS, Burns BP, Ferrari BC: Microbial Diversity of Browning Peninsula, Eastern Antarctica Revealed Using Molecular and Cultivation Methods. Front Microbiol 2017, 8:591. [CrossRef]
  22. Silva JB, Centurion VB, Duarte AWF, Galazzi RM, Arruda MAZ, Sartoratto A, Rosa LH, Oliveira VM: Unravelling the genetic potential for hydrocarbon degradation in the sediment microbiome of Antarctic islands. FEMS Microbiol Ecol 2022, 99:fiac143. [CrossRef]
  23. Gerday C: Fundamentals of Cold-Active Enzymes. In Cold-adapted Yeasts: Biodiversity, Adaptation Strategies and Biotechnological Significance Edited by Buzzini P, Margesin R. Springer Berlin Heidelberg: Springer; 2014:325-350.
  24. Parvizpour S, Hussin N, Shamsir MS, Razmara J: Psychrophilic enzymes: structural adaptation, pharmaceutical and industrial applications. Appl Microbiol Biotechnol 2021, 105:899-907. [CrossRef]
  25. Alcaíno J, Cifuentes V, Baeza M: Physiological adaptations of yeasts living in cold environments and their potential applications. World J Microbiol Biotechnol 2015, 31:1467-1473. [CrossRef]
  26. Feller G: Psychrophilic enzymes: from folding to function and biotechnology. Scientifica (Cairo) 2013, 2013:512840. [CrossRef]
  27. Liu S, Moon CD, Zheng N, Huws S, Zhao S, Wang J: Opportunities and challenges of using metagenomic data to bring uncultured microbes into cultivation. Microbiome 2022, 10:76. [CrossRef]
  28. Sharpton TJ: An introduction to the analysis of shotgun metagenomic data. Front Plant Sci 2014, 5:209. [CrossRef]
  29. Frioux C, Singh D, Korcsmaros T, Hildebrand F: From bag-of-genes to bag-of-genomes: metabolic modelling of communities in the era of metagenome-assembled genomes. Comput Struct Biotechnol J 2020, 18:1722-1734. [CrossRef]
  30. Carrasco M, Rozas JM, Barahona S, Alcaíno J, Cifuentes V, Baeza M: Diversity and extracellular enzymatic activities of yeasts isolated from King George Island, the sub-Antarctic region. BMC Microbiol 2012, 12:251. [CrossRef]
  31. Barahona S, Yuivar Y, Socias G, Alcaíno J, Cifuentes V, Baeza M: Identification and characterization of yeasts isolated from sedimentary rocks of Union Glacier at the Antarctica. Extremophiles 2016, 20:479-491. [CrossRef]
  32. Troncoso E, Barahona S, Carrasco M, Villarreal P, Alcaíno J, Cifuentes V, Baeza M: Identification and characterization of yeasts isolated from the South Shetland Islands and the Antarctic Peninsula. Polar Biol 2017, 40:649-658. [CrossRef]
  33. Baeza M, Barahona S, Alcaíno J, Cifuentes V: Amplicon-Metagenomic Analysis of Fungi from Antarctic Terrestrial Habitats. Front Microbiol 2017, 8:35. [CrossRef]
  34. FastQC: A quality control tool for high throughput sequence data. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/.
  35. fastqc (accessed XXX). [http://www.bioinformatics.babraham.ac.uk/projects/fastqc].
  36. BBTools. http://jgi.doe.gov/data-and-tools/bbtools/. [http://jgi.doe.gov/data-and-tools/bbtools/.
  37. Li D, Luo R, Liu CM, Leung CM, Ting HF, Sadakane K, Yamashita H, Lam TW: MEGAHIT v1.0: A fast and scalable metagenome assembler driven by advanced methodologies and community practices. Methods 2016, 102:3-11. [CrossRef]
  38. Li D, Liu CM, Luo R, Sadakane K, Lam TW: MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics 2015, 31:1674-1676. [CrossRef]
  39. Boetzer M, Henkel CV, Jansen HJ, Butler D, Pirovano W: Scaffolding pre-assembled contigs using SSPACE. Bioinformatics 2011, 27:578-579. [CrossRef]
  40. Bradnam KR, Fass JN, Alexandrov A, Baranay P, Bechner M, Birol I, Boisvert S, Chapman JA, Chapuis G, Chikhi R, Chitsaz H, Chou WC, Corbeil J, Del Fabbro C, Docking TR, Durbin R, Earl D, Emrich S, Fedotov P, Fonseca NA, Ganapathy G, Gibbs RA, Gnerre S, Godzaridis E, Goldstein S, Haimel M, Hall G, Haussler D, Hiatt JB, Ho IY, Howard J, Hunt M, Jackman SD, Jaffe DB, Jarvis ED, Jiang H, Kazakov S, Kersey PJ, Kitzman JO, Knight JR, Koren S, Lam TW, Lavenier D, Laviolette F, Li Y, Li Z, Liu B, Liu Y, Luo R, Maccallum I, Macmanes MD, Maillet N, Melnikov S, Naquin D, Ning Z, Otto TD, Paten B, Paulo OS, Phillippy AM, Pina-Martins F, Place M, Przybylski D, Qin X, Qu C, Ribeiro FJ, Richards S, Rokhsar DS, Ruby JG, Scalabrin S, Schatz MC, Schwartz DC, Sergushichev A, Sharpe T, Shaw TI, Shendure J, Shi Y, Simpson JT, Song H, Tsarev F, Vezzi F, Vicedomini R, Vieira BM, Wang J, Worley KC, Yin S, Yiu SM, Yuan J, Zhang G, Zhang H, Zhou S, Korf IF: Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species. Gigascience 2013, 2:10. [CrossRef]
  41. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 GPDPS: The Sequence Alignment/Map format and SAMtools. Bioinformatics 2009, 25:2078-2079. [CrossRef]
  42. Wood DE, Salzberg SL: Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol 2014, 15:R46. [CrossRef]
  43. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol 1990, 215:403-410. [CrossRef]
  44. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL: BLAST+: architecture and applications. BMC Bioinformatics 2009, 10:421. [CrossRef]
  45. Federhen S: The NCBI Taxonomy database. Nucleic Acids Res 2012, 40:D136-D143. [CrossRef]
  46. Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, Thierer T, Ashton B, Meintjes P, Drummond A: Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 2012, 28:1647-1649. [CrossRef]
  47. Delcher AL, Bratke KA, Powers EC, Salzberg SL: Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 2007, 23:673-679. [CrossRef]
  48. Delcher AL, Harmon D, Kasif S, White O, Salzberg SL: Improved microbial gene identification with GLIMMER. Nucleic Acids Res 1999, 27:4636-4641.
  49. Stanke M, Tzvetkova A, Morgenstern B: AUGUSTUS at EGASP: using EST, protein and genomic alignments for improved gene prediction in the human genome. Genome Biol 2006, 7 Suppl 1:S11.1-8. [CrossRef]
  50. Stanke M, Morgenstern B: AUGUSTUS: a web server for gene prediction in eukaryotes that allows user-defined constraints. Nucleic Acids Res 2005, 33:W465-7. [CrossRef]
  51. Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M: KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res 2007, 35:W182-5. [CrossRef]
  52. Mesbah NM: Industrial Biotechnology Based on Enzymes From Extreme Environments. Front Bioeng Biotechnol 2022, 10:870083. [CrossRef]
  53. Polaina J, MacCabe AP: Dordrecht: Springer; 2007.
  54. Bowie JU, Lüthy R, Eisenberg D: A method to identify protein sequences that fold into a known three-dimensional structure. Science 1991, 253:164-170.
  55. Lüthy R, Bowie JU, Eisenberg D: Assessment of protein models with three-dimensional profiles. Nature 1992, 356:83-85.
  56. Collins T, Feller G: Psychrophilic enzymes: strategies for cold-adaptation. Essays Biochem 2023, 67:701-713. [CrossRef]
  57. Liu Y, Jia K, Chen H, Wang Z, Zhao W, Zhu L: Cold-adapted enzymes: mechanisms, engineering and biotechnological application. Bioprocess Biosyst Eng 2023, 46:1399-1410. [CrossRef]
  58. Pettersen EF, Goddard TD, Huang CC, Couch GS, Greenblatt DM, Meng EC, Ferrin TE: UCSF Chimera--a visualization system for exploratory research and analysis. J Comput Chem 2004, 25:1605-1612. [CrossRef]
  59. Goddard TD, Huang CC, Meng EC, Pettersen EF, Couch GS, Morris JH, Ferrin TE: UCSF ChimeraX: Meeting modern challenges in visualization and analysis. Protein Sci 2018, 27:14-25. [CrossRef]
  60. Vander Meersche Y, Cretin G, de Brevern AG, Gelly JC, Galochkina T: MEDUSA: Prediction of Protein Flexibility from Sequence. J Mol Biol 2021, 433:166882. [CrossRef]
  61. McKinney W: Data structures for statistical computing in Python. 2010, SciPy 445:51-56.
  62. Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, Burovski E, Peterson P, Weckesser W, Bright J, van der Walt SJ, Brett M, Wilson J, Millman KJ, Mayorov N, Nelson ARJ, Jones E, Kern R, Larson E, Carey CJ, Polat İ, Feng Y, Moore EW, VanderPlas J, Laxalde D, Perktold J, Cimrman R, Henriksen I, Quintero EA, Harris CR, Archibald AM, Ribeiro AH, Pedregosa F, van Mulbregt P, SciPy C: SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat Methods 2020, 17:261-272. [CrossRef]
  63. Hunter JD: Matplotlib: A 2D graphics environment. Computing in science & engineering 2007, 9:90-95.
  64. Michel V, Gramfort A, Varoquaux G, Eger E, Keribin C, Thirion B: A supervised clustering approach for fMRI-based inference of brain states. Pattern Recognition 2012, 45:2041-2049. [CrossRef]
  65. Ramírez-Fernández L, Orellana LH, Johnston ER, Konstantinidis KT, Orlando J: Diversity of microbial communities and genes involved in nitrous oxide emissions in Antarctic soils impacted by marine animals as revealed by metagenomics and 100 metagenome-assembled genomes. Sci Total Environ 2021, 788:147693. [CrossRef]
  66. Pushkareva E, Elster J, Becker B: Metagenomic Analysis of Antarctic Biocrusts Unveils a Rich Range of Cold-Shock Proteins. Microorganisms 2023, 11:1932. [CrossRef]
  67. Oh HN, Park D, Seong HJ, Kim D, Sul WJ: Antarctic tundra soil metagenome as useful natural resources of cold-active lignocelluolytic enzymes. J Microbiol 2019, 57:865-873. [CrossRef]
  68. Varliero G, Lebre PH, Adams B, Chown SL, Convey P, Dennis PG, Fan D, Ferrari B, Frey B, Hogg ID, Hopkins DW, Kong W, Makhalanyane T, Matcher G, Newsham KK, Stevens MI, Weigh KV, Cowan DA: Biogeographic survey of soil bacterial communities across Antarctica. Microbiome 2024, 12:9. [CrossRef]
  69. Becker B, Pushkareva E: Metagenomics Provides a Deeper Assessment of the Diversity of Bacterial Communities in Polar Soils Than Metabarcoding. Genes (Basel) 2023, 14:812. [CrossRef]
  70. Štefanac T, Grgas D, Landeka Dragičević T: Xenobiotics-Division and Methods of Detection: A Review. J Xenobiot 2021, 11:130-141. [CrossRef]
  71. Berlin, Heidelberg: Springer Berlin Heidelberg; 2012.
  72. Bhardwaj L, Chauhan A, Ranjan A, Jindal T: Persistent Organic Pollutants in Biotic and Abiotic Components of Antarctic Pristine Environment. Earth Systems and Environment 2018, 2:35-54. [CrossRef]
  73. Kallenborn R, Reiersen L-O, Olseng CD: Long-term atmospheric monitoring of persistent organic pollutants (POPs) in the Arctic: a versatile tool for regulators and environmental science studies. Atmospheric Pollution Research 2012, 3:485-493.
  74. Anzano J, Abás E, Marina-Montes C, del Valle J, Galán-Madruga D, Laguna M, Cabredo S, Pérez-Arribas L-V, Cáceres J, Anwar J: A Review of Atmospheric Aerosols in Antarctica: From Characterization to Data Processing. Atmosphere 2022, 13:1621. [CrossRef]
  75. Berlemont R, Pipers D, Delsaute M, Angiono F, Feller G, Galleni M, Power P: Exploring the Antarctic soil metagenome as a source of novel cold-adapted enzymes and genetic mobile elements. Rev Argent Microbiol 2011, 43:94-103.
  76. Staerck C, Gastebois A, Vandeputte P, Calenda A, Larcher G, Gillmann L, Papon N, Bouchara JP, Fleury MJJ: Microbial antioxidant defense enzymes. Microb Pathog 2017, 110:56-65. [CrossRef]
  77. Baker A, Lin CC, Lett C, Karpinska B, Wright MH, Foyer CH: Catalase: A critical node in the regulation of cell fate. Free Radic Biol Med 2023, 199:56-66. [CrossRef]
  78. Koleva Z, Abrashev R, Angelova M, Stoyancheva G, Spassova B, Yovchevska L, Dishliyska V, Miteva-Staleva J, Krumova E: A Novel Extracellular Catalase Produced by the Antarctic Filamentous Fungus Penicillium rubens III11-2. Fermentation 2024, 10:58. [CrossRef]
  79. Krumova E, Abrashev R, Dishliyska V, Stoyancheva G, Kostadinova N, Miteva-Staleva J, Spasova B, Angelova M: Cold-active catalase from the psychrotolerant fungus Penicillium griseofulvum. J Basic Microbiol 2021, 61:782-794. [CrossRef]
  80. Kaushal J, Mehandia S, Singh G, Raina A, Arya SK: Catalase enzyme: Application in bioremediation and food industry. Biocatalysis and Agricultural Biotechnology 2018, 16:192-199. [CrossRef]
  81. Iordachescu M, Imai R: Trehalose biosynthesis in response to abiotic stresses. J Integr Plant Biol 2008, 50:1223-1229. [CrossRef]
  82. Cai X, Seitl I, Mu W, Zhang T, Stressler T, Fischer L, Jiang B: Biotechnical production of trehalose through the trehalose synthase pathway: current status and future prospects. Appl Microbiol Biotechnol 2018, 102:2965-2976. [CrossRef]
  83. Ban X, Dhoble AS, Li C, Gu Z, Hong Y, Cheng L, Holler TP, Kaustubh B, Li Z: Bacterial 1,4-α-glucan branching enzymes: characteristics, preparation and commercial applications. Crit Rev Biotechnol 2020, 40:380-396. [CrossRef]
Figure 1. Distribution and abundance of different bacterial taxa at Antarctic sites. The percentage of each taxon was calculated at each site (circle size) and relative to all sites (circle color), at the level of phyla (A) and families (B, only the 35 most abundant taxa are shown). The calculated Shannon index (H) and evenness (E) are shown in the bar plots. Hierarchical clustering of sites according to the Bray-Curtis dissimilarity values calculated for bacterial phyla (C), bacterial family (D), and distance in kilometers between sites (E).
Figure 1. Distribution and abundance of different bacterial taxa at Antarctic sites. The percentage of each taxon was calculated at each site (circle size) and relative to all sites (circle color), at the level of phyla (A) and families (B, only the 35 most abundant taxa are shown). The calculated Shannon index (H) and evenness (E) are shown in the bar plots. Hierarchical clustering of sites according to the Bray-Curtis dissimilarity values calculated for bacterial phyla (C), bacterial family (D), and distance in kilometers between sites (E).
Preprints 112529 g001
Figure 2. Metabolic characterization of the Antarctic sites. The percentage of each pathway or bacterial genus was calculated at each site (circle size) and relative to all sites (circle color). A) Intermediate level pathways; B) Specific level pathways (only the 30 most abundant are shown); C) Xenobiotic biodegradation and metabolism pathways. The calculated Shannon index (H) and evenness (E) are shown in the bar plots. Hierarchical clustering of sites according to the Bray-Curtis dissimilarity values calculated for Intermediate level pathways (D), Specific level pathways (E), Xenobiotic biodegradation and metabolism pathways (F).
Figure 2. Metabolic characterization of the Antarctic sites. The percentage of each pathway or bacterial genus was calculated at each site (circle size) and relative to all sites (circle color). A) Intermediate level pathways; B) Specific level pathways (only the 30 most abundant are shown); C) Xenobiotic biodegradation and metabolism pathways. The calculated Shannon index (H) and evenness (E) are shown in the bar plots. Hierarchical clustering of sites according to the Bray-Curtis dissimilarity values calculated for Intermediate level pathways (D), Specific level pathways (E), Xenobiotic biodegradation and metabolism pathways (F).
Preprints 112529 g002
Figure 3. Enzymes with potential industrial applications and structural comparisons. A) Percentage of CDS potentially encoding each enzyme at each site (circle size) and relative to all sites (circle color). B) Distribution of calculated values for structural parameters from translated and modeled putative enzymes: alpha helix (Helix), beta strand (Beta), unstructured (O), residues classified as rigid (Med0), residues classified as flexible (Med1), residues classified as very flexible (Med2), total solvent-accessible surface area (TotSAS), apolar solvent-accessible surface area (ApoSAS), total hydrogen bonds (TotHbond), and salt bridges (Salt). C) Distribution of amino acid percentages (one-letter code). Ant: putative enzymes found in metagenomes in this work and Mes: mesophilic enzyme orthologs. Significant differences between Ant and Mes were found in the parameters indicated with *.
Figure 3. Enzymes with potential industrial applications and structural comparisons. A) Percentage of CDS potentially encoding each enzyme at each site (circle size) and relative to all sites (circle color). B) Distribution of calculated values for structural parameters from translated and modeled putative enzymes: alpha helix (Helix), beta strand (Beta), unstructured (O), residues classified as rigid (Med0), residues classified as flexible (Med1), residues classified as very flexible (Med2), total solvent-accessible surface area (TotSAS), apolar solvent-accessible surface area (ApoSAS), total hydrogen bonds (TotHbond), and salt bridges (Salt). C) Distribution of amino acid percentages (one-letter code). Ant: putative enzymes found in metagenomes in this work and Mes: mesophilic enzyme orthologs. Significant differences between Ant and Mes were found in the parameters indicated with *.
Preprints 112529 g003
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2025 MDPI (Basel, Switzerland) unless otherwise stated