Submitted:
30 April 2024
Posted:
01 May 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
- How do overtime, economic, CO2 related industries, educational levels and population dynamics interacted to influence the short-term trend change in CO2 emissions across diverse countries having diverse characteristics with respect of the factors mentioned?
- Which insight in term of identification and quantification of the temporal dynamics and influence of these factors can machine learning techniques highlight to deepen one’s understanding on this change overtime on a global scale?
2. Materials and Methods
2.1. Data Collection
2.2. Data Preparation
2.3. Machine Learning Algorithms
2.4. Metrics
- ○
- For the selection of the best performing algorithm:
- ●
- ▪
- ○
- For final evaluation:
- ●
- Regression: Mean squared error, Residuals, R-squared
- ▪
- Classification: Accuracy score, Mathhew correlation coefficient, Confusion matrix and classification report.
2.5. Explainable Machine Learning Techniques
2.5.1. Partial Dependence Plots (PDPs):
2.5.2. Sensitivity Analysis
2.5.3. Feature Analysis
2.6. Research Design
3. Results
3.1. Regression Analysis



3.2. Classification Analysis
3.2.1. Grouping Target in Classes
| Confusion matrix | Precision | Recall | F1-score | Support | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| High negative | 12 | 0 | 0 | 0 | 0 | 0 | 0.71 | 1.00 | 0.83 | 12 |
| High positive | 0 | 27 | 0 | 0 | 0 | 0 | 1.00 | 1.00 | 1.00 | 27 |
| Low negative | 0 | 0 | 15 | 0 | 1 | 0 | 1.00 | 0.94 | 0.97 | 16 |
| Low positive | 0 | 0 | 0 | 23 | 0 | 0 | 0.96 | 1.00 | 0.98 | 23 |
| Moderate negative | 5 | 0 | 0 | 0 | 28 | 0 | 0.97 | 0.85 | 0.90 | 33 |
| Moderate positive | 0 | 0 | 0 | 1 | 0 | 18 | 1.00 | 0.95 | 0.97 | 19 |
| High negative | High positive | Low negative | Low positive | Moderate negative | Moderate positive | |||||
3.3. Feature Analysis
3.3.1. Feature Importance Analysis


3.3.2. Sensitivity Analysis
3.3.3. Correlation Analysis

3.4. Discussion
| Features | High negative | High positive | Moderate negative | Moderate positive | Low negative | Low positive |
| Population (number) | 4.81E+08 | 8.07E+08 | 7.18E+07 | 7.91E+07 | 6.74E+07 | 6.74E+07 |
| CO2 from coal | 1.45E+09 | 1.76E+09 | 1.15E+08 | 1.24E+08 | 1.04E+08 | 8.21E+07 |
| Mean-3y | -7.31E+07 | 1.07E+08 | -1.07E+07 | 1.06E+07 | -3.02E+06 | 4.76E+06 |
| GDP | 9.99E+12 | 4.27E+12 | 1.96E+12 | 9.79E+11 | 1.72E+11 | 1.38E+11 |
| CO2 from oil | 1.58E+09 | 8.85E+08 | 2.12E+08 | 1.71E+08 | 2.43E+07 | 2.31E+07 |
| Population-Education: Incomplete Primary | 2.04E+07 | 4.32E+07 | 2.10E+06 | 5.82E+06 | 3.90E+06 | 3.94E+06 |
| Contribution to temperature rise | 0.173045 | 0.098758 | 0.029083 | 0.022507 | 0.010212 | 0.008529 |
- a.
- High Negative: Countries in this group have a high average population and GDP, a high CO2 emission from both coal and oil. Despite their high GDP and CO2 emissions, these countries have seen a decrease in CO2 emissions over time. However, they also have a high contribution to temperature rise.
- b.
- High Positive: Countries in this group also have a high average population, a slightly lower GDP compared to the High Negative group and they have seen an increase in CO2 emissions over time. Surprisingly, they have a lower contribution to temperature rise compared to the High Negative group. This could be due to their past contribution.
- c.
- Low Negative: Countries in this group have a lower average population and GDP compared to the High groups. They have lower CO2 emissions from both coal and oil and have seen a decrease in CO2 emissions over time. Finally, they contribute less to temperature rise compared to the High groups.
- d.
- Low Positive: Countries in this group have a similar average population to the Low Negative group. They have a lower GDP compared to the Low Negative group but they have seen an increase in CO2 emissions over time. They also contribute less to temperature rise compared to the High groups.
- e.
- Moderate Negative: Countries in this group have a moderate average population and GDP. They have moderate CO2 emissions from both coal and oil, and they have seen a decrease in CO2 emissions over time. They contributed moderately to temperature rise compared to the High groups.
- f.
- Moderate Positive: Countries in this group have a similar average population to the Moderate Negative group. They have a lower GDP compared to the Moderate Negative group and they have seen an increase in CO2 emissions over time. They have a lower contribution to temperature rise compared to the Moderate Negative group.
- ●
- A comparison of the two categories, High negative and High positive, suggests that:
- ●
- Population (number): The average population is higher in the High positive group (approximately 807 million) compared to the High negative group (approximately 481 million). This suggests that countries with larger populations tend to have increasing CO2 emissions.
- ●
- CO2 emissions from Coal: Both groups have high CO2 emissions from coal, but the High positive group has slightly higher emissions on average (approximately 1.76 billion tonnes) compared to the High negative group (approximately 1.45 billion tonnes).
- ●
- CO2 emissions from Oil: The High negative group has higher CO2 emissions from oil (approximately 1.58 billion tonnes) compared to the High positive group (approximately 885 million tonnes).
- ●
- 3-Year Mean Change in CO2 emissions (Mean-3y): The High negative group shows a decrease in CO2 emissions over time (average change of -73 million tonnes), while the High positive group shows an increase (average change of 107 million tonnes).
- ●
- GDP: is higher on average in the High negative group (approximately 9.99 trillion USD) compared to the High positive group (approximately 4.27 trillion USD). This suggests that wealthier countries tend to have decreasing CO2 emissions.
- ●
- Population with Incomplete Primary Education: The High positive group has a higher average population with incomplete primary education (approximately 43.2 million) compared to the High negative group (approximately 20.4 million).
- ●
- Contribution to Temperature Rise: The High negative group has a higher average contribution to temperature rise (0.173) compared to the High positive group (0.099).
- Population: The average population is approximately the same in both groups (approximately 67 million). This suggests that population size does not significantly differentiate these two groups.
- CO2 emissions from Coal: The Low negative group has slightly higher CO2 emissions from coal on average (approximately 104 million tonnes) compared to the Low positive group (approximately 82 million tonnes).
- 3-Year Mean Change in CO2 emissions (Mean-3y): The Low negative group shows a decrease in CO2 emissions over time (average change of -3 million tonnes), while the Low positive group shows an increase (average change of 4.8 million tonnes).
- GDP: The GDP is slightly higher on average in the Low negative group (approximately 171 billion USD) compared to the Low positive group (approximately 138 billion USD).
- CO2 emissions from Oil: The Low negative group has slightly higher CO2 emissions from oil (approximately 24 million tonnes) compared to the Low positive group (approximately 23 million tonnes).
- Population with Incomplete Primary Education: The Low positive group has a slightly higher average population with incomplete primary education (approximately 3.94 million) compared to the Low negative group (approximately 3.90 million).
- Contribution to Temperature Rise: The Low negative group has a slightly higher average contribution to temperature rise (0.0102) compared to the Low positive group (0.0085).
- Population: The average population is slightly higher in the Moderate positive group (approximately 79 million) compared to the Moderate negative group (approximately 72 million).
- CO2 emissions from Coal: The Moderate positive group has slightly higher CO2 emissions from coal on average (approximately 124 million tonnes) compared to the Moderate negative group (approximately 115 million tonnes).
- 3-Year Mean Change in CO2 emissions (Mean-3y): The Moderate negative group shows a decrease in CO2 emissions over time (average change of -10.7 million tonnes), while the Moderate positive group shows an increase (average change of 10.6 million tonnes).
- GDP: The GDP is significantly higher on average in the Moderate negative group (approximately 1.96 trillion USD) compared to the Moderate positive group (approximately 979 billion USD).
- CO2 Emissions from Oil: The Moderate negative group has higher CO2 emissions from oil (approximately 212 million tonnes) compared to the Moderate positive group (approximately 171 million tonnes).
- Population with Incomplete Primary Education: The Moderate positive group has a higher average population with incomplete primary education (approximately 5.82 million) compared to the Moderate negative group (approximately 2.10 million).
- Contribution to Temperature Rise: The Moderate negative group has a higher average contribution to temperature rise (0.029) compared to the Moderate positive group (0.0225).
3.5. Policy Implication
- Invest in early childhood education: The analysis suggests a link between early education and economic growth, potentially leading to higher CO2 emissions later. By investing in early education, countries might be able to foster more sustainable development practices alongside economic growth.
- Education focused on environmental awareness: Curriculum reform that emphasizes environmental issues and sustainability could encourage responsible behavior and potentially lower future emissions.
- Family planning and economic incentives: The analysis suggests a complex relationship between population growth and economy. When a country reaches a certain level of population and pollution, policies that encourage smaller families, coupled with economic incentives, could help manage population growth while maintaining economic stability.
- Focus beyond just CO2 emissions: Since the contribution to temperature rise might not solely depend on CO2 emissions, a broader approach to emissions monitoring and mitigation strategies might be necessary.
- Tailored policies for different countries: The analysis suggests population growth affects economies differently. Policymakers might consider more targeted approaches to address CO2 emissions based on each country's specific circumstances.
- Long-term vs. Short-term: The analysis highlights short-term variations in CO2 emissions. Policies should consider both short-term and long-term strategies for sustainable development.
4. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A. Features Description and Rationale
| N° | Variables | Source | Unit | Description | Rationale | Source | Period considered |
| 1 | Year-on-year change in CO2 emissions | [42] | Tonnes | Absolute annual change in carbon dioxide emissions | Target of the analysis | Global Carbon Budget, 2023) | 1960 to 2022 |
| 2 | Annual Greenhouse gas emissions by world region | [43] | Tonnes | Emissions, cumulative emissions and the global mean surface temperature response by country, gas (CO2, Ch4, N2 O or GHG) and source emissions (fossil, land use) | Regional greenhouse gas emission will certainly affect the neighboring countries level of CO2 emissions. | [44] | 1960 to 2021 |
| 3 | Annual temperature anomalies | [45] | Celsius | The deviation of a specific month’s average surface temperature | Fluctuations of the temperature can be informative about the change in CO2 emissions | [46] | 1960 to 2023 |
| 4 | Annual emissions of carbon dioxide (CO₂) from flaring | [47] | Tonnes | Annual emissions of carbon dioxide (CO₂) from flaring based on territorial emissions (excluding traded goods and international aviation) | The amount of excess of oil or gas burned during their production can explain changes in CO2 emissions | [48] | 1960 to 2022 |
| 5 | Annual emissions of carbon dioxide (CO₂) from cement | [47] | Tonnes | Annual emissions of carbon dioxide (CO₂) from cements based on territorial emissions (excluding traded goods and international aviation) | The production of concrete is an important source of CO2 emissions. | [49] | 1960 to 2022 |
| 6 | Annual emissions of carbon dioxide (CO₂) from gas | [47] | Tonnes | Annual emissions of carbon dioxide (CO₂) from gas based on territorial emissions (excluding traded goods and international aviation) | The production of gas releases a significant amount of CO2 | [8,48] | 1960 to 2022 |
| 7 | Annual emissions of carbon dioxide (CO₂) from oil | [47] | Tonnes | Annual emissions of carbon dioxide (CO₂) from oil based on territorial emissions (excluding traded goods and international aviation) | The production of oil is directly linked to CO2 emissions, thus, affecting its change. | [8,48] | 1960 to 2022 |
| 8 | Annual emissions of carbon dioxide (CO₂) from coal | [47] | Tonnes | Annual emissions of carbon dioxide (CO₂) from coal based on territorial emissions (excluding traded goods and international aviation) | The production of coal represents a major source of CO2 emissions. | [8] | 1960 to 2022 |
| 9 | Cumulative CO2 emissions | [50] | Tonnes | Sum of CO2 emissions produced from fossil fuels and industry | The total amount of CO2 emissions accumulated during a period can significantly affect the change of CO2 emissions on a yearly based period. | [51] | 1960 to 2022 |
| 10 | Annual CO2 emissions growth | [50] | Percentage | Annual percentage growth of total emissions of CO2 excluding land use usage | CO2 emissions growth is an important indicator to explain changes in CO2 emissions | [8] | 1960 to 2022 |
| 11 | Share of Cumulative CO2 emissions | [52] | Tonnes | Cumulative CO2 emissions measured as a percentage of global total cumulative emissions of CO2 | Understanding which country contributes the most is an important indicator of change in CO2 emissions. | [53] | 1960 to 2022 |
| 12 | Contribution to the global mean surface temperature rise | [54] | Celsius | Each country’s contribution to global surface mean temperature rise from cumulative CO2, Ch4, N2 O | This factor can indirectly explain variations of CO2 emissions | [55] | 1960 to 2021 |
| 13 | Population growth rate | [56] | Percentage | Average exponential growth of the population over a given period | The increased concentration of population generally results in many activities like urbanization, deforestation, etc., which have the potential to influence the level of CO2 emissions | [8,57] | 1960 to 2021 |
| 14 | Population (number) | [58] | Number | Population by country | Idem | [8,57] | 1960 to 2022 |
| 15 | Population with no education | [59] | Number | Educational attainment | Education plays a significant role in reducing the vulnerability of a society, and can increase awareness to pollution | [60] | 1960 to 2022 |
| 16 | Population with primary education | [59] | Number | Educational attainment | Idem | Idem | 1960 to 2022 |
| 17 | Population with incomplete primary education | [59] | Number | Educational attainment | Idem | Idem | 1960 to 2022 |
| 18 | Population with Secondary education | [59] | Number | Educational attainment | Idem | Idem | 1960 to 2022 |
| 19 | Population with upper secondary education | [58] | Number | Educational attainment | Idem | Idem | 1960 to 2022 |
| 20 | Population with lower secondary education | [59] | Number | Educational attainment | Idem | Idem | 1960 to 2022 |
| 21 | Population under 15 | [59] | Number | Educational attainment | Idem | Idem | 1960 to 2022 |
| 22 | Global Domestic Product (GDP) | [61] | US dollar | Sum of gross value added by all resident producers in the economy plus any product taxes and minus any subsidies not included in the value of the products | There is a certain correlation between the prosperity of country and its level of CO2 emissions. | 1960 to 2022 | |
| 23 | Global Domestic Product per capita | [62] | US dollar | GDP divided by midyear population | Idem | [63] [12] | 1960 to 2022 |
| 24 | Change in GDP | [64] | Percentage | Annual percentage growth of GDP at market prices based on constant local currency. | Idem | [63] [12] | 1960 to 2022 |
Appendix B. Statistic Description of the Dataset
| Annual CO2 emissions | GHG emissions by world region | Temperature anomaly | CO2 from flaring | CO2 from cement | CO2 from gas | CO2 from oil | CO2 from coal | Cumulative CO2 emissions | |
| count | 630.00 | 630.00 | 630.00 | 630.00 | 630.00 | 630.00 | 630.00 | 630.00 | 630.00 |
| mean | 1180717000.00 | 1804513000.00 | 0.29 | 7744875.00 | 39100780.00 | 155176800.00 | 377323700.00 | 590266200.00 | 49326940000.00 |
| std | 2067422000.00 | 2450693000.00 | 0.34 | 13920800.00 | 120124900.00 | 346134300.00 | 645444900.00 | 1255671000.00 | 86020780000.00 |
| min | 1647474.00 | 51732420.00 | -0.31 | 0.00 | 50771.00 | 0.00 | 688832.00 | 0.00 | 33411650.00 |
| 25% | 119760600.00 | 428377400.00 | -0.02 | 0.00 | 3499390.00 | 493724.80 | 34982420.00 | 21795300.00 | 2533510000.00 |
| 50% | 394471300.00 | 669718000.00 | 0.25 | 2121456.00 | 8233956.00 | 19968290.00 | 165657100.00 | 135809200.00 | 13698860000.00 |
| 75% | 634479800.00 | 2143402000.00 | 0.58 | 6483802.00 | 24980980.00 | 90109130.00 | 268966300.00 | 416902800.00 | 51740210000.00 |
| max | 11396780000.00 | 13710640000.00 | 0.93 | 88436970.00 | 858232600.00 | 1743539000.00 | 2642692000.00 | 8250736000.00 | 426914600000.00 |
| Share of cumulative CO2 emissions | Contribution to temperature rise | Population - Education: Post Secondary | Population - Education: Upper Secondary | Population - Education: Lower Secondary | Population - Education: Primary | Population - Education: Incomplete Primary | Population - Education: No Education | Population - Education: Under 15 | Population (number) | |
| count | 630.00 | 630.00 | 630.00 | 630.00 | 630.00 | 630.00 | 630.00 | 630.00 | 630.00 | 630.00 |
| mean | 5.33 | 0.05 | 16722340.00 | 36436890.00 | 43712180.00 | 32234490.00 | 14956620.00 | 42991600.00 | 82316990.00 | 277365400.00 |
| std | 9.27 | 0.06 | 27933400.00 | 53308720.00 | 97421520.00 | 52721110.00 | 22983330.00 | 79366320.00 | 115380700.00 | 395267600.00 |
| min | 0.01 | 0.00 | 36100.00 | 68500.00 | 430700.00 | 0.00 | 0.00 | 0.00 | 6612300.00 | 15276560.00 |
| 25% | 0.27 | 0.01 | 1326700.00 | 4598100.00 | 5399125.00 | 3810300.00 | 444000.00 | 2542500.00 | 11825100.00 | 50089850.00 |
| 50% | 1.16 | 0.02 | 5783600.00 | 13641200.00 | 10568000.00 | 10126800.00 | 3799500.00 | 5732150.00 | 25066800.00 | 66412130.00 |
| 75% | 4.75 | 0.05 | 15051400.00 | 39150200.00 | 28268300.00 | 27610400.00 | 19467000.00 | 30892300.00 | 61191000.00 | 250691100.00 |
| max | 38.78 | 0.28 | 154720400.00 | 250631200.00 | 537276300.00 | 200622500.00 | 82623900.00 | 292338700.00 | 380274300.00 | 1425894000.00 |
| Population growth rate | GDP | GDP per capita | Change in gdp | Annual CO2 emission growth (%) | Absolute CO2 change | |
| count | 630.00 | 630.00 | 630.00 | 630.00 | 630.00 | 630.00 |
| mean | 1.57 | 2098799000000.00 | 12938.54 | 4.02 | 3.43 | 26837800.00 |
| std | 0.98 | 3839755000000.00 | 15577.38 | 4.81 | 10.74 | 103135300.00 |
| min | -0.39 | -71767060000.00 | -6173.54 | -27.27 | -48.33 | -547516900.00 |
| 25% | 0.68 | 166345200000.00 | 1367.95 | 1.74 | -1.09 | -961013.50 |
| 50% | 1.37 | 693487400000.00 | 5221.58 | 3.85 | 3.07 | 5909344.00 |
| 75% | 2.45 | 1841303000000.00 | 23704.80 | 6.73 | 6.84 | 24604410.00 |
| max | 5.92 | 20529460000000.00 | 83951.61 | 25.01 | 82.62 | 911781900.00 |
Appendix C. Regression Results Summary
| Models | Residuals | Mean Squared Error | R-squared | Mean Cross validation score | ||||||||
| Baseline | PCA | FS | Baseline | PCA | FS | Baseline | PCA | FS | Baseline | PCA | FS | |
| Linear regression | -59327819.08 | - | - | 3.103195e+20 | - | - | -1.144 | - | - | -253.80 | - | - |
| Ridge Regression | -40263611.43 | - | - | 1.699994e+20 | - | - | -0.17 | - | - | -29.06 | - | - |
| Random Forest Regressor | -24435046.47 | - | - | 59685956e+8 | - | - | 0.58 | - | - | -0.89 | - | - |
| Bagging Regressor | -22162025.93 | - | - | 59739629e+8 | - | - | 0.58 | - | - | -40.29 | - | - |
| Gradient Boosting Regressor | -14687525.64 | -20216617.8 | -18239234.8 | 50060367e+8 | 609461e+6 | 4886957e+5 | 0.65 | 0.57 | 0.66 | -1.88 | -0.71 | -0.71 |
| XGBoost Regressor | -28289899.56 | - | - | 79866357e+8 | - | - | 0.44 | - | - | -53.54 | - | - |
| KNeighbors Regressor | -34832710.33 | - | - | 1.110808e+20 | - | - | 0.23 | - | - | -0.32 | - | - |
| Adaboost Regressor | -23440275.88 | - | - | 64210504e+8 | - | - | 0.55 | - | - | -1.24 | - | - |
Appendix C1: Grouping by Volatility by Targets: High Volatility
| Models | Residuals | Mean Squared Error | R-squared | Mean Cross validation score |
| Linear regression | -59327819.08 | 3.103195e+20 | -1.144 | -253.80 |
| Ridge Regression | -40263611.43 | 1.699994e+20 | -0.17 | -29.18 |
| Random Forest Regressor | -28879206.47 | 7216968e+8 | 0.50 | -0.85 |
| Bagging Regressor | -28411174.11 | 6158571e+8 | 0.57 | -3.34 |
| Gradient Boosting Regressor | -17772056.50 | 4940930e+8 | 0.66 | -1.55 |
| XGBoost Regressor | -28289899.56 | 7986635e+8 | 0.44 | -0.94 |
| KNeighbors Regressor | -29801474.76 | 1.1108049e+20 | 0.23 | -0.32 |
| Adaboost Regressor | -7268139.56 | 64677311e+8 | 0.53 | -0.60 |
Appendix C2: Grouping by Volatility by Targets: Low Volatility
| Models | Residuals | Mean Squared Error | R-squared | Mean Cross validation score |
| Linear regression | -59327819.08 | 3.103195e+20 | -1.14 | -253.80 |
| Ridge Regression | -40263611.43 | 1.699994e+20 | -0.17 | -29.18 |
| Random Forest Regressor | -28455959.39 | 6675874e+7 | 0.54 | -0.89 |
| Bagging Regressor | -297537e+2 | 244154e+9 | 0.61 | -1.35 |
| Gradient Boosting Regressor | -17282934.1 | 491648e+8 | 0.66 | -1.99 |
| XGBoost Regressor | -28289899.56 | 7986635e+8 | 0.44 | -0.94 |
| KNeighbors Regressor | -34832710.33 | 1.1108049e+20 | 0.23 | -0.32 |
| Adaboost Regressor | -7268139.56 | 4388766e+8 | 0.70 | -0.97 |
Appendix D. Year on Year Change in co2 Emissions for the Considered Countries

References
- S. S. Patel, B. McCaul, G. Cáceres, L. E. R. Peters, R. B. Patel, and A. Clark-Ginsberg, “Delivering the promise of the Sendai Framework for Disaster Risk Reduction in fragile and conflict-affected contexts (FCAC): A case study of the NGO GOAL’s response to the Syria conflict,” Prog. Disaster Sci., vol. 10, p. 100172, Apr. 2021. [CrossRef]
- M. Garschagen, D. Doshi, J. Reith, and M. Hagenlocher, “Global patterns of disaster and climate risk—an analysis of the consistency of leading index-based assessments and their results,” Clim. Change, vol. 169, no. 1–2, p. 11, Nov. 2021. [CrossRef]
- B. J. Kim, S. Jeong, and J.-B. Chung, “Research trends in vulnerability studies from 2000 to 2019: Findings from a bibliometric analysis,” Int. J. Disaster Risk Reduct., vol. 56, p. 102141, Apr. 2021. [CrossRef]
- P. Shi et al., “Disaster Risk Science: A Geographical Perspective and a Research Framework,” Int. J. Disaster Risk Sci., vol. 11, no. 4, pp. 426–440, Aug. 2020. [CrossRef]
- L. Bloice and S. Burnett, “Barriers to knowledge sharing in third sector social care: a case study,” J. Knowl. Manag., vol. 20, no. 1, pp. 125–145, Feb. 2016. [CrossRef]
- C. M. Mukendi and H. Choi, “Temporal Analysis of World Disaster Risk: A Machine Learning Approach to Cluster Dynamics,” in 2023 14th International Conference on Information and Communication Technology Convergence (ICTC), Jeju Island, Korea, Republic of: IEEE, Oct. 2023, pp. 973–978. [CrossRef]
- Global Burden of Disease Study, “Deaths that are from all causes attributed to air pollution per 100,000 people, in both sexes aged age-standardized,” IHME, Global Burden of Disease Study, 2019. [Online]. Available: https://ourworldindata.org/air-pollution.
- S. Li, Y. W. Siu, and G. Zhao, “Driving Factors of CO2 Emissions: Further Study Based on Machine Learning,” Front. Environ. Sci., vol. 9, p. 721517, Aug. 2021. [CrossRef]
- Bruno Venditti, “Here’s how CO2 emissions have changed since 1900,” World Economic Forum, Nov. 22, 2022. [Online]. Available: https://www.weforum.org/agenda/2022/11/visualizing-changes-carbon-dioxide-emissions-since-1900/.
- G. James, D. Witten, T. Hastie, R. Tibshirani, and J. Taylor, “Linear Regression,” in An Introduction to Statistical Learning, in Springer Texts in Statistics. , Cham: Springer International Publishing, 2023, pp. 69–134. [CrossRef]
- M. Revan Özkale and H. Altuner, “Bootstrap confidence interval of ridge regression in linear regression model: A comparative study via a simulation study,” Commun. Stat. - Theory Methods, vol. 52, no. 20, pp. 7405–7441, Oct. 2023. [CrossRef]
- J. Pérez-Rodríguez, F. Fernández-Navarro, and T. Ashley, “Estimating ensemble weights for bagging regressors based on the mean–variance portfolio framework,” Expert Syst. Appl., vol. 229, p. 120462, Nov. 2023. [CrossRef]
- D. Ghunimat, A. E. Alzoubi, A. Alzboon, and S. Hanandeh, “Prediction of concrete compressive strength with GGBFS and fly ash using multilayer perceptron algorithm, random forest regression and k-nearest neighbor regression,” Asian J. Civ. Eng., vol. 24, no. 1, pp. 169–177, Jan. 2023. [CrossRef]
- J. Cai, K. Xu, Y. Zhu, F. Hu, and L. Li, “Prediction and analysis of net ecosystem carbon exchange based on gradient boosting regression and random forest,” Appl. Energy, vol. 262, p. 114566, Mar. 2020. [CrossRef]
- W. P. Zhao, J. Li, J. Zhao, D. Zhao, J. Lu, and X. Wang, “XGB Model : Research on Evaporation Duct Height Prediction Based on XGBoost Algorithm,” Radioengineering, vol. 29, no. 1, pp. 81–93, Apr. 2020. [CrossRef]
- H. Wei, “AdaBoost Regression Predicts the Ranking of College Students Using the Super Star Learning APP,” in 2023 IEEE International Conference on Electrical, Automation and Computer Engineering (ICEACE), Changchun, China: IEEE, Dec. 2023, pp. 355–362. [CrossRef]
- B. Yao, “Walmart Sales Prediction Based on Decision Tree, Random Forest, and K Neighbors Regressor,” Highlights Bus. Econ. Manag., vol. 5, pp. 330–335, Feb. 2023. [CrossRef]
- E. Y. Boateng and D. A. Abaye, “A Review of the Logistic Regression Model with Emphasis on Medical Research,” J. Data Anal. Inf. Process., vol. 07, no. 04, pp. 190–207, 2019. [CrossRef]
- B. Charbuty and A. Abdulazeez, “Classification Based on Decision Tree Algorithm for Machine Learning,” J. Appl. Sci. Technol. Trends, vol. 2, no. 01, pp. 20–28, Mar. 2021. [CrossRef]
- A. B. Shaik and S. Srinivasan, “A Brief Survey on Random Forest Ensembles in Classification Model,” in International Conference on Innovative Computing and Communications, vol. 56, S. Bhattacharyya, A. E. Hassanien, D. Gupta, A. Khanna, and I. Pan, Eds., in Lecture Notes in Networks and Systems, vol. 56. , Singapore: Springer Singapore, 2019, pp. 253–260. [CrossRef]
- G. Abdurrahman and M. Sintawati, “Implementation of xgboost for classification of parkinson’s disease,” J. Phys. Conf. Ser., vol. 1538, no. 1, p. 012024, May 2020. [CrossRef]
- A. Chandramouli, V. R. Hyma, P. S. Tanmayi, T. G. Santoshi, and B. Priyanka, “Diabetes prediction using Hybrid Bagging Classifier,” Entertain. Comput., vol. 47, p. 100593, Aug. 2023. [CrossRef]
- L. Hao and G. Huang, “An improved AdaBoost algorithm for identification of lung cancer based on electronic nose,” Heliyon, vol. 9, no. 3, p. e13633, Mar. 2023. [CrossRef]
- B. Gezici and A. K. Tarhan, “Explainable AI for Software Defect Prediction with Gradient Boosting Classifier,” in 2022 7th International Conference on Computer Science and Engineering (UBMK), Diyarbakir, Turkey: IEEE, Sep. 2022, pp. 1–6. [CrossRef]
- S. Alam, S. K. Sonbhadra, S. Agarwal, and P. Nagabhushan, “One-class support vector classifiers: A survey,” Knowl.-Based Syst., vol. 196, p. 105754, May 2020. [CrossRef]
- S. Naiem, A. E. Khedr, A. M. Idrees, and M. I. Marie, “Enhancing the Efficiency of Gaussian Naïve Bayes Machine Learning Classifier in the Detection of DDOS in Cloud Computing,” IEEE Access, vol. 11, pp. 124597–124608, 2023. [CrossRef]
- S. Raschka, “Model Evaluation, Model Selection, and Algorithm Selection in Machine Learning,” 2018. [CrossRef]
- T. O. Hodson, T. M. Over, and S. S. Foks, “Mean Squared Error, Deconstructed,” J. Adv. Model. Earth Syst., vol. 13, no. 12, p. e2021MS002681, Dec. 2021. [CrossRef]
- Y. Ma, Z. Xie, S. Chen, F. Qiao, and Z. Li, “Real-time detection of abnormal driving behavior based on long short-term memory network and regression residuals,” Transp. Res. Part C Emerg. Technol., vol. 146, p. 103983, Jan. 2023. [CrossRef]
- D. Chicco, M. J. Warrens, and G. Jurman, “The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation,” PeerJ Comput. Sci., vol. 7, p. e623, Jul. 2021. [CrossRef]
- D. Chicco and G. Jurman, “The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation,” BMC Genomics, vol. 21, no. 1, p. 6, Dec. 2020. [CrossRef]
- M. Heydarian, T. E. Doyle, and R. Samavi, “MLCM: Multi-Label Confusion Matrix,” IEEE Access, vol. 10, pp. 19083–19095, 2022. [CrossRef]
- AMAN KHARWAL, “Classification report in machine learning.” [Online]. Available: https://www.mendeley.com/catalogue/bb23c245-6fe2-37d1-a8ba-4041334de8c9/.
- christoph. M, “Interpretable machine learning: PArtial dependence plot,” christophm.github.io/. [Online]. Available: https://christophm.github.io/interpretable-ml-book/pdp.html.
- C. Molnar et al., “Relating the Partial Dependence Plot and Permutation Feature Importance to the Data Generating Process,” in Explainable Artificial Intelligence, vol. 1901, L. Longo, Ed., in Communications in Computer and Information Science, vol. 1901. , Cham: Springer Nature Switzerland, 2023, pp. 456–479. [CrossRef]
- G. Kong, S. Hu, and Q. Yang, “Uncertainty method and sensitivity analysis for assessment of energy consumption of underground metro station,” Sustain. Cities Soc., vol. 92, p. 104504, May 2023. [CrossRef]
- B. Iooss and A. Saltelli, “Introduction to Sensitivity Analysis,” in Handbook of Uncertainty Quantification, R. Ghanem, D. Higdon, and H. Owhadi, Eds., Cham: Springer International Publishing, 2015, pp. 1–20. [CrossRef]
- “Using the Canonical Correlation Analysis Method to Study Students’ Levels in Face-to-Face and Online Education in Jordan,” Inf. Sci. Lett., vol. 12, no. 2, pp. 901–910, Feb. 2023. [CrossRef]
- Y. Yin et al., “IGRF-RFE: a hybrid feature selection method for MLP-based network intrusion detection on UNSW-NB15 dataset,” J. Big Data, vol. 10, no. 1, p. 15, Feb. 2023. [CrossRef]
- J. Zhai and F. Kong, “The Impact of Multi-Dimensional Urbanization on CO2 Emissions: Empirical Evidence from Jiangsu, China, at the County Level,” Sustainability, vol. 16, no. 7, p. 3005, Apr. 2024. [CrossRef]
- R. Ngcobo and M. C. De Wet, “The Impact of Financial Development and Economic Growth on Renewable Energy Supply in South Africa,” Sustainability, vol. 16, no. 6, p. 2533, Mar. 2024. [CrossRef]
- Global Carbon Budget, “Year-on-year change in CO₂ emissions – GCB.” 2023. [Online]. Available: https://ourworldindata.org/grapher/absolute-change-co2.
- Jones et al, “Annual greenhouse gas emissions by world region” [dataset], National contributions to climate change [original data].” 2023. [Online]. Available: https://ourworldindata.org/grapher/ghg-emissions-by-world-region.
- T. Wei, J. Wu, and S. Chen, “Keeping Track of Greenhouse Gas Emission Reduction Progress and Targets in 167 Cities Worldwide,” Front. Sustain. Cities, vol. 3, p. 696381, Jul. 2021. [CrossRef]
- Copernicus Climate Change Service (2024), “‘Annual temperature anomalies’ [dataset]. Copernicus Climate Change Service, ‘ERA5 monthly averaged data on single levels from 1940 to present 2’ [original data].” 2024. [Online]. Available: https://ourworldindata.org/grapher/annual-temperature-anomalies.
- NASA’s Scientific Visualization Studio, “Global Temperature Anomalies from 1880 to 2019,” Scientific Visualization Studio. [Online]. Available: https://svs.gsfc.nasa.gov/4787#section_credits.
- Global Carbon Budget (2023), “‘Other industry – GCB’ [dataset]. Global Carbon Project, ‘Global Carbon Budget’ [original data].” 2023. [Online]. Available: https://ourworldindata.org/grapher/co2-by-source.
- M. Molteni, G. Walker, D. Parmar, M. Sutton, P. Licence, and S. Woodward, “Can ‘Electric Flare Stacks’ Reduce CO 2 Emissions? A Case Study with Nonthermal Plasma,” Ind. Eng. Chem. Res., vol. 62, no. 46, pp. 19649–19657, Nov. 2023. [CrossRef]
- “Concrete needs to lose its colossal carbon footprint,” Nature, vol. 597, no. 7878, pp. 593–594, Sep. 2021. [CrossRef]
- Global Carbon Budget (2023), “‘Cumulative CO₂ emissions – GCB’ [dataset]. Global Carbon Project, ‘Global Carbon Budget’ [original data].” 2023. [Online]. Available: https://ourworldindata.org/grapher/cumulative-co-emissions.
- Z. Liu, Z. Deng, S. J. Davis, C. Giron, and P. Ciais, “Monitoring global carbon emissions in 2021,” Nat. Rev. Earth Environ., vol. 3, no. 4, pp. 217–219, Mar. 2022. [CrossRef]
- Global Carbon Budget (2023), “‘Share of global cumulative CO₂ emissions – GCB’ [dataset]. Global Carbon Project, ‘Global Carbon Budget’ [original data].” 2023. [Online]. Available: https://ourworldindata.org/grapher/share-of-cumulative-co2.
- N. P. Gillett, “Warming proportional to cumulative carbon emissions not explained by heat and carbon sharing mixing processes,” Nat. Commun., vol. 14, no. 1, p. 6466, Oct. 2023. [CrossRef]
- Jones et al, “‘Contribution to global mean surface temperature rise’ [dataset]. Jones et al., ‘National contributions to climate change’ [original data].” 2023. [Online]. Available: https://ourworldindata.org/grapher/contribution-temp-rise-degrees.
- Hannah Ritchie, Pablo Rosado, and Max Roser, “Data Page: Global warming: Contributions to the change in global mean surface temperature,” ourworldindata.org. [Online]. Available: https://ourworldindata.org/grapher/contributions-global-temp-change.
- UN, World Population Prospects (2022), “‘Growth rate - Sex: all - Age: all - Variant: estimates’ [dataset]. UN, World Population Prospects (2022) [original data].” 2023. [Online]. Available: https://ourworldindata.org/grapher/population-growth-rates.
- populationconnection.org, “The Connections Between Population and Climate Change Info Brief,” Washington, 2024. [Online]. Available: https://populationconnection.org/resources/population-and-climate/.
- Gapminder - Population v7 (2022), Gapminder - Systema Globalis (2022), HYDE (2017), and United Nations - World Population Prospects (2022), “‘Population (future projections) (future projections)’ [dataset]. Gapminder, ‘Population v7’; Gapminder, ‘Systema Globalis’; PBL Netherlands Environmental Assessment Agency, ‘HYDE 3.2’; United Nations, ‘World Population Prospects’ [original data].” 2023. [Online]. Available: https://ourworldindata.org/grapher/population-long-run-with-projections.
- Wittgenstein Centre, “‘No Education’ [dataset]. Wittgenstein Centre (2018) [original data].” 2023. [Online]. Available: https://ourworldindata.org/grapher/world-population-level-education.
- M. M. Tang, D. Xu, and Q. Lan, “How does education affect urban carbon emission efficiency under the strategy of scientific and technological innovation?,” Front. Environ. Sci., vol. 11, p. 1137570, Apr. 2023. [CrossRef]
- World Bank (2023), “GDP (constant 2015 US$).” 2023. [Online]. Available: https://data.worldbank.org/indicator/NY.GDP.MKTP.KD.
- World Bank (2023), “GDP per capita (constant 2015 US$).” 2023. [Online]. Available: https://data.worldbank.org/indicator/NY.GDP.PCAP.KD.
- Leandro Vigna and Johannes Friedrich, “Global per capita emissions explained - through 9 charts.” [Online]. Available: https://www.weforum.org/agenda/2023/05/global-per-capita-emissions-explained-charts/.
- World Bank and OECD, “‘GDP’ [dataset]. World Bank and OECD [original data].” 2023. [Online]. Available: https://ourworldindata.org/grapher/co2-gdp-growth.








| Category Names | Min Values | Max Values | Range | Countries |
|---|---|---|---|---|
| High positive | 3.244667e+03 | 6.769703e+08 | 676967055.333 | United states, China, India |
| High negative | -1.906534e+08 | -1.370680e+05 | 190516332 | |
| Moderate positive | 4.88107e+05 | 3.741389e+07 | 36925079.3 | United Kingdom, France, South Korea, Brazil, India |
| Moderate negative | -3.209079e+07 | -4.587947e+05 | 31631995.3 | |
| Low positive | 2.683000e+03 | 2.632321e+07 | 26320527 | South Africa, Nigeria, Democratic Republic of Congo |
| Low negative | -2.045842e+07 | -8.609333e+03 | 20449810.667 |
| Model | MCC | AUC | Precision | Recall | F1 score | Mean CV |
|---|---|---|---|---|---|---|
| Logreg | 0.67 | 0.73 | 0.76 | 0.73 | 0.68 | 0.74 |
| DT | 0.95 | 0.96 | 0.96 | 0.96 | 0.96 | 0.93 |
| RF | 0.87 | 0.89 | 0.91 | 0.89 | 0.89 | 0.86 |
| XGB | 0.83 | 0.85 | 0.89 | 0.85 | 0.84 | 0.91 |
| GB | 0.84 | 0.86 | 0.89 | 0.86 | 0.85 | 0.85 |
| SVC | 0.58 | 0.64 | 0.70 | 0.64 | 0.58 | 0.73 |
| MLP | 0.60 | 0.66 | 0.74 | 0.66 | 0.64 | 0.75 |
| GNB | 0.50 | 0.58 | 0.66 | 0.58 | 0.55 | 0.75 |
| Class | Increase | Decrease |
| High negative | Population number | 3 years average |
| High positive | co2 from coal Population number 3 years average |
--- |
| Low negative | --- | Population number Gdp 3 years average |
| Low positive | 3 years average |
co2 from oil Population education: Incomplete Primary Population number Change in gdp |
| Moderate negative | GDP | Population number 3 years average |
| Moderate positive | co2 from oil Population education: Incomplete Primary Change in gdp 3 years average |
Co2 from coal Population number |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
