Submitted:
22 July 2024
Posted:
23 July 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Related Work
1.2. Research Gap
1.3. Motivation
1.4. Novelty and Contributions
1.5. Organization of this Article

2. Materials and Methods
2.1. Data Collection and Description
| Variables | Energy type | Unit |
|---|---|---|
| Electrical energy generated from solar panels | Watt-hour | |
| Electrical energy generated from fuel cells | Watt-hour | |
| Total electrical energy consumption | Watt-hour |
2.2. Questionnaire Survey
-
Survey Question 1: Please tell the type of housing in which you reside. Please select the most suitable answer from the options provided:
- Single-storey detached house.
- Double-storey detached house.
- Triple-storey detached house.
- Apartment (owned).
- Apartment (rented).
-
Survey Question 2: Do you have a floor heating system installed in your residence? Please choose the most appropriate answer from the options below:
- No, it is not installed.
- Yes, electric floor heating systems are installed (e.g., all-electric homes).
- Yes, gas-powered floor heating systems with hot water circulation are installed (e.g., homes equipped with ENE-FARM, etc).
- Yes, other hot water floor heating systems are installed (e.g., homes with OM Solar panels).
- Survey Question 3: Please provide the average monthly household electricity bill in Yen.
2.3. Data Pre-Processing
2.4. Electricity Self-Sufficiency Rate
- Renewable Energy Sources: The use of solar panels, fuel cells, Inverters, or other renewable energy installations that can generate electricity or heat for the household.
- Energy Storage Systems: The presence of batteries or other storage systems that can store energy produced during peak production times for use during periods when production is low.
- Energy Efficiency Measures: The implementation of energy-saving practices and technologies, such as high-efficiency appliances, LED lighting, proper insulation, and smart thermostats.
- Geographical Location: The availability and effectiveness of renewable energy sources vary by location. For example, solar power is more effective in regions with abundant sunlight.
- Electricity Produced by Household (Wh): Represents the amount of electricity generated by the household’s renewable energy sources, measured in watt-hours (Wh).
- Total Electricity Consumption of Household (Wh): Denotes the total amount of electricity consumed by the household from all sources, including both self-generated electricity and any additional electricity drawn from the grid or other external sources, measured in watt-hours (Wh).
2.5. Model Development
2.5.1. Feature Analysis Based on SHAP (SHapley Additive exPlanations):
- is the SHAP value for feature i.
- S is a subset of features excluding i.
- N is the set of all features.
- is the value function for the subset S.
2.5.2. LightGBM:
- Gradient-based One-Side Sampling (GOSS): LightGBM retains instances with large gradients while randomly sampling instances with smaller gradients. This approach reduces the number of data points processed and accelerates computation without significantly compromising accuracy.
- Exclusive Feature Bundling (EFB): LightGBM combines mutually exclusive features—features that rarely have non-zero values simultaneously—to decrease the total number of features and enhance training efficiency.
- Histogram-based Decision Tree Learning: LightGBM employs a histogram-based technique to determine optimal split points by converting continuous feature values into discrete bins. This method simplifies the learning process and accelerates training.
- Leaf-wise Tree Growth: In contrast to traditional level-wise tree growth methods, such as those used by XGBoost, LightGBM adopts a leaf-wise strategy. It selects the leaf with the highest delta loss for expansion, resulting in deeper trees and improved accuracy.
- is the loss function
- is the prediction for instance i,
- is the regularization term for the k-th tree ,
- n is the no. of instances
- N is the no. of trees.
- and are the sums of the gradients for the left and right splits, respectively,
- and are the sums of the second-order gradients (Hessians) for the left and right splits, respectively,
- is the regularization parameter,
- is the regularization term for the number of leaves.
- Input variables - Survey responses (option number 1–5 for each survey question)
- Output variable - Electricity self-sufficiency rate
- Evaluation function - Mean absolute error (MAE)
2.5.3. Correlation Matrix Based Heatmap:
Correlation Coefficient
- and are individual sample points
- and are the means of x and y, respectively
- n is the number of data points
Correlation Matrix
3. Results and Discussion
4. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| SHAP | SHapley Additive exPlanations |
| HEMS | Home energy management systems |
| nZEHs | Net Zero energy houses |
| LightGBM | Light Gradient boosting machine |
| ESSR | Electricity self-sufficiency rate |
| MAE | Mean absolute error |
| RES | Renewable energy resources |
| ANN | Artificial neural network |
| FFNN | Feedforward Neural Networks |
References
- Zhang, X.; Zhang, H.; Yuan, J. Economic growth, energy consumption, and carbon emission nexus: fresh evidence from developing countries. Environ. Sci. Pollut. Res. 2019, 26 (25), 26367–26380. [CrossRef] [PubMed]
- Chanthakett, A.; Arif, M. T.; Khan, M. M. K.; Subhani, M. Hydrogen production from municipal solid waste using gasification method. In Hydrogen Energy Conversion and Management; Elsevier, 2024; pp 103–131. [CrossRef]
- Singh, N. K.; Fukushima, T.; Nagahara, M. Gradient Boosting Approach to Predict Energy-Saving Awareness of Households in Kitakyushu. Energies 2023, 16 (16), 5998. [CrossRef]
- High Efficiency Plants and Building Integrated Renewable Energy Systems. In Handbook of Energy Efficiency in Buildings; Elsevier, 2019; pp 441–595. [CrossRef]
- Naterer, G. F.; Dincer, I.; Zamfirescu, C. Hydrogen Production from Nuclear Energy; Springer London: London, 2013b. [CrossRef]
- Yoro, K. O.; Daramola, M. O. CO2 emission sources, greenhouse gases, and the global warming effect. In Advances in Carbon Capture; Elsevier, 2020; pp 3–28. [CrossRef]
- Nejat, P.; Jomehzadeh, F.; Taheri, M.M.; Gohari, M.; Abd Majid, M.Z. A global review of energy consumption, CO2 emissions and policy in the residential sector (with an overview of the top ten CO2 emitting countries). Renew. Sustain. Energy Rev. 2015, 43, 843–862. [CrossRef]
- Tian, J.; Yu, L.; Xue, R.; Zhuang, S.; Shan, Y. Global low-carbon energy transition in the post-COVID-19 era. Appl. Energy 2022b, 307, 118205. [CrossRef] [PubMed]
- Singh, N. K.; Fukushima, T.; Nagahara, M. Gradient Boosting Approach to Predict Zero Carbon Achievement of Households in Kitakyushu. In 2023 9th International Conference on Control, Decision and Information Technologies (CoDIT), Rome, Italy, July 3–6, 2023; IEEE, 2023. [CrossRef]
- Santamouris, M.; Cartalis, C.; Synnefa, A.; Kolokotsa, D. On the impact of urban heat island and global warming on the power demand and electricity consumption of buildings—A review. Energy Build. 2015, 98, 119–124. [CrossRef]
- Dong, K.; Hochman, G.; Timilsina, G. R. Do drivers of CO2 emission growth alter overtime and by the stage of economic development? Energy Policy 2020, 140, 111420. [CrossRef]
- Schleussner, C.-F.; Rogelj, J.; Schaeffer, M.; Lissner, T.; Licker, R.; Fischer, E. M.; Knutti, R.; Levermann, A.; Frieler, K.; Hare, W. Science and policy characteristics of the Paris Agreement temperature goal. Nat. Clim. Chang. 2016, 6 (9), 827–835. [CrossRef]
- Moodley, P.; Trois, C. Lignocellulosic biorefineries: the path forward. In Sustainable Biofuels; Elsevier, 2021; pp 21–42. [CrossRef]
- Guzović, Z.; Duić, N.; Piacentino, A.; Markovska, N.; Mathiesen, B. V.; Lund, H. Paving the way for the Paris Agreement: Contributions of SDEWES science. Energy 2022, 125617. [CrossRef]
- Ashouri, M.; Haghighat, F.; Fung, B. C. M.; Lazrak, A.; Yoshino, H. Development of building energy saving advisory: A data mining approach. Energy Build. 2018, 172, 139–151. [CrossRef]
- Colmenar-Santos, A.; Campíñez-Romero, S.; Pérez-Molina, C.; Castro-Gil, M. Profitability analysis of grid-connected photovoltaic facilities for household electricity self-sufficiency. Energy Policy 2012, 51, 749–764. [CrossRef]
- Ramirez Camargo, L.; Nitsch, F.; Gruber, K.; Dorner, W. Electricity self-sufficiency of single-family houses in Germany and the Czech Republic. Appl. Energy 2018, 228, 902–915. [CrossRef]
- Li, S.-Y.; Han, J.-Y. The impact of shadow covering on the rooftop solar photovoltaic system for evaluating self-sufficiency rate in the concept of nearly zero energy building. Sustain. Cities Soc. 2022, 80, 103821. [CrossRef]
- Harke, F.; Otto, P. Solar Self-Sufficient Households as a Driving Factor for Sustainability Transformation. Sustainability 2023, 15, 2734. [CrossRef]
- Nishimura, A., Tanikaga, S., Hirota, M., & Hu, E. (2018). Energy characteristics of an integrated power generation system with photovoltaic and fuel cell. Smart Grid and Renewable Energy, 9(4), 57-73. [CrossRef]
- Bruni, G., Cordiner, S., & Mulone, V. (2014). Domestic distributed power generation: Effect of sizing and energy management strategy on the environmental efficiency of a photovoltaic-battery-fuel cell system. Energy, 77, 133-143. [CrossRef]
- Ozcan, M. (2018). The role of renewables in increasing Turkey’s self-sufficiency in electrical energy. Renewable and Sustainable Energy Reviews, 82, 2629-2639. [CrossRef]
- Beckel, C.; Sadamori, L.; Staake, T.; Santini, S. Revealing household characteristics from smart meter data. Energy 2014, 78, 397–410. [CrossRef]
- Edwards, R. E., New, J., & Parker, L. E. (2012). Predicting future hourly residential electrical consumption: A machine learning case study. Energy and Buildings, 49, 591-603. [CrossRef]
- Thakur, A., Shukla, K. A., Choudhary, A., & Atrey, J. (2023, July). Predictive Analysis of Energy Consumption and Electricity Demand Using Machine Learning Techniques. In 2023 International Conference on Smart Systems for applications in Electrical Sciences (ICSSES) (pp. 1-6). IEEE.
- Chou, J.-S.; Tran, D.-S. Forecasting energy consumption time series using machine learning techniques based on usage patterns of residential householders. Energy 2018, 165, 709–726. [CrossRef]
- Wang, B., Wang, Y., Qin, K., & Xia, Q. (2018, June). Detecting transportation modes based on LightGBM classifier from GPS trajectory data. In 2018 26th International conference on geoinformatics (pp. 1-7). IEEE.
- Nohara, Y.; Matsumoto, K.; Soejima, H.; Nakashima, N. Explanation of machine learning models using shapley additive explanation and application for real data in hospital. Comput. Methods Programs Biomed. 2022, 214, 106584. [CrossRef]
- Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, et al & Liu, T. Y. (2017). Lightgbm: A highly efficient gradient boosting decision tree. Advances in neural information processing systems, 30.
- Wang, Y.; Wang, T. Application of Improved LightGBM Model in Blood Glucose Prediction. Appl. Sci. 2020, 10 (9), 3227. [CrossRef]
- Nagahara, M. Sparsity Methods for Systems and Control; now Publishers: Norwell, MA, USA, 2020.








Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).