Submitted:
02 November 2024
Posted:
11 November 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
- What quantitative information relates to data, scientific documents, and collaboration between authors?
- Who are the most productive authors, and how has their productivity changed over time?
- What are the dynamics exist between authors, journals and keywords?
- Which journals are the most relevant?
- How do authors and countries collaborate?
- Which countries produce the highest number of articles?
2. Contributions to the Field
3. Materials and Methods
3.1. Database Survey
3.2. Scientometric Analysis
- Annual production of articles;
- Productivity of authors;
- Author productivity over time;
- The most productive authors;
- Three-field plot;
- The core journals according the theme investigated;
- Production journals over time;
- The most productive countries.
4. Results
4.1. Annual Production of Articles and Author’s Productivity
4.2. Three-Field Plot, Core Journals and Journal’s Production
4.3. Scientific Production and Collaborations
4.4. Emerging Trends from Scientometric Analysis in Energy Theft Detection and Consumption Profiling Research
4.4.1. Exponential Growth in the Research Field
4.4.2. Strengthening of International Collaborations
4.4.3. Emergence of Specialized Journals and Changing Publication Dynamics
4.4.4. Increased Contribution from Emerging Economies
4.4.5. Integration of AI and Big Data for Energy Theft Detection
4.4.6. Conformity with the Sustainable Development Goals (SDGs)
5. Conclusions
- Q: What quantitative information relates to data, scientific documents, and collaboration between authors?
- A: Between 2003 and April 2024, our findings show an exponential increase in the number of published articles, culminating in 2024 with a total of 107 articles. This indicates a growing interest among researchers in the two themes that constitute the subject of this study. Additionally, out of 1556 authors, only 14 produced single-authored articles, indicating a strong synergy in collaborative work among authors from the same or different countries.
- Q: Who are the most productive authors, and how has their productivity changed over time?
- A: The most productive authors are primarily from Pakistan, followed by South Korea, Saudi Arabia, India, China, Palestine, and Bangladesh. Nadeem Javaid was identified as the most productive author, with 12 articles among the 478 in the database created for this study. His first two works, published in 2020, have received a total of 101 citations. The majority of his publications appear in the IEEE Access journal.
- Q: What are the dynamics exist between authors, journals and keywords?
- A: Using the Sankey diagram and Bradford’s Law, it was discovered that the top 10 productive authors concentrated their publications in journals such as Applied Energy, Energies, IEEE Access, Energy and Buildings, Energy, and IEEE Transactions on Smart Grid. The most common keywords used by these authors relate to deep learning, smart meters, smart grids, machine learning, and clustering.
- Q: Which journals are the most relevant?
- A: According to Bradford’s Law, the most relevant journals, in descending order, are Applied Energy, Energies, IEEE Access, Energy and Buildings, Energy, and IEEE Transactions on Smart Grid. This result guides researchers interested in publishing their findings to the most relevant journals for reporting discoveries related to electricity theft and energy consumption categorization in smart grids.
- Q: How do authors and countries collaborate?
- A: The research showed significant collaboration among authors, as less than 1% of the articles were single-authored. Since the oldest article in the database was published in 2003, it was observed that the research groups have been in existence for just over 20 years, with considerable interaction between groups. Regarding collaborations between countries, China and the USA are the countries with the most collaboration with other countries, and with the exception for Pakistan, the two leading publishing countries have a higher number of articles co-authored by researchers affiliated with institutions in these respective countries.
- Q: Which countries produce the highest number of articles?
- A: China and the USA are the countries that have published the most articles between 2003 and April 2024. They also have a strong collaboration network with other countries identified in this research. The year 2021 has the highest number of publications, due to the increase of research groups with publications related to that year and the increase of partnerships and publications from China.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| AI | Artificial Intelligence |
| AMI | Advanced Metering Infrastructure |
| ANEEL | The Brazilian National Agency of Electric Energy |
| CNN | Convolutional Neural Network |
| DF | Dominance Factor |
| DSM | Demand-Side Management |
| IEEE | Institute of Electrical and Electronics Engineers |
| KS | Kolmogorov-Smirnov |
| KSA | Kingdom of Saudi Arabia |
| MCP | Multiple Country Publications |
| NTL | Non-Technical Losses |
| PRISMA | Preferred Reporting Items for Systematic reviews and Meta-Analyses |
| RNN | Recurrent Neural Network |
| SCP | Single Country Publications |
| SDG | Sustainable Development Goals |
| SOM | Self-Organizing Map |
| TC | Total of Citations |
| TCN | Temporal Convolutional Network |
| TWh | Terawatt hours |
| UAE | United Arab Emirates |
| UN | United Nations |
| USA | United States of America |
| WoS | Web of Science |
References
- Dileep, G. A survey on smart grid technologies and applications. Renewable Energy 2020, 146, 2589–2625. [Google Scholar] [CrossRef]
- Satre-Meloy, A.; Diakonova, M.; Grünewald, P. Cluster analysis and prediction of residential peak demand profiles using occupant activity data. Applied Energy 2020, 260, 114246. [Google Scholar] [CrossRef]
- Khan, A.N.; Iqbal, N.; Rizwan, A.; Ahmad, R.; Kim, D.H. An ensemble energy consumption forecasting model based on spatial-temporal clustering analysis in residential buildings. Energies 2021, 14, 3020. [Google Scholar] [CrossRef]
- Zhang, X.; Ramírez-Mendiola, J.L.; Li, M.; Guo, L. Electricity consumption pattern analysis beyond traditional clustering methods: A novel self-adapting semi-supervised clustering method and application case study. Applied Energy 2022, 308, 118335. [Google Scholar] [CrossRef]
- Shi, H.; Xu, M.; Li, R. Deep learning for household load forecasting - a novel pooling deep RNN. IEEE Transactions on Smart Grid 2018, 9, 5271–5280. [Google Scholar] [CrossRef]
- Massaoudi, M.; others. A novel stacked generalization ensemble-based hybrid LGBM-XGB-MLP model for Short-Term Load Forecasting. Energy 2021, 214, 118874. [Google Scholar] [CrossRef]
- Ghimire, S.; Nguyen-Huy, T.; AL-Musaylh, M.S.; Deo, R.C.; Casillas-Pérez, D.; Salcedo-Sanz, S. A novel approach based on integration of convolutional neural networks and echo state network for daily electricity demand prediction. Energy 2023, 275, 127430. [Google Scholar] [CrossRef]
- Ullah, A.; Javaid, N.; Javed, M.U.; Pamir.; Kim, B.S.; Bahaj, S.A. Adaptive Data Balancing Method Using Stacking Ensemble Model and Its Application to Non-Technical Loss Detection in Smart Grids. IEEE Access 2022, 10, 133244–133255. [Google Scholar] [CrossRef]
- Pamir.; Javaid, N.; Almogren, A.; Adil, M.; Javed, M.U.; Zuair, M. RFE Based Feature Selection and KNNOR Based Data Balancing for Electricity Theft Detection Using BiLSTM-LogitBoost Stacking Ensemble Model. IEEE Access 2022, 10, 112948–112963. [Google Scholar] [CrossRef]
- Naeem, A.; others. A novel data balancing approach and a deep fractal network with light gradient boosting approach for theft detection in smart grids. Heliyon 2023, 9, E18928. [Google Scholar] [CrossRef]
- Kuang, L.; Yang, L.T.; Chen, J.; Hao, F.; Luo, C. A Holistic Approach for Distributed Dimensionality Reduction of Big Data. IEEE Transactions on Cloud Computing 2018, 6, 506–518. [Google Scholar] [CrossRef]
- Wen, L.; Zhou, K.; Yang, S.; Li, L. Compression of smart meter big data: A survey. Renewable and Sustainable Energy Reviews 2018, 91, 59–69. [Google Scholar] [CrossRef]
- Pourmirza, Z. S..B.J. Data reduction algorithm for correlated data in the smart grid. IET Smart Grid 2021, 4, 474–488. [Google Scholar] [CrossRef]
- Hossain, E.; Khan, I.; Un-Noor, F.; Sikander, S.S.; Sunny, M.S.H. Application of Big Data and Machine Learning in Smart Grid, and Associated Security Concerns: A Review. IEEE Access 2019, 7, 13960–13988. [Google Scholar] [CrossRef]
- Oprea, S.V.; Bâra, A.; Tudorică, B.G.; Călinoiu, M.I.; Botezatu, M.A. Insights into demand-side management with big data analytics in electricity consumers’ behaviour. Computers & Electrical Engineering 2021, 89, 106902. [Google Scholar] [CrossRef]
- Syed, D.; Zainab, A.; Ghrayeb, A.; Refaat, S.S.; Abu-Rub, H.; Bouhali, O. Smart Grid Big Data Analytics: Survey of Technologies, Techniques, and Applications. IEEE Access 2021, 9, 59564–59585. [Google Scholar] [CrossRef]
- Wen, M.; Xie, R.; Lu, K.; Wang, L.; Zhang, K. FedDetect: A Novel Privacy-Preserving Federated Learning Framework for Energy Theft Detection in Smart Grid. IEEE Internet of Things Journal 2022, 9, 6069–6080. [Google Scholar] [CrossRef]
- El-Toukhy, A.T.; others. Electricity Theft Detection Using Deep Reinforcement Learning in Smart Power Grids. IEEE Access 2023, 11, 59558–59574. [Google Scholar] [CrossRef]
- Kgaphola, P.M.; Marebane, S.M.; Hans, R.T. Electricity Theft Detection and Prevention Using Technology-Based Models: A Systematic Literature Review. Electricity 2024, 5, 334–350. [Google Scholar] [CrossRef]
- Bhatia, B.; Gulati, M. Reforming the power sector: controlling electricity theft and improving revenue. Available at: https://openknowledge.worldbank.org/handle/10986/10430, 2004. Accessed on 11-07-2024.
- Arango, L.o. Theft impact on the economy of a regulated electricity company. Journal of Control, Automation and Electrical Systems 2017, 28, 567–575. [Google Scholar] [CrossRef]
- Jain, S.; Choksi, K.A.; Pindoriya, N.M. Rule-based classification of energy theft and anomalies in consumers load demand profile. IET Smart Grid 2019, 2, 612–624. [Google Scholar] [CrossRef]
- ANEEL. Electric Energy Losses in the Distribution. Available online: https://git.aneel.gov.br/publico/centralconteudo/-/raw/main/relatorioseindicadores/tarifaeconomico/Relatorio_Perdas_Energia.pdf (accessed on 26 August 2024). (in Portuguese).
- ANEEL. Electric Energy Losses Report. Available at: https://portalrelatorios.aneel.gov.br/luznatarifa/perdasenergias, 2024. Accessed on 08-26-2024 (in Portuguese).
- Al-Ghaili, A.M.; others. A review of anomaly detection techniques in advanced metering infrastructure. Bulletin of Electrical Engineering and Informatics 2021, 10, 266–273. [Google Scholar] [CrossRef]
- Jokar, P.; Arianpoo, N.; Leung, V.C.M. Electricity Theft Detection in AMI Using Customers’ Consumption Patterns. IEEE Transactions on Smart Grid 2016, 7, 216–226. [Google Scholar] [CrossRef]
- Punmiya, R.; Choe, S. Energy Theft Detection Using Gradient Boosting Theft Detector With Feature Engineering-Based Preprocessing. IEEE Transactions on Smart Grid 2019, 10, 2326–2329. [Google Scholar] [CrossRef]
- Yao, D.; Wen, M.; Liang, X.; Fu, Z.; Zhang, K.; Yang, B. Energy Theft Detection With Energy Privacy Preservation in the Smart Grid. IEEE Internet of Things Journal 2019, 6, 7659–7669. [Google Scholar] [CrossRef]
- Gunturia, S.K.; Sarkar, D. Ensemble machine learning models for the detection of energy theft. Electric Power Systems Research 2020, 192, 106904. [Google Scholar] [CrossRef]
- Philips, A.; Jayakumar, J. Data analytics in metering infrastructure of smart grids: a review. Journal of Green Engineering 2020, 10, 11205–11232. [Google Scholar]
- Das, R.; Karmakar, G.; Kamruzzaman, J.; Chowdhury, A. Measuring Trustworthiness of Smart Meters Leveraging Household Energy Consumption Profile. IEEE Journal of Emerging and Selected Topics in Industrial Electronics 2022, 3, 289–297. [Google Scholar] [CrossRef]
- Ullah, A.; Haydarov, K.; Ul Haq, I.; Muhammad, K.; Rho, S.; Lee, M.; Baik, S.W. Deep Learning Assisted Buildings Energy Consumption Profiling Using Smart Meter Data. Sensors 2020, 20. [Google Scholar] [CrossRef]
- Csoknyai, T.; Legardeur, J.; Akle, A.A.; Horváth, M. Analysis of energy consumption profiles in residential buildings and impact assessment of a serious game on occupants’ behavior. Energy and Buildings 2019, 196, 1–20. [Google Scholar] [CrossRef]
- Sepehr, M.; Eghtedaei, R.; Toolabimoghadam, A.; Noorollahi, Y.; Mohammadi, M. Modeling the electrical energy consumption profile for residential buildings in Iran. Sustainable Cities and Society 2018, 41, 481–489. [Google Scholar] [CrossRef]
- Czétány, L.; Vámos, V.; Horváth, M.; Szalay, Z.; Mota-Babiloni, A.; Deme-Bélafi, Z.; Csoknyai, T. Development of electricity consumption profiles of residential buildings based on smart meter data clustering. Energy and Buildings 2021, 252, 111376. [Google Scholar] [CrossRef]
- Hood, W.; Wilson, C. Literature of bibliometrics, scientometrics, and informetrics. Scientometrics 2001, 52, 291–314. [Google Scholar] [CrossRef]
- Chellappandi, P.; Vijayakumar, C. Bibliometrics, scientometrics, webometrics / cybermetrics, informetrics and altmetrics - An emerging field in library and information science research. International Journal of Education 2018, 7, 5–8. [Google Scholar] [CrossRef]
- Mingers, J.; Leydesdorff, L. A review of theory and practice in scientometrics. European Journal of Operational Research 2015, 246, 1–19. [Google Scholar] [CrossRef]
- Messinis, G.M.; Hatziargyriou, N.D. Review of non-technical loss detection methods. Electric Power Systems Research 2018, 158, 250–266. [Google Scholar] [CrossRef]
- Stracqualursi, E.; Rosato, A.; Di Lorenzo, G.; Panella, M.; Araneo, R. Systematic review of energy theft practices and autonomous detection through artificial intelligence methods. Renewable and Sustainable Energy Reviews 2023, 184, 113544. [Google Scholar] [CrossRef]
- Badr, M.M.; others. Review of the Data-Driven Methods for Electricity Fraud Detection in Smart Metering Systems. Energies 2023, 16. [Google Scholar] [CrossRef]
- Kim, S.; others. Data-driven approaches for energy theft detection: a comprehensive review. Energies 2024, 17. [Google Scholar] [CrossRef]
- Gassar, A.A.A.; Cha, S.H. Energy prediction techniques for large-scale buildings towards a sustainable built environment: A review. Energy and Buildings 2020, 224, 110238. [Google Scholar] [CrossRef]
- Zhao, Y.; others. A review of data mining technologies in building energy systems: Load prediction, pattern identification, fault detection and diagnosis. Energy and Built Environment 2020, 1, 149–164. [Google Scholar] [CrossRef]
- Sun, Y.; Haghighat, F.; Fung, B.C. A review of the-state-of-the-art in data-driven approaches for building energy prediction. Energy and Buildings 2020, 221, 110022. [Google Scholar] [CrossRef]
- Aliero, M.S.; Asif, M.; Ghani, I.; Pasha, M.F.; Jeong, S.R. Systematic Review Analysis on Smart Building: Challenges and Opportunities. Sustainability 2022, 14. [Google Scholar] [CrossRef]
- Benítez, I.; Díez, J.L. Automated Detection of Electric Energy Consumption Load Profile Patterns. Energies 2022, 15. [Google Scholar] [CrossRef]
- Nations, U. Goal 7 - Ensure access to affordable, reliable, sustainable and modern energy for all. Available online: https://sdgs.un.org/goals/goal7 (accessed on 27 August 2024).
- Nations, U. Goal 9 - Build resilient infrastructure, promote inclusive and sustainable industrialization and foster innovation. Available online: https://sdgs.un.org/goals/goal9 (accessed on 27 August 2024).
- Zhu, J.; Liu, W. A tale of two databases: the use of Web of Science and Scopus in academic papers. Scientometrics 2020, 123, 321–335. [Google Scholar] [CrossRef]
- Aria, M.; Cuccurullo, C. Bibliometrix: An R-tool for comprehensive science mapping analysis. Journal of Informetrics 2017, 11, 959–975. [Google Scholar] [CrossRef]
- Page, M.J.; others. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ 2021, 372. [Google Scholar] [CrossRef]
- van Eck, N.; Waltman, L. Software survey: VOSviewer, a computer program for bibliometric mapping. Scientometrics 2010, 84, 523–538. [Google Scholar] [CrossRef]
- Lotka, A.J. The frequency distribution of scientific productivity. Journal of the Washington Academy of Sciences 1926, 16, 317–323. [Google Scholar]
- Kumar, S.; Kumar, S. Collaboration in research productivity in oil seed research institutes of India. Proceedings of Fourth International Conference on Webometrics, Informetrics and Scientometrics, 2008.
- Schmidt, M. The Sankey Diagram in Energy and Material Flow Management. Journal of Industrial Ecology 2008, 12, 82–94. [Google Scholar] [CrossRef]
- Bradford, S.C. Sources of information on specific subjects. Journal of Information Science 1934, 10, 176–180. [Google Scholar] [CrossRef]
- Sajid, Z.; Javaid, A. A Stochastic Approach to Energy Policy and Management: A Case Study of the Pakistan Energy Crisis. Energies 2018, 11, 2424. [Google Scholar] [CrossRef]
- Conrad, B.; Kostka, G. Chinese investments in Europe’s energy sector: Risks and opportunities? Energy Policy 2017, 101, 644–648. [Google Scholar] [CrossRef]
- Dollar, D. United States-China two-way direct investment: Opportunities and challenges. Journal of Asian Economics 2017, 50, 14–26. [Google Scholar] [CrossRef]
- Asif, M. Growth and sustainability trends in the buildings sector in the GCC region with particular reference to the KSA and UAE. Renewable and Sustainable Energy Reviews 2016, 55, 1267–1273. [Google Scholar] [CrossRef]
- Wang, K.; others. Generative adversarial networks: introduction and outlook. IEEE/CAA Journal of Automatica Sinica 2017, 4, 588–598. [Google Scholar] [CrossRef]
- Nasir, V.; Sassani, F.A. A review on deep learning in machining and tool monitoring: methods, opportunities, and challenges. The International Journal of Advanced Manufacturing Technology 2021, 115, 2683–2709. [Google Scholar] [CrossRef]
- Poudel, S.; Dhungana, U.R. Artificial intelligence for energy fraud detection: a review. International Journal of Applied Power Engineering (IJAPE) 2022. [Google Scholar] [CrossRef]
- Rana, N.P.; others. Understanding dark side of artificial intelligence (AI) integrated business analytics: assessing firm’s operational inefficiency and competitiveness. European Journal of Information Systems 2022, 31, 364–387. [Google Scholar] [CrossRef]
- Cuomo, S.; others. Scientific machine learning through Physics–Informed Neural Networks: where we are and what’s next. Journal of Scientific Computing 2022, 92. [Google Scholar] [CrossRef]
- Wong, L.A.; others. Review on the optimal placement, sizing and control of an energy storage system in the distribution network. Journal of Energy Storage 2019, 21, 489–504. [Google Scholar] [CrossRef]
- Xie, R. An energy theft detection framework with privacy protection for smart grid. 2023 International Joint Conference on Neural Networks (IJCNN), 2023, pp. 1–7. [CrossRef]
- Olivares-Rojas, J.C.; others. A multi-tier architecture for data analytics in smart metering systems. Simulation Modelling Practice and Theory 2020, 102, 102024. [Google Scholar] [CrossRef]
| 1 | This Github link contains all the appendices mentioned throughout this article: https://github.com/joserezende/scientometric-analysis. |












| Rank | Author | Articles(%) | Dominance Factor | H-index | Country |
|---|---|---|---|---|---|
| 1 | Nadeem Javaid | 12 (2.5) | 0.18 | 7 | ![]() |
| 2 | Sung Wook Baik | 6 (1.3) | 0.17 | 6 | ![]() |
| 3 | Imran Khan | 6 (1.3) | 0.17 | 5 | ![]() |
| 4 | Zulfiqar Ahmad Khan | 6 (1.3) | 1.00 | 6 | ![]() |
| 5 | Khursheed Aurangzeb | 5 (1.0) | 0.50 | 2 | ![]() |
| 6 | Muhammad Asad Khan | 5 (1.0) | 0.00 | 3 | ![]() |
| 7 | Ram Rajagopal | 5 (1.0) | 0.00 | 4 | ![]() |
| 8 | Amin Ullah | 5 (1.0) | 0.40 | 4 | ![]() |
| 9 | Kaile Zhou | 5 (1.0) | 0.40 | 5 | ![]() |
| 10 | Haitham Abu-Rub | 4 (0.8) | 0.00 | 3 | ![]() |
| Rank | Country | Total citations | Articles | Average article citations | Rank | Country | Total citations | Articles | Average article citations |
|---|---|---|---|---|---|---|---|---|---|
| 1 | ![]() |
2920 | 50 | 58.40 | 6 | ![]() |
649 | 35 | 18.50 |
| 2 | ![]() |
2482 | 85 | 29.20 | 7 | ![]() |
476 | 3 | 158.7 |
| 3 | ![]() |
1789 | 43 | 41.60 | 8 | ![]() |
440 | 12 | 36.70 |
| 4 | ![]() |
1037 | 14 | 74.10 | 9 | ![]() |
406 | 4 | 101.50 |
| 5 | ![]() |
704 | 25 | 28.20 | 10 | ![]() |
350 | 7 | 43.80 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).













