Introduction:
Diabetes Mellitus is a chronic condition that affects millions of people worldwide, with rising incidence rates due to changing lifestyle patterns, dietary habits, and genetics. It is a significant cause of morbidity and mortality, as it can lead to severe complications such as cardiovascular disease, kidney failure, and vision loss. Early detection and timely intervention are crucial in managing the disease and preventing its complications. As a result, the medical community has turned to machine learning (ML) techniques to develop predictive models that can identify individuals at risk of developing diabetes, allowing for earlier diagnosis and personalized treatment strategies.
Machine learning has gained significant attention in healthcare due to its ability to process large volumes of complex data and uncover patterns that might not be immediately obvious to healthcare professionals. In the case of diabetes prediction, ML models are trained on various clinical and demographic data, such as glucose levels, body mass index (BMI), age, family history, and lifestyle factors, to make predictions about the likelihood of diabetes development. However, the success of these models is not solely determined by the choice of algorithm; rather, it is the quality of the features—the input variables—that play a crucial role in improving prediction accuracy and reliability.
Feature engineering, the process of selecting, transforming, and creating new features from raw data, is a critical step in enhancing machine learning models' performance. Proper feature engineering helps to extract relevant information from raw data, reduce noise, and ensure that the data fed into ML models is more representative of the underlying patterns related to diabetes. It can also help address common issues in healthcare data, such as missing values, imbalanced datasets, and the high dimensionality of clinical features.
Understanding Feature Engineering
Feature engineering is a pivotal process in the development of machine learning models, particularly in domains like healthcare, where the complexity and variability of data require careful handling to ensure the best possible model performance. In simple terms, feature engineering refers to the techniques used to transform raw data into meaningful, structured, and informative features that can be input into a machine learning model. This process not only improves the predictive power of the model but also makes the model more interpretable and capable of handling complex healthcare data effectively.
Definition and Objectives of Feature Engineering
Feature engineering involves a series of steps to select, transform, and sometimes create new features from the raw data. The primary objective is to improve the model's ability to learn relevant patterns that lead to accurate predictions. In the context of diabetes prediction, feature engineering allows us to better represent the various factors that influence the disease, such as biological markers (e.g., blood glucose levels), lifestyle factors (e.g., diet and physical activity), and demographic information (e.g., age and family history).
The Main Objectives of Feature Engineering Include:
Improving Model Accuracy: By selecting relevant and high-quality features, the model can better learn the relationships between input data and target outcomes (e.g., the likelihood of developing diabetes).
Reducing Complexity: Proper feature selection and transformation can eliminate redundant or irrelevant data, which reduces model complexity and the risk of overfitting.
Enhancing Interpretability: Well-engineered features are easier to interpret, making it clearer why a model is making certain predictions. This is especially important in healthcare, where understanding the reasoning behind a diagnosis is critical.
Dealing with Data Imbalances: In many healthcare datasets, certain classes (e.g., diabetic vs. non-diabetic) may be underrepresented. Feature engineering can help mitigate the effects of such imbalances.
Key Processes in Feature Engineering
Feature engineering involves several critical processes, each aimed at improving the utility of raw data in machine learning tasks:
Feature Selection
Feature selection is the process of identifying the most relevant features from the raw dataset. In healthcare, this often involves choosing clinical variables that have the most significant impact on diabetes prediction, such as glucose levels, BMI, and age. Feature selection techniques include:
Filter Methods: Statistical tests are used to assess the relationship between each feature and the target variable, selecting the most significant features.
Wrapper Methods: These methods evaluate subsets of features based on model performance. They are computationally expensive but can yield highly effective feature subsets.
Embedded Methods: Feature selection is done as part of the model training process (e.g., decision trees, Lasso regression).
Handling Missing Data
Missing data is a common challenge in healthcare datasets. Feature engineering involves using various imputation techniques to fill in missing values, such as:
Mean/Median Imputation: Replacing missing values with the mean or median of the available data.
Predictive Imputation: Using other features to predict missing values (e.g., using regression or k-nearest neighbors).
Flagging Missing Values: Creating binary indicators (e.g., "missing" or "not missing") as separate features.
Challenges in Healthcare Data
Healthcare data often presents unique challenges that require specific attention during feature engineering:
High Dimensionality: Healthcare datasets can have numerous features, many of which may not contribute meaningfully to the model's predictions. This can result in overfitting, where the model learns noise instead of genuine patterns.
Data Imbalance: In many healthcare contexts, some classes (e.g., non-diabetic individuals) may be overrepresented, while others (e.g., diabetic patients) are underrepresented. This imbalance can bias the model, leading to poor generalization on minority classes.
Missing Data and Noise: Healthcare data often has missing or incomplete information, as well as errors or inconsistencies in the data. Feature engineering can help mitigate these issues by applying appropriate imputation and cleaning techniques.
Feature Engineering Techniques for Diabetes Mellitus Prediction
Feature engineering is a critical step in improving the accuracy and performance of machine learning models, especially when predicting diseases such as Diabetes Mellitus. The effectiveness of these models largely depends on the quality of the input features, and careful manipulation of raw data can significantly enhance model predictive power. This section explores various feature engineering techniques specifically applied to healthcare data for predicting the onset of diabetes, addressing the challenges inherent in such data, and providing methods to extract the most useful information for model training.
1. Data Preprocessing
Before diving into more advanced feature engineering techniques, data preprocessing plays a fundamental role in preparing healthcare datasets for machine learning models. This includes handling missing data, normalizing or standardizing numerical features, and encoding categorical data.
Handling Missing Data:
Missing data is a common issue in healthcare datasets. Different strategies for imputation include:
Mean/Median Imputation: Replacing missing values with the mean or median of the respective column (often used for continuous variables such as blood glucose levels or BMI).
Predictive Imputation: Using machine learning models (e.g., regression or k-nearest neighbors) to predict missing values based on other available features.
Multiple Imputation: A more advanced technique that generates multiple predictions for missing values to account for uncertainty.
Deletion: Removing records with missing values if the proportion of missing data is relatively low.
Normalization and Standardization:
Healthcare data often contains numerical features with different scales (e.g., glucose levels, age, and BMI). To ensure the model treats all features equally:
Normalization: Rescaling features to a specific range (e.g., 0 to 1).
Standardization: Scaling features to have zero mean and unit variance. This is particularly important for algorithms like support vector machines (SVMs) and k-nearest neighbors (KNN).
Encoding Categorical Data:
Many healthcare features (e.g., gender, ethnicity, and lifestyle factors) are categorical and need to be converted into numerical representations:
One-Hot Encoding: Creating binary columns for each category (e.g., creating separate columns for "Male" and "Female").
Ordinal Encoding: Assigning a rank to ordered categories (e.g., low, medium, high risk).
Target Encoding: Replacing categories with the mean of the target variable (e.g., risk of diabetes) for each category.
2. Feature Selection
Feature selection is the process of identifying and retaining the most relevant variables, which significantly influence the model’s prediction capability. Effective feature selection prevents overfitting, improves model generalization, and reduces computational complexity. In diabetes prediction, the most informative features often include demographic factors, medical history, and clinical measurements.
Filter Methods:
These methods involve statistical tests to assess the strength of the relationship between each feature and the target variable. Common tests include:
Chi-square Test: Used for categorical data to check if there’s a significant relationship between features (e.g., smoking status) and the target variable.
ANOVA (Analysis of Variance): Tests for continuous features to see if there are statistically significant differences between diabetes and non-diabetes groups.
Wrapper Methods:
These methods evaluate subsets of features by training a model using each subset and selecting the one with the best performance. Common wrapper methods include:
Recursive Feature Elimination (RFE): Recursively removes the least significant features to improve model performance.
Embedded Methods:
These methods perform feature selection as part of the model training process. Common embedded techniques include:
Lasso Regression (L1 Regularization): Helps in feature selection by penalizing coefficients of less important features to zero.
Decision Trees: Feature importance is evaluated based on how well the features split the data in tree-building algorithms like Random Forests and Gradient Boosting.
4. Dimensionality Reduction
Healthcare datasets can have a large number of features, which can lead to overfitting or computational inefficiency. Dimensionality reduction techniques help reduce the number of features while retaining as much information as possible.
Principal Component Analysis (PCA):
PCA is a technique that reduces the dimensionality of the dataset by projecting it onto a smaller number of uncorrelated features (principal components). PCA can be useful in healthcare datasets with many intercorrelated features, such as various lab test results or biomarkers.
t-Distributed Stochastic Neighbor Embedding (t-SNE):
t-SNE is a technique often used for visualization and exploring high-dimensional data. It is particularly useful for reducing dimensions while preserving the local structure of the data, making it easier to spot patterns in diabetes prediction datasets.
5. Domain-Specific Features
Incorporating domain knowledge is an essential part of feature engineering for diabetes prediction. Clinical expertise can help identify critical features that may not be immediately obvious in raw data.
Clinical and Demographic Features:
Features such as age, BMI, family history, and medical history of conditions like hypertension or hyperlipidemia are known to be strong predictors of diabetes. Incorporating these as new features can improve model accuracy.
Risk Scores:
Features derived from established clinical risk scores, such as the Framingham Diabetes Risk Score or the American Diabetes Association’s risk classification, can provide useful inputs for predictive models.
Comorbidities and Medication Usage:
Including features such as the use of medications (e.g., insulin or metformin), or presence of comorbid conditions (e.g., obesity, hypertension), can enrich models by accounting for the broader health context of patients.
6. Time-Series Analysis
For patients with historical data, time-series features can be crucial in predicting the onset of diabetes, especially for type 2 diabetes, where changes in biomarkers (e.g., blood glucose levels) over time are significant indicators.
Trend Analysis:
Analyzing the trend of key features over time (e.g., glucose levels or BMI) can reveal early signs of diabetes development, especially if data spans several years or includes regular check-ups.
Lag Features:
Using lag features allows for the inclusion of previous time steps (e.g., glucose levels from the last check-up) as predictors, which can help capture the progression of the disease.
Challenges and Ethical Considerations in Feature Engineering for Diabetes Mellitus Prediction
While feature engineering plays a critical role in improving the performance of machine learning models for diabetes mellitus prediction, it is not without its challenges. Additionally, the use of machine learning models in healthcare raises important ethical concerns that must be addressed to ensure fairness, transparency, and accountability in decision-making. This section explores both the technical challenges and the ethical considerations that arise in the process of feature engineering for diabetes prediction.
1. Technical Challenges in Feature Engineering
A. Data Quality and Availability
Missing Data:
One of the most common challenges in healthcare datasets is missing or incomplete data. Diabetes datasets may contain missing information about key features such as glucose levels, medical history, or demographic factors. Handling missing data through imputation or exclusion can be difficult, as poor imputation methods might lead to biased predictions, while dropping missing data could reduce the dataset's size and predictive power.
Inconsistent Data:
Healthcare data often come from various sources, such as electronic health records (EHRs), clinical trials, and laboratory tests. This can lead to inconsistencies in data formats, units, or terminologies. For example, blood glucose readings might be reported in different units (mg/dL vs. mmol/L), requiring standardization. Feature engineering techniques must account for these inconsistencies to ensure that the data used for model training is reliable.
B. High Dimensionality
Healthcare data can be very high-dimensional, with numerous features (e.g., lab results, vitals, genetic markers, lifestyle information) that may not all be relevant for diabetes prediction. While dimensionality reduction techniques such as PCA can help reduce the number of features, the risk of losing important information is always present. Identifying and retaining the most informative features requires deep domain expertise and careful feature selection.
C. Data Imbalance
In many healthcare datasets, especially those for chronic conditions like diabetes, there may be an imbalance between the number of diabetic and non-diabetic individuals. This class imbalance can lead to poor model performance, particularly in predicting the minority class (diabetic patients). Addressing class imbalance requires specialized techniques such as oversampling the minority class, undersampling the majority class, or using appropriate evaluation metrics (e.g., F1 score, precision-recall curve) to ensure that the model does not become biased toward the majority class.
D. Noise and Outliers
Healthcare data often contains noise and outliers, which can distort the relationship between features and target variables. For instance, a sudden spike in glucose levels due to an acute infection may not reflect the patient's usual health status. Feature engineering must include outlier detection and handling techniques to ensure that these outliers do not adversely affect model training.
E. Temporal Dynamics in Healthcare Data
Diabetes is a chronic disease, and its progression is influenced by changes over time, such as shifts in glucose levels or lifestyle factors. Temporal or longitudinal data introduces challenges in capturing and engineering time-related features. Models must account for these temporal dynamics, whether through time-series analysis or by creating lag features, which increases the complexity of feature engineering.
2. Ethical Considerations in Feature Engineering for Diabetes Prediction
- A.
Fairness and Bias in Feature Selection
Bias in Data Collection:
Healthcare data may inherently contain biases, such as underrepresentation of certain demographic groups (e.g., ethnic minorities, low-income populations). These biases can be unintentionally introduced into feature engineering, especially if the data does not fully capture the diversity of the population. This can result in models that perform poorly for underrepresented groups, exacerbating healthcare disparities.
Discriminatory Features:
Some features, such as race, gender, or socioeconomic status, may be correlated with diabetes risk but may also introduce ethical concerns. Using such features can lead to biased models that make predictions based on demographic characteristics rather than health-related factors. For example, a model that uses race as a feature may reinforce existing healthcare disparities and discrimination. Ethical feature engineering requires careful consideration of which features to include and how they are treated to avoid perpetuating systemic bias.
- B.
Privacy and Confidentiality of Patient Data
Healthcare data is highly sensitive, and privacy concerns are paramount. Feature engineering often involves accessing and processing large datasets containing personal health information, which may include sensitive details like medical history, genetic data, or lifestyle information. Ensuring patient confidentiality and complying with regulations such as the Health Insurance Portability and Accountability Act (HIPAA) or the General Data Protection Regulation (GDPR) is crucial during the feature engineering process.
Anonymization and De-identification:
To protect patient privacy, healthcare data used in model development must be anonymized or de-identified, especially when dealing with sensitive attributes. However, de-identification can sometimes compromise the quality of the data, leading to challenges in feature engineering. Striking a balance between privacy and the utility of the data is a key ethical challenge.
- C.
Transparency and Explainability of Models
Black-Box Models:
Machine learning models, especially complex ones like deep learning or ensemble methods (e.g., random forests, gradient boosting), are often considered "black boxes," meaning their decision-making process is not easily interpretable. This lack of transparency can be problematic in healthcare, where clinicians and patients need to understand the rationale behind predictions for informed decision-making. Feature engineering can help improve the explainability of models by selecting interpretable features and ensuring that the model's decision-making process aligns with clinical knowledge.
Clinician Trust and Accountability:
Healthcare professionals must trust the model's predictions to make informed decisions about patient care. Ethical feature engineering can enhance model explainability by ensuring that the features used are clinically relevant and that the model's output can be communicated clearly to healthcare providers. Additionally, clear documentation of how features were selected and engineered can help ensure accountability and transparency in the modeling process.
- D.
Informed Consent
Consent for Data Usage:
Patients whose data is used for diabetes prediction models must give informed consent. This includes understanding how their data will be used, what features will be engineered from their data, and how the resulting models may impact clinical decisions. Transparency in the feature engineering process is crucial for obtaining informed consent and maintaining public trust in healthcare technology.
- E.
Impact of Predictions on Patient Care
Clinical Decision-Making:
The predictions made by machine learning models can significantly impact patient care decisions, such as diagnosing diabetes, recommending lifestyle changes, or prescribing medication. Ethical feature engineering ensures that the model does not misclassify patients or lead to harmful consequences due to biased or incomplete features. Furthermore, models should always complement, not replace, clinical expertise, and healthcare providers should be involved in the final decision-making process.
3. Addressing Challenges and Ethical Considerations
To overcome the challenges and mitigate the ethical risks in feature engineering for diabetes prediction, several best practices should be followed:
Collaborative and Inclusive Data Collection:
Ensuring that the dataset represents a diverse population helps mitigate biases and leads to more equitable models. Collaboration with healthcare professionals from diverse backgrounds can also help identify the most relevant and ethically sound features.
Regular Audits and Bias Testing:
Regularly auditing models for biases and discriminatory practices is essential. This can be done through fairness-aware algorithms and testing for equal treatment across demographic groups.
Privacy-Preserving Techniques:
Utilizing privacy-preserving methods such as differential privacy or federated learning can help protect patient data while still allowing for effective feature engineering and model development.
Explainable AI:
Employing techniques for model interpretability, such as LIME (Local Interpretable Model-agnostic Explanations) or SHAP (SHapley Additive exPlanations), can enhance the transparency of diabetes prediction models, making them more trustworthy for clinicians and patients.
Future Directions in Feature Engineering for Diabetes Mellitus Prediction
The field of machine learning, particularly in healthcare, is continuously evolving, and diabetes mellitus prediction is no exception. With advancements in both technology and medical research, the future of feature engineering in this domain is poised for significant innovation. This section outlines potential future directions for improving feature engineering techniques in the prediction of diabetes mellitus, focusing on emerging trends, technological advancements, and the evolving landscape of healthcare data.
1. Integration of Multi-Omics Data
2. Real-Time Data and Wearable Technology
Continuous Monitoring:
The rise of wearable devices that continuously monitor vital health metrics such as glucose levels, heart rate, activity levels, and sleep patterns provides an opportunity for more dynamic and real-time prediction of diabetes risk. Feature engineering techniques will need to adapt to process real-time streaming data, potentially creating features that capture temporal trends, fluctuations, and contextual health information (e.g., exercise patterns affecting glucose levels).
Example:
Devices like continuous glucose monitors (CGMs) generate a constant stream of data, which could be used to create features such as glucose variability or average glucose over different time windows, allowing for more granular and timely predictions of diabetes risk.
3. Leveraging Artificial Intelligence for Automated Feature Engineering
AutoML and Deep Feature Learning:
Automated Machine Learning (AutoML) tools are evolving to automate many steps in the machine learning pipeline, including feature engineering. AutoML platforms can identify the most relevant features automatically through advanced algorithms and neural network architectures, potentially improving the quality and efficiency of feature engineering for diabetes prediction. In parallel, deep learning models, particularly those leveraging autoencoders or other unsupervised techniques, may be able to automatically learn higher-level features from raw data without manual intervention.
Example:
Deep learning models like convolutional neural networks (CNNs) or recurrent neural networks (RNNs) could potentially learn complex representations of health data from wearables or EHRs, automating the extraction of meaningful features for prediction tasks.
4. Personalized and Precision Medicine
Tailored Features for Individual Risk Profiles:
Feature engineering will increasingly need to focus on personalized medicine, where models are designed to predict diabetes risk based on individual patient characteristics, such as genetic predispositions, lifestyle factors, and family history. This personalized approach requires feature engineering techniques that can effectively integrate and weight individual risk factors.
Example:
By combining genomics, lifestyle data (e.g., physical activity, diet), and clinical features (e.g., HbA1c levels), feature engineering could create tailored features that reflect the unique risk profile of each individual, allowing for more accurate and targeted predictions of diabetes development.
5. Advanced Causality and Interpretability Methods
Causal Inference:
Current machine learning models often focus on correlation rather than causation, which can be a limitation in healthcare settings. Future feature engineering techniques may incorporate methods from causal inference to better understand the cause-and-effect relationships between various risk factors (e.g., diet, genetics, lifestyle) and diabetes. Causal models could help identify the most impactful interventions to prevent or manage diabetes, beyond just identifying correlations.
Example:
Causal models could help to identify whether certain lifestyle interventions (e.g., reducing sugar intake) directly lead to improved glucose control or whether other factors (e.g., medication adherence) mediate the effect. These insights would require feature engineering methods that capture these causal relationships.
Explainability and Interpretability:
As machine learning models become more complex, there is a growing emphasis on ensuring that predictions are interpretable and explainable. Future feature engineering techniques will focus on creating features that are not only predictive but also easily understandable by healthcare providers. Incorporating domain-specific features, such as clinical risk scores, into the feature engineering process will improve model transparency and trust.
6. Improved Handling of Unstructured Data
Text and Image Data Integration:
Healthcare data is often unstructured, such as text from electronic health records (EHRs), medical notes, radiological images, or pathology reports. Advanced natural language processing (NLP) and computer vision techniques will enable feature engineering to extract meaningful insights from these unstructured sources. Combining structured data (e.g., glucose levels, BMI) with unstructured data (e.g., physician notes, medical imaging) will provide a more comprehensive picture of a patient’s health, leading to better predictions.
Example:
Text data from medical records can be processed through NLP to extract features such as comorbidities, treatment plans, or specific symptoms (e.g., diabetic neuropathy), which can then be integrated with structured data to enhance model predictions.
7. Ethical and Fairer Data Usage
Bias Mitigation Techniques:
Addressing bias and fairness in feature engineering will continue to be a critical concern. Future advancements will focus on developing more sophisticated bias mitigation techniques that ensure machine learning models are fair, equitable, and do not perpetuate existing healthcare disparities. For example, ensuring that underrepresented groups (e.g., ethnic minorities) are adequately represented in training data and that feature selection does not inadvertently introduce discrimination will be crucial.
Example:
Techniques such as adversarial debiasing or fairness constraints in model training could be applied to ensure that diabetes prediction models do not favor certain demographic groups over others, leading to more equitable healthcare outcomes.
8. Collaboration Between AI and Medical Experts
Interdisciplinary Collaboration:
The future of feature engineering for diabetes prediction will involve deeper collaboration between AI specialists, healthcare professionals, and domain experts. Interdisciplinary teams can provide valuable insights into which features are most clinically relevant and how they should be processed. This collaboration will also be important in refining models to ensure that predictions align with real-world clinical practices and patient needs.
Example:
Clinicians may contribute domain-specific knowledge on the significance of certain biomarkers, lifestyle factors, or comorbidities, helping data scientists engineer features that reflect the clinical reality of diabetes diagnosis and treatment.
Conclusion
Feature engineering is a foundational component in the development of accurate and reliable machine learning models for predicting diabetes mellitus, a prevalent and complex chronic condition. The process of carefully selecting, transforming, and creating features from raw healthcare data has shown to significantly improve model performance, making predictions more precise and actionable. By leveraging clinical, demographic, and lifestyle factors, as well as emerging data sources like genetic information and real-time monitoring, feature engineering has the potential to enhance diabetes prediction models, enabling early detection and personalized treatment strategies.
However, the process of feature engineering is not without its challenges. Data quality issues such as missing or inconsistent data, high dimensionality, and class imbalance can hinder the effectiveness of machine learning models. Moreover, ethical concerns around fairness, privacy, and transparency must be carefully considered to ensure that diabetes prediction models do not exacerbate existing healthcare disparities or violate patient trust. As the field evolves, addressing these technical and ethical challenges will be essential to maximizing the positive impact of machine learning in healthcare.
The future of feature engineering for diabetes mellitus prediction is promising, with the integration of multi-omics data, the rise of wearable devices for real-time health monitoring, and advancements in AI and machine learning techniques. Personalized medicine, causal inference, and enhanced model interpretability are poised to further improve the accuracy and fairness of predictions. The collaboration between AI experts and healthcare professionals will also be crucial to ensuring that the engineered features are clinically relevant and actionable.
Ultimately, feature engineering will continue to play a pivotal role in transforming healthcare by enabling more effective, timely, and individualized care for diabetes patients. As the field progresses, it will be essential to maintain a balance between technological innovation and ethical responsibility, ensuring that the tools developed are not only powerful but also equitable and trustworthy in serving all populations.
References
- Fatima, S. (2024b). Transforming Healthcare with AI and Machine Learning: Revolutionizing Patient Care Through Advanced Analytics. International Journal of Education and Science Research Review, Volume-11(Issue6). https://www.researchgate.net/profile/Sheraz-Fatima/publication/387303877_Transforming_Healthcare_with_AI_and_Machine_Learn ing_Revolutionizing_Patient_Care_Through_Advanced_Analytics/links/676737fe00aa3770e0b29fdd/Transforming-Healthcare-with-AI-and-Machine-Learning-RevolutionizingPatient-Care-Through-Advanced-Analytics.pdf.
- Henry, Elizabeth. Deep learning algorithms for predicting the onset of lung cancer. No. 13589. EasyChair, 2024.
- Fatima, S. (2024). PUBLIC HEALTH SURVEILLANCE SYSTEMS: USING BIG DATA ANALYTICS TO PREDICT INFECTIOUS DISEASE OUTBREAKS. International Journal of Advanced Research in Engineering Technology & Science, Volume-11(Issue-12). https://www.researchgate.net/profile/Sheraz-Fatima/publication/387302612_PUBLIC_HEALTH_SURVEILLANCE_SYSTEMS_USING_BIG_DATA_ANALYTICS_TO_PREDICT_INFECTIOUS_DISEASE_OUTBREAKS/links/676736b7894c5520852267d9/PUBLIC-HEALTH-SURVEILLANCESYSTEMS-USING-BIG-DATA-ANALYTICS-TO-PREDICT-INFECTIOUSDISEASE-OUTBREAKS.pdf.
- Luz, Ayuns. Role of Healthcare Professionals in Implementing Machine Learning-Based Diabetes Prediction Models. No. 13590. EasyChair, 2024.
- Sheriffdeen, Kayode, and Samon Daniel. Explainable artificial intelligence for interpreting and understanding diabetes prediction models. No. 2516-2314. Report, 2024.
- Zierock, B. Chaotic Customer Centricity, HCI International 2023 Posters, Springer Nature Switzerland (2023).
- Zierock, Benjamin, Sieer Angar, and Mareike Rimmler. "Strategic Transformation and Agile thinking in Healthcare Projects." (2023).10.56831/PSEN-03-079.
- Zierock, Benjamin, Matthias Blatz, and Kris Karcher. "Team-Centric Innovation: The Role of Objectives and Key Results (OKRs) in Managing Complex and Challenging Projects." In Proceedings of the 15th International Conference on Applied Human Factors and Ergonomics (AHFE 2024). 2024.
- Zierock, Benjamin, Matthias Blatz, and Sieer Angar. "Transfer and Scale-Up of Agile Frameworks into Education: A Review and Retrospective of OKR and SCRUM." SCIREA Journal of Education 9, no. 4 (2024): 20-37.
- Fatima, S. (2024a). HEALTHCARE COST OPTIMIZATION: LEVERAGING MACHINE LEARNING TO IDENTIFY INEFFICIENCIES IN HEALTHCARE SYSTEMS. International Journal of Advanced Research in Engineering Technology & Science, volume 10(Issue-3). https://www.researchgate.net/profile/Sheraz-Fatima/publication/387304058_HEALTHCARE_COST_OPTIMIZATION_LEVERAGING_MACHINE_LEARNING_TO_IDENTIFY_INEFFICIENCIES_IN_HEALTHCARESYSTEMS/links/67673551e74ca64e1f242064/HEALTHCARE-COSTOPTIMIZATION-LEVERAGING-MACHINE-LEARNING-TO-IDENTIFY-INEFFICIENCIES-IN-HEALTHCARE-SYSTEMS.pdf.
- Fatima, S. (2024b). Improving Healthcare Outcomes through Machine Learning: Applications and Challenges in Big Data Analytics. International Journal of Advanced Research in Engineering Technology & Science, Volume-11(Issue-12). https://www.researchgate.net/profile/Sheraz-Fatima/publication/386572106_Improving_Healthcare_Outcomes_through_Machine_Le arning_Applications_and_Challenges_in_Big_Data_Analytics/links/6757324234301c1fe 945607f/Improving-Healthcare-Outcomes-through-Machine-Learning-Applications-andChallenges-in-Big-Data-Analytics.pdfHenry, Elizabeth. "Understanding the Role of Machine Learning in Early Prediction of Diabetes Onset." (2024).
- Fatima, Sheraz. "PREDICTIVE MODELS FOR EARLY DETECTION OF CHRONIC DISEASES LIKE CANCER." Olaoye, G (2024).
- Reddy, M., Galla, E. P., Bauskar, S. R., Madhavram, C., & Sunkara, J. R. (2021). Analysis of Big Data for the Financial Sector Using Machine Learning Perspective on Stock Prices. Available at SSRN 5059521.
- Kuraku, C., Gollangi, H. K., Sunkara, J. R., Galla, E. P., & Madhavram, C. (2024). Data Engineering Solutions: The Impact of AI and ML on ERP Systems and Supply Chain Management. Nanotechnology Perceptions, 20(S9), 10-62441.
- Galla, E. P., Kuraku, C., Gollangi, H. K., Sunkara, J. R., & Madhavaram, C. R. AI-DRIVEN DATA ENGINEERING.
- Galla, E. P., Rajaram, S. K., Patra, G. K., Madhavram, C., & Rao, J. (2022). AI-Driven Threat Detection: Leveraging Big Data For Advanced Cybersecurity Compliance. Available at SSRN 4980649.
- Reddy, Mohit Surender, Manikanth Sarisa, Siddharth Konkimalla, Sanjay Ramdas Bauskar, Hemanth Kumar Gollangi, Eswar Prasad Galla, and Shravan Kumar Rajaram. "Predicting tomorrow’s Ailments: How AI/ML Is Transforming Disease Forecasting." ESP Journal of Engineering & Technology Advancements 1, no. 2 (2021): 188-200.
- Gollangi, H. K., Bauskar, S. R., Madhavaram, C. R., Galla, E. P., Sunkara, J. R., & Reddy, M. S. (2020). Exploring AI Algorithms for Cancer Classification and Prediction Using Electronic Health Records. Journal of Artificial Intelligence and Big Data, 1(1), 65-74.
- Madhavaram, Chandrakanth Rao, Eswar Prasad Galla, Mohit Surender Reddy, Manikanth Sarisa, and Venkata Nagesh. "Predicting Diabetes Mellitus in Healthcare: A Comparative Analysis of Machine Learning Algorithms on Big Dataset." Journal homepage: https://gjrpublication. com/gjrecs 1, no. 01 (2021).
- Galla, P. , Sunkara, R., & Reddy, S. (2020). ECHOES IN PIXELS: THE INTERSECTION OF IMAGE PROCESSING AND SOUND DETECTION THROUGH THE LENS OF AI AND ML.
|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).