Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Ascertaining Genetics of β—Thalassemia and Sickle Cell Disease Using Machine Learning Heuristics

Version 1 : Received: 18 April 2024 / Approved: 30 April 2024 / Online: 30 April 2024 (03:56:58 CEST)

How to cite: P, A.; SR, A.; Kumar, S.; Singh, G.; Kapoor, S.; Gupta, S.; Valadi, J. K.; Rao, R. S. P.; Polipalli, S. K.; Suravajhala, P. Ascertaining Genetics of β—Thalassemia and Sickle Cell Disease Using Machine Learning Heuristics. Preprints 2024, 2024041951. https://doi.org/10.20944/preprints202404.1951.v1 P, A.; SR, A.; Kumar, S.; Singh, G.; Kapoor, S.; Gupta, S.; Valadi, J. K.; Rao, R. S. P.; Polipalli, S. K.; Suravajhala, P. Ascertaining Genetics of β—Thalassemia and Sickle Cell Disease Using Machine Learning Heuristics. Preprints 2024, 2024041951. https://doi.org/10.20944/preprints202404.1951.v1

Abstract

Hemoglobinopathies are a group of disorders in which the hemoglobin molecule has abnormal production or structure. The hemoglobin molecules in red blood cells (RBC) are impacted by the blood disease known as sickle cell disease (SCD), and Thalassemia is one of the major monogenic disorders that reduces hemoglobin production( Kohne et al ., 2011). This disorder results in a large number of red blood cells being destroyed, leading to anemia. India bears a huge burden of hemoglobinopathies; thalassemia is the most prevalent (Mondal SK et al., 2016). A key component of thalassemia prevention is a successful screening procedure to identify Thalassemia carriers. Effective screening programs have numerous obstacles, especially in environments with limited resources. Machine learning (ML) has been used to solve technical and domain-specific problems in a variety of prognostic and diagnostic medical jobs. In this work, we aimed to identify and analyze the most common mutation of β--thalassemia and sickle cell disease from the north Indian population by employing Machine learning-based algorithms. To accurately predict the carrier state from a simple blood test, and to predict pathogenic hemoglobin variants in a group of individuals, these results demonstrate the application of integrated bioinformatics and machine learning approaches. This study contributes to the validation of the models based on data from several individuals and hemoglobinopathies.

Keywords

hemoglobinopathies; β--thalassemia; machine learning; bioinformatics; genomics

Subject

Biology and Life Sciences, Life Sciences

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.