Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

A New Cloud‐Native Tool for Pharmacogenetic Analysis

Version 1 : Received: 2 February 2024 / Approved: 5 February 2024 / Online: 5 February 2024 (10:54:41 CET)

A peer-reviewed article of this Preprint also exists.

Yuan, D.Y.; Park, J.H.; Li, Z.; Thomas, R.; Hwang, D.M.; Fu, L. A New Cloud-Native Tool for Pharmacogenetic Analysis. Genes 2024, 15, 352. Yuan, D.Y.; Park, J.H.; Li, Z.; Thomas, R.; Hwang, D.M.; Fu, L. A New Cloud-Native Tool for Pharmacogenetic Analysis. Genes 2024, 15, 352.

Abstract

Background: Advancement of next-generation sequencing (NGS) technologies provides opportunities for large-scale Pharmacogenetic (PGx) studies and pre-emptive PGx testing to cover a wide range of genotypes present in diverse populations. However, NGS-based PGx testing is limited by the lack of comprehensive computational tools to support genetic data analysis and clinical decisions. Methods: Bioinformatics utilities specialized for human genomics and the latest cloud-based technologies are used for developing a bioinformatics pipeline for analyzing the genomic sequence data and reporting PGx genotypes. A database was created and integrated in the pipeline for filtering the actionable PGx variants and clinical interpretations. Strict quality verification procedures were conducted on variant calls with whole genome sequencing (WGS) dataset of the 1000 Genomes Project (G1K). The accuracy of PGx allele identification was validated using the whole genome sequencing dataset of the Pharmacogenetics Reference Materials from the Centers for Disease Control and Prevention (CDC). Results: The newly created bioinformatics pipeline, Pgxtools, can analyze genomic sequence data, identify actionable variants in 13 PGx relevant genes, and generate reports annotated with specific interpretations and recommendations based on clinical practice guidelines. Verified with two independent methods, we have found that Pgxtools consistently identifies variants more accurately than the results in the G1K dataset on GRCh37 and GRCh38. Conclusions: Pgxtools provides an integrated workflow for large-scale genomic data analysis and PGx clinical decision support. Implemented with cloud-native technologies, it is highly portable in a wide variety of environments from a single laptop to high performance computing clusters and cloud platforms for different production scales and requirements.

Keywords

Pharmacogenetics; bioinformatics pipeline; cloud‐native technologies; workflow; genomic data analysis

Subject

Computer Science and Mathematics, Software

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.