Archer, K.J.; Seffernick, A.E.; Sun, S.; Zhang, Y. ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R. Stats2022, 5, 371-384.
Archer, K.J.; Seffernick, A.E.; Sun, S.; Zhang, Y. ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R. Stats 2022, 5, 371-384.
Archer, K.J.; Seffernick, A.E.; Sun, S.; Zhang, Y. ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R. Stats2022, 5, 371-384.
Archer, K.J.; Seffernick, A.E.; Sun, S.; Zhang, Y. ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R. Stats 2022, 5, 371-384.
Abstract
Stage of cancer is a discrete ordinal response that indicates aggressiveness of disease and is often used by physicians to determine the type and intensity of treatment to be administered. For example, the FIGO stage in cervical cancer is based on the size and depth of the tumor as well as the level of spread. It may be of clinical relevance to identify molecular features from high-throughput genomic assays that are associated with stage of cervical cancer, to elucidate pathways related to tumor aggressiveness, identify improved molecular features that may be useful for staging, and identify therapeutic targets. High-throughput RNA-Seq data and corresponding clinical data (including stage) for cervical cancer patients has been made available through The Cancer Genome Atlas Project (TCGA). We recently described penalized Bayesian ordinal response models that can be used for variable selection for over-parameterized datasets such as the TCGA-CESC dataset. Herein, we describe our ordinalbayes R package, available from the Comprehensive R Archive Network (CRAN), which is capable of fitting cumulative logit models when the outcome is ordinal and the number of predictors exceeds the sample size, P>N, such as for TCGA data. We demonstrate use of this package through application to TCGA cervical cancer dataset. Our ordinalbayes package can be used to fit models to high-dimensional dataset and effectively performs variable selection.
MATHEMATICS & COMPUTER SCIENCE, Probability and Statistics
Copyright:
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.