Submitted:
29 December 2024
Posted:
03 January 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Statistical Method
2.1. Model Setup
2.2. Estimation Method
- If , then is a varying effect predictor.
- If and , then is a constant effect predictor.
- If and , then has no effect on the response.
2.3. The Estimation Step
2.4. Selection of Tuning Parameters
2.4.1. Selection of Tuning Parameters
2.4.2. Selection of Order h and Number of Internal Knots K
2.5. Theoretical Properties
- (i)
- ;
- (ii)
- ,
- (i)
- for ;
- (ii)
- for , where is some non-zero constant;
- (iii)
- for .
3. Simulation
3.1. Continuous G
3.2. Discrete G
4. A Case Study
5. Discussion
Acknowledgments
Appendix A
Appendix A.1. Computational Algorithm
Appendix A.2. Proofs of Theorems
Appendix A.2.1. Notations
Appendix A.2.2. Some Regularity Conditions
Appendix A.2.3. Proof of Theorem 1
Appendix A.2.4. Proof of Theorem 2
Appendix A.3. Additional Real Data Analysis Results
| Gene ID | Gene Symbol | SNP ID |
| GeneID:440600 | RBM15-AS1 | rs6537663 |
| GeneID:2590 | GALNT2 | rs6666516 |
| GeneID:729993 | SHISA9 | rs1015431 |
| GeneID:54768 | HYDIN | rs4788621 |
| GeneID:117532 | TMC2 | rs7509377 |
| GeneID:758 | MPPED1 | rs5766384 |
| GeneID:23395 | LARS2 | rs4311249 |
| GeneID:647107 | LINC01192 | rs2404825 |
| GeneID:8633 | UNC5C | rs3775049 |
| GeneID:2185 | PTK2B | rs6557991 |
| GeneID:4915 | NTRK2 | rs6559870 |
| GeneID:19 | ABCA1 | rs4742969 |
| GeneID:286205 | vSCAI | rs2416996 |
| Gene ID | Gene Symbol | SNP ID |
| GeneID:114827 | FHAD1 | rs3815792 |
| GeneID:2899 | GRIK3 | rs12118788 |
| GeneID:260425 | MAGI3 | rs11102660 |
| GeneID:9857 | CEP350 | rs2293990 |
| GeneID:2590 | GALNT2 | rs9308482 |
| GeneID:6934 | TCF7L2 | rs7901695 |
| GeneID:55742 | PARVA | rs7101596 |
| GeneID:867 | CBL | rs4489755 |
| GeneID:10867 | TSPAN9 | rs740771 |
| GeneID:57494 | RIMKLB | rs11047510 |
| GeneID:196385 | DNAH10 | rs11058132 |
| GeneID:64328 | XPO4 | rs1961415 |
| GeneID:23348 | DOCK9 | rs7326971 |
| GeneID:23348 | DOCK9 | rs7991210 |
| GeneID:57099 | AVEN | rs16962542 |
| GeneID:11060 | WWP2 | rs16970994 |
| GeneID:25780 | RASGRP3 | rs6708570 |
| GeneID:100505498 | LOC100505498 | rs6730602 |
| GeneID:117532 | TMC2 | rs11696526 |
| GeneID:29780 | PARVB | rs5765571 |
| GeneID:25814 | ATXN10 | rs713999 |
| GeneID:9620 | CELSR1 | rs11090812 |
| GeneID:23429 | RYBP | rs17009630 |
| GeneID:80254 | CEP63 | rs11710699 |
| GeneID:8633 | UNC5C | rs10516957 |
| GeneID:157680 | VPS13B | rs1788161 |
References
- Carpenter, D. O., Arcaro, K., & Spink, D. C. (2002). Understanding the human health effects of chemical mixtures. Environmental Health Perspectives, 110(Suppl 1), 25. [CrossRef]
- Colditz, G. A., & Hankinson, S. E. (2005). The Nurses’ Health Study: lifestyle and health among women. Nature Reviews Cancer, 5(5), 388-396. [CrossRef]
- Fan, J., & Li, R. (2001). Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American statistical Association, 96(456), 1348-1360. [CrossRef]
- Field, A. E., Coakley, E. H., Must, A., Spadano, J. L., Laird, N., Dietz, W. H., Rimm, E., & Colditz, G. A. (2001). Impact of overweight on the risk of developing common chronic diseases during a 10-year period. Archives of Internal Medicine, 161(13), 1581–1586. [CrossRef]
- Frank, L. E., & Friedman, J. H. (1993). A statistical view of some chemometrics regression tools. Technometrics, 35(2), 109-135. [CrossRef]
- Hu, F. B., Sigal, R. J., Rich-Edwards, J. W., Colditz, G. A., Solomon, C. G., Willett, W. C., & Manson, J. E. (1999). Walking compared with vigorous physical activity and risk of type 2 diabetes in women: a prospective study. Journal of the American Medical Association, 282(15), 1433–1439. [CrossRef]
- Grant, S. F., Thorleifsson, G., Reynisdottir, I., Benediktsson, R., Manolescu, A., Sainz, J., ... & Styrkarsdottir, U. (2006). Variant of transcription factor 7-like 2 (TCF7L2) gene confers risk of type 2 diabetes. Nature Genetics, 38(3), 320-323. [CrossRef]
- Guan, S., Zhao, M., Cui, Y. (2023). Variable selection for single-index varying-coefficients models with applications to synergistic G×E interactions. Electronic Journal of Statistics, 17(1): 823–857. [CrossRef]
- Jolliffe, I. (2002). Principal component analysis. John Wiley & Sons, Ltd.
- Kruit, J. K., Wijesekara, N., Westwell-Roper, C., Bruce, J., Zhao, M., Kahn, S. E., & Hayden, M. R. (2012). Loss of ATP-binding cassette transporter A1 in mice impairs glucose tolerance and insulin secretion in vivo. Diabetes, 61(12), 3166-3177.
- Morrison, J. L., Gallas, P. R., Zheng, W., Mohtashami, S., & Wong, T. (2020). UNC5C and its relevance to beta-cell apoptosis. Endocrinology, 161(7), bqaa064.
- Ottman, R. (1996). Gene-environment interaction: definitions and study designs. Preventive medicine, 25(6), 764. [CrossRef]
- Rajpathak, S. N., Crandall, J. P., Wylie-Rosett, J., Kabat, G. C., Rohan, T. E., & Hu, F. B. (2009). The role of iron in type 2 diabetes in humans. Biochimica et Biophysica Acta (BBA) - General Subjects, 1790(7), 671–681. [CrossRef]
- Rimm, E. B., Giovannucci, E. L., Willett, W. C., Colditz, G. A., Ascherio, A., Rosner, B., & Stampfer, M. J. (1991). Prospective study of alcohol consumption and risk of coronary disease in men. The Lancet, 338(8765), 464-468. [CrossRef]
- Sale, M. M., Smith, S. G., Mychaleckyj, J. C., Keene, K. L., Langefeld, C. D., Leak, T. S., ... & Freedman, B. I. (2007). Variants of the transcription factor 7-like 2 (TCF7L2) gene are associated with type 2 diabetes in an African-American population enriched for nephropathy. Diabetes, 56(10), 2638-2642. [CrossRef]
- Salmerón, J., Ascherio, A., Rimm, E. B., Colditz, G. A., Spiegelman, D., Jenkins, D. J., Stampfer, M. J., Wing, A. L., & Willett, W. C. (1997). Dietary fiber, glycemic load, and risk of NIDDM in men. Diabetes Care, 20(4), 545–550. [CrossRef]
- Schwarz, G. (1978). Estimating the dimension of a model. The annals of statistics, 6(2), 461-464.
- Schumaker, L. (2007). Spline functions: basic theory. Cambridge University Press.
- Sexton, K., & Hattis, D. (2007). Assessing cumulative health risks from exposure to environmental mixtures-three fundamental questions. Environmental Health Perspectives, 825-832. [CrossRef]
- Singh, J., Kumar, V., Aneja, A., Gupta, S., & Dhingra, S. (2022). Genetic polymorphisms in ABCA1 (rs2230806 and rs1800977) and LIPC (rs2070895) genes and their association with the risk of type 2 diabetes: A case-control study. International Journal of Diabetes in Developing Countries, 42(2), 227-235. [CrossRef]
- Smith, A. R., Jones, K. M., Patel, D., Thomas, C., & Reid, L. (2019). The role of SCAI in cellular stress pathways. Journal of Cell Science, 132(17), jcs231829.
- Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B (Methodological), 267-288.
- van Berge, L., Keating, D. J., Greber, B., O’Neill, C., & Lang, J. (2019). Mitochondrial dysfunction in metabolic syndrome. Frontiers in Endocrinology, 10, 607.
- Wang, Y., Wang, J., He, Z., Tang, X., Zheng, Y., & Liu, H. (2021). Inflammatory pathways in T2D: A focus on PTK2B. Immunometabolism, 3(1), e210006.
- Wang, X., Zhang, J., Li, Y., Wang, Z., Sun, Q., Zhou, H., & Fan, Y. (2023). RBM15 regulates hepatic insulin sensitivity in GDM offspring. Molecular Medicine, 29(1), 45.
- Weissglas-Volkov, D., Aguilar-Salinas, C. A., Nikkola, E., Tusie-Luna, T., Cruz-Bautista, I., Arellano-Campos, O., & Pajukanta, P. (2011). GALNT2 polymorphisms and their role in lipid and glucose metabolism. Human Molecular Genetics, 20(9), 1632-1640.
- Xu, B., Li, X., Nian, K., Li, J., Li, M., Zheng, W., & Han, X. (2019). BDNF and NTRK2 in metabolic regulation. Nature Medicine, 25(5), 697-707.
- Zhang, C. H. (2010). Nearly unbiased variable selection under minimax concave penalty. The Annals of statistics, 38(2), 894-942. [CrossRef]
- Zou, H. (2006). The adaptive lasso and its oracle properties. Journal of the American statistical association, 101(476), 1418-1429. [CrossRef]
- Zou, H., Hastie, T., & Tibshirani, R. (2006). Sparse principal component analysis. Journal of computational and graphical statistics, 15(2), 265-286. [CrossRef]


| n | ||||||||
| Oracle % | IMSEModel | IMSEOracle | Oracle % | IMSEModel | IMSEOracle | |||
| 100.0% | 1.50E-01 | 1.47E-01 | 100.0% | 1.48E-01 | 1.31E-01 | |||
| 99.2% | 1.44E-01 | 2.14E-01 | 99.7% | 1.48E-01 | 1.66E-01 | |||
| 99.4% | 1.34E-01 | 1.61E-01 | 99.7% | 1.37E-01 | 1.43E-01 | |||
| 100.0% | 3.95E-02 | 3.85E-02 | 100.0% | 4.19E-02 | 3.33E-02 | |||
| 99.9% | 5.58E-02 | 5.53E-02 | 100.0% | 5.93E-02 | 4.86E-02 | |||
| Zero | 99.1% | 1.03E-03 | 0 | 99.0% | 1.16E-03 | 0 | ||
| 100.0% | 6.85E-02 | 6.88E-02 | 100.0% | 7.00E-02 | 7.05E-02 | |||
| 100.0% | 5.17E-02 | 6.00E-02 | 100.0% | 5.43E-02 | 6.34E-02 | |||
| 100.0% | 5.46E-02 | 5.99E-02 | 100.0% | 5.40E-02 | 5.88E-02 | |||
| 100.0% | 1.40E-02 | 1.46E-02 | 100.0% | 1.46E-02 | 1.48E-02 | |||
| 100.0% | 1.79E-02 | 1.93E-02 | 100.0% | 1.89E-02 | 1.87E-02 | |||
| Zero | 99.4% | 3.33E-04 | 0 | 99.4% | 3.14E-04 | 0 | ||
| n | ||||||||
| Oracle % | MSEModel | MSEOracle | Oracle % | MSEModel | MSEOracle | |||
| 100.0% | 7.26E-04 | 5.91E-04 | 100.0% | 6.75E-04 | 5.75E-04 | |||
| 100.0% | 1.36E-02 | 1.11E-02 | 100.0% | 3.28E-03 | 5.78E-04 | |||
| 97.3% | 2.34E-04 | 0 | 96.7% | 6.22E-04 | 0 | |||
| 97.5% | 1.90E-04 | 0 | 96.7% | 2.72E-04 | 0 | |||
| 96.8% | 9.01E-04 | 0 | 96.6% | 2.95E-04 | 0 | |||
| 100.0% | 2.76E-04 | 2.68E-04 | 100.0% | 2.77E-04 | 2.75E-04 | |||
| 100.0% | 2.71E-04 | 2.66E-04 | 100.0% | 2.76E-04 | 2.74E-04 | |||
| 99.1% | 5.49E-05 | 0 | 99.6% | 1.57E-05 | 0 | |||
| 99.6% | 2.68E-05 | 0 | 99.4% | 3.16E-05 | 0 | |||
| 99.3% | 4.58E-05 | 0 | 99.4% | 2.22E-05 | 0 | |||
| Function | MAF of |
| - | |
| 0.5 | |
| 0.5 | |
| 0.3 | |
| 0.3 | |
| 0.1 | |
| 0.1 | |
| 0.5 | |
| 0.3 | |
| 0.1 | |
| unif(0.05, 0.5) |
| n | ||||||||
| Oracle % | IMSEModel | IMSEOracle | Oracle % | IMSEModel | IMSEOracle | |||
| 100.0% | 1.63E-01 | 2.28E-01 | 100.0% | 1.75E-01 | 2.29E-01 | |||
| 83.4% | 4.29E-01 | 5.61E-01 | 83.1% | 4.44E-01 | 6.08E-01 | |||
| 87.7% | 3.82E-01 | 4.38E-01 | 87.9% | 3.69E-01 | 4.50E-01 | |||
| 76.0% | 5.43E-01 | 5.87E-01 | 75.5% | 5.50E-01 | 6.17E-01 | |||
| 83.3% | 4.37E-01 | 4.19E-01 | 79.6% | 5.11E-01 | 4.50E-01 | |||
| 23.2% | 1.35E+00 | 1.05E+00 | 22.1% | 1.45E+00 | 1.18E+00 | |||
| 25.7% | 1.27E+00 | 8.93E-01 | 23.7% | 1.31E+00 | 9.22E-01 | |||
| 100.0% | 5.09E-02 | 6.22E-02 | 99.9% | 5.05E-02 | 5.86E-02 | |||
| 99.9% | 6.08E-02 | 7.50E-02 | 99.9% | 5.83E-02 | 6.75E-02 | |||
| 100.0% | 1.05E-01 | 1.24E-01 | 99.9% | 1.13E-01 | 1.21E-01 | |||
| Zero | 99.0% | 2.74E-03 | 0 | 99.2% | 2.15E-03 | 0 | ||
| 100.0% | 7.47E-02 | 8.03E-02 | 100.0% | 7.42E-02 | 8.35E-02 | |||
| 100.0% | 9.46E-02 | 1.38E-01 | 99.9% | 1.02E-01 | 1.49E-01 | |||
| 100.0% | 9.52E-02 | 1.21E-01 | 99.9% | 9.47E-02 | 1.21E-01 | |||
| 100.0% | 1.07E-01 | 1.55E-01 | 99.8% | 1.08E-01 | 1.50E-01 | |||
| 99.9% | 1.05E-01 | 1.37E-01 | 99.8% | 1.05E-01 | 1.30E-01 | |||
| 77.5% | 4.84E-01 | 2.99E-01 | 75.1% | 5.15E-01 | 3.08E-01 | |||
| 77.5% | 4.93E-01 | 2.63E-01 | 73.5% | 5.32E-01 | 2.53E-01 | |||
| 100.0% | 1.89E-02 | 2.11E-02 | 100.0% | 1.75E-02 | 1.97E-02 | |||
| 100.0% | 2.17E-02 | 2.39E-02 | 100.0% | 2.08E-02 | 2.33E-02 | |||
| 100.0% | 4.47E-02 | 4.95E-02 | 100.0% | 4.24E-02 | 4.56E-02 | |||
| Zero | 99.3% | 9.15E-04 | 0 | 99.5% | 6.81E-04 | 0 | ||
| n | ||||||||
| Oracle % | MSEModel | MSEOracle | Oracle % | MSEModel | MSEOracle | |||
| 100.0% | 6.64E-04 | 7.28E-04 | 100.0% | 5.84E-04 | 5.13E-04 | |||
| 100.0% | 7.37E-03 | 7.32E-03 | 100.0% | 2.76E-03 | 3.27E-03 | |||
| 95.2% | 3.64E-04 | 0 | 95.2% | 3.73E-04 | 0 | |||
| 97.0% | 1.21E-04 | 0 | 96.1% | 4.18E-04 | 0 | |||
| 96.1% | 2.04E-04 | 0 | 94.7% | 5.46E-04 | 0 | |||
| n = 2000 | 100.0% | 2.34E-04 | 2.22E-04 | 100.0% | 2.20E-04 | 2.12E-04 | ||
| 100.0% | 2.31E-04 | 2.20E-04 | 100.0% | 2.30E-03 | 2.37E-03 | |||
| 98.9% | 5.00E-05 | 0 | 98.9% | 3.24E-05 | 0 | |||
| 98.7% | 5.44E-05 | 0 | 98.9% | 4.49E-05 | 0 | |||
| 98.9% | 4.37E-05 | 0 | 99.0% | 5.07E-05 | 0 | |||
| act | bmi | alcohol | heme | gl |
| -0.1832 | 0.9544 | 0 | 0.2157 | 0.0947 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).