1. Box GEP, Draper NR. Empirical model-building and response surfaces. New York, USA: Wiley; 1987.
2. Greenland S. Introduction to regression models. In Rothman KJ, Greenland S, Lash TL, eds. Modern Epidemiology, 3rd edn. Philadelphia, USA: Lippincott Williams & Wikins; 2008, pp. 381-417.
3. Le-Rademacher J, Lopez C, Wolfe E, Foster NR, Mandrekar SJ, Wang X, Kumar R, Adjei A, Jatoi A. Weight loss over time and survival: a landmark analysis of 1000+ prospectively treated and monitored lung cancer patients. J Cachexia Sarcopenia Muscle. 2020. doi: 10.1002/jcsm.12625.
4. Dy GK, Mandrekar SJ, Nelson GD, Meyers JP, Adjei AA, Ross HJ, Ansari RH, Lyss AP, Stella PJ, Schild SE, Molina JR, Adjei AA. A randomized phase II study of gemcitabine and carboplatin with or without cediranib as first-line therapy in advanced non-small-cell lung cancer: North Central Cancer Treatment Group Study N0528. J Thorac Oncol. 2013; 8: 79-88.
5. Negrao MV, Lam VK, Reuben A, Rubin ML, Landry LL, Roarty EB, Rinsurongkawong W, Lewis J, Roth JA, Swisher SG, Gibbons DL, Wistuba II, Papadimitrakopoulou V, Glisson BS, Blumenschein GR Jr, Lee JJ, Heymach JV, Zhang J. PD-L1 expression, tumor mutational burden, and cancer gene mutations are stronger predictors of benefit from immune checkpoint blockade than HLA class I genotype in non-small cell lung cancer. J Thorac Oncol. 2019; 14: 1021-1031.
6. Hosmer DW, Lemeshow S, Sturdivant RX. Applied Logistic Regression, 3rd edn. Wiley; 2013.
7. Hastie TJ, Tibshirani RJ. Generalized Additive Models. Boca Raton, USA: Chapman & Hall/CRC; 1990.
8. Harrell FE. Regression Modeling Strategies With Applications to Linear Models, Logistic and Ordinal Regression, and Survival Analysis. Springer; 2015.
9. Greenland S, Pearl J, Robins JM. Causal diagrams for epidemiologic research. Epidemiology 1999; 10: 37-48.
10. Williamson EJ, Aitken Z, Lawrie J, Dharmage SC, Burgess JA, Forbes AB. Introduction to causal diagrams for confounder selection. Respirology 2014; 19: 303-311.
11. Heinze G, Dunkler D. Five myths about variable selection. Transpl Int. 2017; 30: 6-10.
12. Steyerberg EW, Vickers AJ, Cook NR, Gerds T, Gonen M, Obuchowski N, Pencina MJ, Kattan MW. Assessing the performance of prediction models: a framework for traditional and novel measures. Epidemiology 2010; 21: 128-138.
13. Mandrekar SJ, Schild SE, Hillman SL, et al. A prognostic model for advanced stage nonsmall cell lung cancer pooled analysis of north central cancer treatment group trials. Cancer 2006: 107:781-92.
14. Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. 2nd edn. Springer; 2009.
15. Halabi S, Li C, Luo S. Developing and validating risk assessment models of clinical outcomes in modern oncology. JCO Precis Oncol. 2019; 3:10.1200/PO.19.00068.
16. Babyak MA. What you see may not be what you get: a brief, nontechnical introduction to overfitting in regression-type models. Psychosom Med. 2004; 66: 411-421.
17. Moons KG, Altman DG, Reitsma JB, Ioannidis JP, Macaskill P, Steyerberg EW, Vickers AJ, Ransohoff DF, Collins GS. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med. 2015; 162: W1-73.
18. Hernán MA, Robins JM. Using big data to emulate a target trial when a randomized trial is not available. Am J Epidemiol. 2016; 183: 758-764.
19. Steyerberg EW. Clinical Prediction Models: A Practical Approach to Development, Validation, and Updating. 2nd edn. Springer; 2019.
20. Riley RD, Ensor J, Snell KIE, Harrell FE Jr, Martin GP, Reitsma JB, Moons KGM, Collins G, van Smeden M. Calculating the sample size required for developing a clinical prediction model. BMJ. 2020; 368: m441.