References

Box, G. E. P., and J. S. Hunter. 1957. “Multi-Factor Experimental Designs for Exploring Response Surfaces.” Annals of Mathematical Statistics 28 (1): 195–241. https://doi.org/10.1214/aoms/1177707047.
Bullough, Richard C., and Christopher L. Melby. 1993. “Effect of Inpatient Versus Outpatient Measurement Protocol on Resting Metabolic Rate and Respiratory Exchange Ratio.” Annals of Nutrition and Metabolism 37 (1): 24–32. https://doi.org/10.1159/000177745.
Chang, Clarence D., Oleg K. Kononenko, and Raymond E. Franklin. 1960. “Maximum Data Through a Statistical Design.” Industrial & Engineering Chemistry 52 (11): 939–42. https://doi.org/10.1021/ie50611a030.
Czitrom, Veronica. 1999. “One-Factor-at-a-Time Versus Designed Experiments.” The American Statistician 52 (2): 126–31. https://doi.org/10.1080/00031305.1999.10474445.
Dean, Angela, Daniel Voss, and Danel Draguljić. 2017. Design and Analysis of Experiments. 2nd ed. Springer-Verlag. https://doi.org/10.1007/978-3-319-52250-0.
DeLuca, Laura S., Alex Reinhart, Gordon Weinberg, Michael Laudenbach, Sydney Miller, and David West Brown. 2025. “Developing Students’ Statistical Expertise Through Writing in the Age of AI.” Journal of Statistics and Data Science Education 33 (3): 266–78. https://doi.org/10.1080/26939169.2025.2497547.
Giesbrecht, Francis G., and Marcia L. Gumpertz. 2004. Planning, Construction, and Statistical Analysis of Comparative Experiments. John Wiley & Sons, Inc. https://doi.org/10.1002/0471476471.
Imbens, Guido W., and Donald B. Rubin. 2015. Causal Inference for Statistics, Social, and Biomedical Sciences. Cambridge University Press. https://doi.org/10.1017/CBO9781139025751.
King, James R. 1992. “Presenting Experimental Data Effectively.” Quality Engineering 4 (3): 399–412. https://doi.org/10.1080/08982119208918921.
Klotz, Jerome. 1969. “A Simple Proof of Scheffé’s Multiple Comparison Theorem for Contrasts in the One-Way Layout.” The American Statistician 23 (5): 44–45. https://doi.org/10.2307/2682195.
Lattimore, Tor, and Csaba Szepesvári. 2020. Bandit Algorithms. Cambridge University Press. https://tor-lattimore.com/downloads/book/book.pdf.
Lock Morgan, Kari, and Donald B. Rubin. 2012. “Rerandomization to Improve Covariate Balance in Experiments.” Annals of Statistics 40 (2): 1263–82. https://doi.org/10.1214/12-AOS1008.
Maxwell, Scott E., Ken Kelley, and Joseph R. Rausch. 2008. “Sample Size Planning for Statistical Power and Accuracy in Parameter Estimation.” Annual Review of Psychology 59: 537–63. https://doi.org/10.1146/annurev.psych.59.103006.093735.
Mead, R., S. G. Gilmour, and A. Mead. 2012. Statistical Principles for the Design of Experiments. Cambridge University Press. https://doi.org/10.1017/CBO9781139020879.
O’Brien, Peter C., and Thomas R. Fleming. 1979. “A Multiple Testing Procedure for Clinical Trials.” Biometrics 35 (3): 549–56. https://doi.org/10.2307/2530245.
Pashley, Nicole E., and Luke W. Miratrix. 2022. “Block What You Can, Except When You Shouldn’t.” Journal of Educational and Behavioral Statistics 47 (1): 69–100. https://doi.org/10.3102/10769986211027240.
Pollock, K. H., H. M. Ross-Parker, and R. Mead. 1979. “A Sequence of Games Useful in Teaching Experimental Design to Agriculture Students.” The American Statistician 33 (2): 70–76. https://doi.org/10.1080/00031305.1979.10482663.
Reinhart, Alex, Ben Markey, Michael Laudenbach, Kachatad Pantusen, Ronald Yurko, Gordon Weinberg, and David West Brown. 2025. “Do LLMs Write Like Humans? Variation in Grammatical and Rhetorical Styles.” Proceedings of the National Academy of Sciences 122 (8): e2422455122. https://doi.org/10.1073/pnas.2422455122.
Rosén, Bengt. 1964. “Limit Theorems for Sampling from Finite Populations.” Arkiv För Matematik 5: 383–424. https://doi.org/10.1007/BF02591138.
Rubin, Donald B. 2008. “Comment: The Design and Analysis of Gold Standard Randomized Experiments.” Journal of the American Statistical Association 103 (484): 1350–53. https://doi.org/10.1198/016214508000001011.
Scheffé, Henry. 1953. “A Method for Judging All Contrasts in the Analysis of Variance.” Biometrika 40: 87–104. https://doi.org/10.2307/2333100.
Semitala, Fred C., Jillian L. Kadota, Allan Musinguzi, Fred Welishe, Anne Nakitende, Lydia Akello, Lynn Kunihira Tinka, et al. 2024. “Comparison of 3 Optimized Delivery Strategies for Completion of Isoniazid-Rifapentine (3HP) for Tuberculosis Prevention Among People Living with HIV in Uganda: A Single-Center Randomized Trial.” PLOS Medicine 21 (2): e1004356. https://doi.org/10.1371/journal.pmed.1004356.
Siegmund, David. 1985. Sequential Analysis: Tests and Confidence Intervals. Springer. https://doi.org/10.1007/978-1-4757-1862-1.
Wassmer, Gernot, and Werner Brannath. 2016. Group Sequential and Confirmatory Adaptive Designs in Clinical Trials. Springer. https://doi.org/10.1007/978-3-319-32562-0.
Wu, C. F. Jeff, and Michael Hamada. 2021. Experiments: Planning, Analysis, and Optimization. John Wiley & Sons. https://doi.org/10.1002/9781119470007.