Replying to: T. Spisak et al. Nature https://doi.org/10.1038/s41586-023-05745-x (2023)

In our previous study1, we documented the effect of sample size on the reproducibility of brain-wide association studies (BWAS) that aim to cross-sectionally relate individual differences in human brain structure (cortical thickness) or function (resting-state functional connectivity (RSFC)) to cognitive or mental health phenotypes. Applying univariate and multivariate methods (for example, support vector regression (SVR)) to three large-scale neuroimaging datasets (total n ≈ 50,000), we found that overall BWAS reproducibility was low for n < 1,000, due to smaller-than-expected effect sizes. When samples and true effects are small, sampling variability and/or overfitting can generate ‘statistically significant’ associations that are likely to be reported due to publication bias, but are not reproducible2,3,4,5, and we therefore suggested that BWAS should build on recent precedents6,7 and continue to aim for samples in the thousands. In the accompanying Comment, Spisak et al.8 agree that larger BWAS are better5,9, but argue that “multivariate BWAS effects in high-quality datasets can be replicable with substantially smaller sample sizes in some cases” (n = 75–500); this suggestion is made on the basis of analyses of a selected subset of multivariate cognition/RSFC associations with larger effect sizes, using their preferred method (ridge regression with partial correlations) in a demographically more homogeneous, single-site/scanner sample (Human Connectome Project (HCP), n = 1,200, aged 22–35 years).

There is no disagreement that a minority of BWAS effects can replicate in smaller samples, as shown with our original methods1. Using the exact methodology (including cross-validation) and code of Spisak et al.8 to repeat 64 multivariate BWAS in the 21-site, larger and more diverse Adolescent Brain Cognitive Development Study (ABCD, n = 11,874, aged 9–11 years), we found that 31% replicated at n = 1,000, dropping to 14% at n = 500 and none at n = 75. Contrary to the claims of Spisak et al.8, when their approach was applied to this larger, more diverse dataset, replication failure was the outcome in most cases. Basing general BWAS sample size recommendations on the largest effects has at least two fundamental flaws: (1) failing to detect other true effects (for example, reducing the sample size from n = 1,000 to n = 500 leads to a 55% false-negative rate), thereby restricting BWAS scope, and (2) inflation of reported effects3,10,11,12. Thus, regardless of the method, associations based on small samples can remain distorted and lack generalizability until confirmed in large, diverse, independent samples.
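To make the replication criterion concrete, a minimal sketch of a split-half replication test with a permutation null is shown below (in Python with scikit-learn; the variable names, ridge penalty and permutation settings are illustrative assumptions, not the published code of ref. 8 or ref. 1):

```python
# Illustrative sketch only; not the published analysis code.
# X: (n_subjects, n_features) brain features; y: (n_subjects,) phenotype scores.
import numpy as np
from sklearn.linear_model import Ridge

def split_half_replication(X, y, n_per_half=500, n_perm=1000, seed=0):
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(y))[: 2 * n_per_half]
    train, test = idx[:n_per_half], idx[n_per_half:]
    model = Ridge(alpha=1.0).fit(X[train], y[train])        # fit in the discovery half
    y_pred = model.predict(X[test])                         # predict the replication half
    r_obs = np.corrcoef(y[test], y_pred)[0, 1]              # out-of-sample effect size
    # Permutation null: shuffle the replication-half phenotypes and recompute r.
    null = np.array([np.corrcoef(rng.permutation(y[test]), y_pred)[0, 1]
                     for _ in range(n_perm)])
    p_value = (np.sum(null >= r_obs) + 1) / (n_perm + 1)
    return r_obs, p_value
```

In such a scheme, a given BWAS counts as replicating when the out-of-sample correlation in the replication half exceeds the permutation null at the chosen significance threshold.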

We always test for BWAS replication with null models (using permutation tests) of out-of-sample estimates to ensure that our reported reproducibility is unaffected by in-sample overfitting. Nonetheless, Spisak et al.8 argue against plotting inflated in-sample estimates1,10 on the y axis, and out-of-sample values on the x axis, as we did (Fig. 1a). Instead, they propose plotting cross-validated associations from an initial, discovery sample (Fig. 1b (y axis)) against split-half out-of-sample associations (x axis). However, cross-validation—just like split-half validation—estimates out-of-sample, and not in-sample, effect sizes13. The in-sample associations1,10 for the method of Spisak et al.8 (Fig. 1b), that is, from data in the sample used to develop the model, show the same degree of overfitting (Fig. 1a versus Fig. 1b). The plot of Spisak et al.8 (Fig. 1c) simply adds an additional out-of-sample test (cross-validation before split half), and therefore demonstrates the close correspondence between two different methods for out-of-sample effect estimation14. Analogously, we can replace the cross-validation step in the code of Spisak et al.8 with split-half validation (our original out-of-sample test), obtaining split-half effects in the first half of the sample, and then comparing them to the split-half estimates from the full sample (Fig. 1d). The strong correspondences between cross-validation followed by split-half (Spisak et al. method8; Fig. 1c) and repeated split-half validation (Fig. 1d) are guaranteed by plotting out-of-sample estimates (from the same dataset) against one another. Here, plotting cross-validated discovery sample estimates on the y axis (Fig. 1c,d) provides no additional information beyond the x axis out-of-sample values. The critically important out-of-sample predictions, required for reporting multivariate results1, generated using the method of Spisak et al.8 and our method are nearly identical (Fig. 1e).
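The distinction can be illustrated with a small simulation (a sketch under simplified assumptions, not the code of ref. 1 or ref. 8): training correlations are inflated by overfitting, whereas cross-validated and split-half correlations both estimate the same out-of-sample quantity.

```python
# Simulated contrast of in-sample, cross-validated and split-half estimates
# (illustrative assumptions only).
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import KFold

rng = np.random.default_rng(0)
n, p, true_r = 400, 2000, 0.15                      # many features, weak true effect
X = rng.standard_normal((n, p))
y = true_r * X[:, 0] + np.sqrt(1 - true_r**2) * rng.standard_normal(n)

half = n // 2
model = Ridge(alpha=1.0).fit(X[:half], y[:half])

# 1) In-sample (training) correlation: inflated by overfitting.
r_in = np.corrcoef(y[:half], model.predict(X[:half]))[0, 1]

# 2) Cross-validated correlation within the first half: already an out-of-sample estimate.
preds = np.empty(half)
for tr, te in KFold(n_splits=5, shuffle=True, random_state=0).split(X[:half]):
    preds[te] = Ridge(alpha=1.0).fit(X[tr], y[tr]).predict(X[te])
r_cv = np.corrcoef(y[:half], preds)[0, 1]

# 3) Split-half correlation in the held-out second half: also out of sample.
r_split = np.corrcoef(y[half:], model.predict(X[half:]))[0, 1]

print(r_in, r_cv, r_split)    # typically r_in >> r_cv ≈ r_split
```

With many features and a weak true effect, the in-sample correlation approaches 1, while the cross-validated and split-half estimates remain close to each other and to the true effect size.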

Fig. 1: In-sample versus out-of-sample effect estimates in multivariate BWAS.

a–e, Methods comparison between our previous study1 (split-half) and Spisak et al.8 (cross-validation followed by split-half). ‘Marek, Tervo-Clemmens’ and ‘Spisak’ refer to the methodologies described in ref. 1 and ref. 8, respectively. For a–e, HCP 1200 Release (full correlation) data were used to predict age-adjusted total cognitive ability. Analysis code and visualizations (x, y scaling; colours) are the same as in Spisak et al.8. The x axes in a–e always display the split-half out-of-sample effect estimates from the second (replication) half of the data (correlation between true scores and predicted scores; as in Spisak et al.8 and in our previous study1; Supplementary Methods). a, In-sample (training correlation; y axis) as a function of out-of-sample associations (plot convention in our previous study1). b, Matched comparison of the true in-sample association (training correlations, mean across folds; y axis) in the method proposed by Spisak et al.8. c, The proposed correction by Spisak et al.8 that inserts an additional cross-validation step to evaluate the first half of the data, which by definition makes this an out-of-sample association (y axis). d, Replacing the cross-validation step from Spisak et al.8 with a split-half validation provides a different (compared with c) out-of-sample association of the first half of the total data (that is, each of the first-stage split halves is one-quarter of the total data; y axis). The appropriate and direct comparison of in-sample associations between Spisak et al.8 and our previous study1 is b versus a, rather than c versus a. The Spisak et al. method8 (cross-validation followed by split-half validation) does not reduce in-sample overfitting (b) but, instead, adds an additional out-of-sample evaluation (c), which is nearly identical to split-half validation twice in a row (d), and makes it clear why the out-of-sample performance of these two methods is likewise nearly identical. e, Correspondence between out-of-sample associations (to the left-out half) from the additional cross-validation step proposed by Spisak et al.8 (mean across folds; y axis) and the original split-half validation from our previous study1 (x axis). The identity line is shown in black. f, In-sample (r; light blue) and out-of-sample (r; dark blue) associations as a function of sample size. Data are from figure 4a–d of ref. 1. g, Published literature review of multivariate r (y axis) as a function of sample size (data from ref. 15), displayed with permission. For f and g, best-fit lines are displayed in log10 space. h, Overlap of f and g.

As Spisak et al.8 highlight, cross-validation of some type is considered to be standard practice10, and yet the distribution of out-of-sample associations (Fig. 1f (dark blue)) does not match published multivariate BWAS results (Fig. 1g), which have largely ranged from r = 0.25 to 0.9, decreasing with increasing sample size10,15,16. Instead, published effects more closely follow the distribution of in-sample associations (Fig. 1h). This observation suggests that, in addition to small samples, structural problems in academic research (for example, non-representative samples, publication bias, misuse of cross-validation and unintended overfitting) have contributed to the publication of inflated effects12,17,18. A recent biomarker challenge5 showed that cross-validation results continued to improve with the amount of time researchers spent with the data, and that the models with the best cross-validation results performed worse on never-seen held-back data. Thus, cross-validation alone has proven to be insufficient and must be combined with the increased generalizability of large, diverse datasets and independent out-of-sample evaluation in new, never-before-seen data5,10.
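The mechanism behind such inflation can be shown with a simple simulation (an illustrative sketch, not the biomarker-challenge analysis of ref. 5): an analyst repeatedly tweaks a pipeline on pure-noise data and keeps the version with the best cross-validation score, and the selected cross-validated correlation ends up optimistic relative to performance on held-back data.

```python
# Illustrative simulation of how repeated analytic choices tuned against
# cross-validation scores can inflate them (assumed settings throughout).
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_predict

rng = np.random.default_rng(1)
n, p = 300, 1000
X = rng.standard_normal((n, p))
y = rng.standard_normal(n)               # pure noise: no true brain-phenotype effect
X_new = rng.standard_normal((100, p))    # never-seen held-back data
y_new = rng.standard_normal(100)

best_cv, best_feats = -np.inf, None
for trial in range(200):                 # repeated "time spent with the data"
    feats = rng.choice(p, size=50, replace=False)        # one more pipeline tweak
    pred = cross_val_predict(Ridge(alpha=1.0), X[:, feats], y, cv=5)
    r_cv = np.corrcoef(y, pred)[0, 1]
    if r_cv > best_cv:
        best_cv, best_feats = r_cv, feats

final = Ridge(alpha=1.0).fit(X[:, best_feats], y)
r_heldback = np.corrcoef(y_new, final.predict(X_new[:, best_feats]))[0, 1]
print(best_cv, r_heldback)   # the selected CV score exceeds held-back performance
```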

The use of additional cross-validation in the discovery sample by Spisak et al.8 does not affect out-of-sample prediction accuracies (Fig. 1e). However, by using partial correlations and ridge regression on HCP data, they were able to generate higher out-of-sample prediction accuracies than our original results in ABCD (Fig. 2a). The five variables they selected are strongly correlated19 cognitive measures from the NIH Toolbox (mean r = 0.37; compare with the correlation strength for height versus weight, r = 0.44)20 and age (not a complex behavioural phenotype), and are therefore unrepresentative of BWAS as a whole (Fig. 2b (colour versus grey lines)). Because the HCP is the smaller and more homogeneous of the two datasets, we applied the exact method and code of Spisak et al.8 to the ABCD data (Fig. 2c and Supplementary Table 2). At n = 1,000 (training; n = 2,000 total), only 31% of BWAS (44% RSFC, 19% cortical thickness) were replicable (Fig. 2d; defined as in Spisak et al.8; Supplementary Information). Expanding BWAS scope beyond broad cognitive abilities towards complex mental health outcomes therefore requires n > 1,000 (Fig. 2b–d). The single largest BWAS (cognitive ability: RSFC, green) reached replicability only at n = 400 (n = 200 training; n = 200 test), with an approximate 40% decrease in out-of-sample prediction accuracies from HCP to ABCD (Fig. 2e (lighter green, left versus right)). The methods of Spisak et al.8 and our previous study1 returned equivalent out-of-sample reproducibility for this BWAS (cognitive ability: RSFC) in the larger, more diverse ABCD data (Fig. 2e (right, dark versus light green)). Thus, the smaller sample sizes (Fig. 2b,c) that Spisak et al.8 reported as sufficient for out-of-sample reproducibility (Fig. 2e) in the HCP data did not generalize to the larger ABCD dataset. See also our previous study1 for a broader discussion of convergent evidence across the HCP and ABCD datasets.
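For concreteness, a minimal sketch of such a feature pipeline (partial-correlation RSFC features followed by ridge regression) is given below; the shrinkage covariance estimator, regularization grid and variable names are our own illustrative assumptions, not necessarily those used by Spisak et al.8.

```python
# Minimal sketch of partial-correlation RSFC features plus ridge regression
# (illustrative assumptions only).
import numpy as np
from sklearn.covariance import LedoitWolf
from sklearn.linear_model import RidgeCV

def partial_corr_features(timeseries):
    """timeseries: (n_timepoints, n_regions) -> upper-triangle partial correlations."""
    prec = np.linalg.inv(LedoitWolf().fit(timeseries).covariance_)  # shrinkage precision
    d = np.sqrt(np.diag(prec))
    pcorr = -prec / np.outer(d, d)          # partial correlation from the precision matrix
    iu = np.triu_indices_from(pcorr, k=1)
    return pcorr[iu]

# Hypothetical usage (all_timeseries, train_idx, test_idx are placeholders):
# X = np.vstack([partial_corr_features(ts) for ts in all_timeseries])
# model = RidgeCV(alphas=np.logspace(-3, 3, 13)).fit(X[train_idx], y[train_idx])
# r_oos = np.corrcoef(y[test_idx], model.predict(X[test_idx]))[0, 1]
```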

Fig. 2: BWAS reproducibility, scope and prediction accuracy using the method of Spisak et al.

a, Example bootstrapped BWAS of total cognitive ability (green) and null distribution (black) (y axis), as a function of sample size (x axis), from the suggested method of Spisak et al.8 (RSFC by partial correlation; prediction by ridge regression) in the HCP dataset (n = 1,200, 1 site, 1 scanner, 60 min RSFC/participant, 76% white). Sample sizes were log10-transformed for visualization. b, Out-of-sample correlation (between true scores and predicted scores) from ridge regression (y axis; code from Spisak et al.8) as a function of training sample size (x axis, log10 scaling) for 33 cognitive and mental health phenotypes (Supplementary Information) in the HCP dataset. Each line displays a smoothed fit estimate (through penalized splines in generalized additive models) for a brain (RSFC (partial correlations, as proposed by Spisak et al.8) or cortical thickness)–phenotype pair (66 total), based on 100 bootstrapped iterations at sample sizes from 25 to 500 (inclusive) in increments of 25 (20 total bins). Sample sizes were log10-transformed (for visualization) before generalized additive model fitting. c, The same as in b, but in the ABCD dataset (n = 11,874, 21 sites, 3 scanner manufacturers, 20 min RSFC/participant, 56% white), using 32 cognitive and mental health phenotypes at sample sizes of 25, 50, 75 and from 100 to 1,900 (inclusive) in increments of 100 (22 total bins). d, The percentage of brain–phenotype pairs (BWAS) from b and c with significant replication on the basis of the method of Spisak et al.8 (Supplementary Information). e, Comparison of the original method from our previous study1 and the method proposed by Spisak et al.8 at the full split-half sample size of HCP (left) and ABCD (right). Out-of-sample correlations (RSFC with total cognitive ability, y axis) are shown for the method used in our previous study1 (dark green; RSFC by correlation, PCA, SVR) and by Spisak et al.8 (light green; RSFC by partial correlation, ridge regression). Repeating the method proposed by Spisak et al.8 in ABCD (right) and comparing it to the method used in our previous study1 results in a very similar out-of-sample r. f, Simulated individual studies (light green circles; n = 1,000 per sample size) and meta-analytic estimates (black dot, ±1 s.d.) using the method of Spisak et al.8 (partial correlations in the HCP dataset) for the largest univariate association (left; y axis, bivariate correlation) and multivariate association (right; y axis, out-of-sample correlation) for total cognitive ability versus RSFC, as a function of total sample size (x axis; bivariate correlation for sample sizes of 50, 200 and 1,000, and multivariate sum of training and test samples, each 25, 100 and 500). For univariate approaches, studies of any sample size, when appropriately aggregated to a large total sample size, can correctly estimate the true effect size. However, for multivariate approaches, even when aggregating across 1,000 independent studies, studies with a small sample size produce prediction accuracies that are downwardly biased relative to large-sample studies, highlighting the need for large samples in multivariate analyses.
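The sample-size curves in b and c follow the general resampling logic sketched below; this is an illustration only (the subsampling scheme, ridge settings and function names are assumptions, not the published code).

```python
# Illustrative sketch of estimating out-of-sample r across training sample sizes.
import numpy as np
from sklearn.linear_model import Ridge

def accuracy_by_sample_size(X, y, sample_sizes, n_iter=100, seed=0):
    """Return {n: array of out-of-sample correlations} for each training sample size n."""
    rng = np.random.default_rng(seed)
    curves = {}
    for n in sample_sizes:
        rs = []
        for _ in range(n_iter):
            idx = rng.permutation(len(y))[: 2 * n]   # draw matched train and test subsamples
            tr, te = idx[:n], idx[n:]
            model = Ridge(alpha=1.0).fit(X[tr], y[tr])
            rs.append(np.corrcoef(y[te], model.predict(X[te]))[0, 1])
        curves[n] = np.array(rs)
    return curves

# For example, sample_sizes = range(25, 501, 25) gives the 20 HCP-style bins; a smoothed
# curve (for example, penalized splines in a generalized additive model) can then be
# drawn through the per-bin estimates for visualization.
```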

Notably, the objections of Spisak et al.8 raise additional reasons to stop the use of smaller samples in BWAS that were not highlighted in our original article. Multivariate BWAS prediction accuracies—absent overfitting—are systematically suppressed in smaller samples5,9,21, as prediction accuracy scales with increasing sample size1,9. Thus, the claim that “cross-validated discovery effect-size estimates are unbiased” does not account for this downward bias or for out-of-dataset generalizability. In principle, if unintended overfitting and publication bias could be fully eliminated, meta-analyses of small-sample univariate BWAS would return the correct association strengths (Fig. 2f (left)). However, meta-analyses of small multivariate BWAS would always be downwardly biased (Fig. 2f (right)). If we are interested in maximizing prediction accuracy, which is essential for the clinical implementation of BWAS22, large samples and advancements in imaging and phenotypic measurements1 are necessary.
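This asymmetry can be reproduced in a toy simulation (illustrative assumptions only, not the simulation behind Fig. 2f): averaging many small-sample univariate correlations approximately recovers the true effect, whereas averaging many small-sample cross-validated prediction accuracies remains downwardly biased.

```python
# Toy simulation of univariate versus multivariate meta-analytic behaviour
# (assumed parameters; one informative feature among many noise features).
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_predict

rng = np.random.default_rng(2)
p, true_r, n_studies = 500, 0.3, 100

def simulate_study(n):
    X = rng.standard_normal((n, p))
    y = true_r * X[:, 0] + np.sqrt(1 - true_r**2) * rng.standard_normal(n)
    r_uni = np.corrcoef(X[:, 0], y)[0, 1]                  # univariate effect size
    pred = cross_val_predict(Ridge(alpha=10.0), X, y, cv=5)
    r_multi = np.corrcoef(y, pred)[0, 1]                   # multivariate prediction accuracy
    return r_uni, r_multi

for n in (50, 200, 1000):
    mean_uni, mean_multi = np.mean([simulate_study(n) for _ in range(n_studies)], axis=0)
    print(n, round(mean_uni, 2), round(mean_multi, 2))
    # the 'meta-analytic' univariate mean stays near 0.3 at every n,
    # whereas the multivariate mean increases with n (biased downward when n is small)
```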

Repeatedly subsampling the same dataset, as Spisak et al.8 and we have done, overestimates reproducibility compared with testing on a truly new, diverse dataset. As in genomics23, generalization failures have been highlighted for BWAS5,24. For example, BWAS models trained on white Americans transferred poorly to African Americans and vice versa (within dataset)24. Historically, BWAS samples have lacked diversity, neglecting marginalized and under-represented minorities25. Large studies with more diverse samples and data aggregation efforts can improve BWAS generalizability and reduce scientific biases contributing to massive health inequities26,27.

Spisak et al.8 worry that “[r]equiring sample sizes that are larger than necessary for the discovery of new effects could stifle innovation”. We appreciate the concern that rarer populations may never be investigated with BWAS. Yet, there are many non-BWAS brain–behaviour study designs (fMRI ≠ BWAS) focused on within-patient effects, repeated sampling and signal-to-noise-ratio improvements that have proven fruitful down to n = 1 (ref. 28). By contrast, the strength of multivariate BWAS lies in leveraging large cross-sectional samples to investigate population-level questions. Sample size requirements should be based on expected effect sizes and real-world impact, not on resource availability. Through large-scale collaboration and clear standards on data sharing, GWAS has reached sample sizes in the millions29,30,31, pushing genomics towards new horizons. Similarly, BWAS analyses of the future will not be limited to statistical replication of the same few strongest effects in small, homogeneous populations, but will also have broad scope, maximum prediction accuracy and excellent generalizability.

Reporting summary

Further information on experimental design is available in the Nature Portfolio Reporting Summary linked to this Article.