Original Article
Journal of Human Genetics (2006) 51, 523–532; doi:10.1007/s10038-006-0393-6
A two-stage design for multiple testing in large-scale association studies
Shu-Hui Wen1, Jung-Ying Tzeng2, Jau-Tsuen Kao3 and Chuhsing Kate Hsiao4
- 1Department of Public Health, College of Medicine, Tzu-Chi University, Hua-Lien, 97004, Taiwan
- 2Department of Statistics and Bioinformatics Research Center, North Carolina State University, Raleigh, NC 27606, USA
- 3Department of Clinical Laboratory Sciences and Medical Biotechnology, College of Medicine, National Taiwan University, Taipei, 100, Taiwan
- 4Division of Biostatistics, Institute of Epidemiology, National Taiwan University, Taipei, 100, Taiwan
Correspondence: Chuhsing Kate Hsiao, Division of Biostatistics, Institute of Epidemiology, National Taiwan University, Taipei, 100, Taiwan. Fax: +886-2-23418562. E-mail: ckhsiao@ha.mc.ntu.edu.tw
Received 9 November 2005; Accepted 14 February 2006; Published online 12 May 2006.
Abstract
Modern association studies often involve a large number of markers and hence may encounter the problem of testing multiple hypotheses. Traditional procedures are usually over-conservative and with low power to detect mild genetic effects. From the design perspective, we propose a two-stage selection procedure to address this concern. Our main principle is to reduce the total number of tests by removing clearly unassociated markers in the first-stage test. Next, conditional on the findings of the first stage, which uses a less stringent nominal level, a more conservative test is conducted in the second stage using the augmented data and the data from the first stage. Previous studies have suggested using independent samples to avoid inflated errors. However, we found that, after accounting for the dependence between these two samples, the true discovery rate increases substantially. In addition, the cost of genotyping can be greatly reduced via this approach. Results from a study of hypertriglyceridemia and simulations suggest the two-stage method has a higher overall true positive rate (TPR) with a controlled overall false positive rate (FPR) when compared with single-stage approaches. We also report the analytical form of its overall FPR, which may be useful in guiding study design to achieve a high TPR while retaining the desired FPR.
Keywords:
Association studies, Cost-effectiveness, False positive rate, Multiple testing, Two-stage design
MORE ARTICLES LIKE THIS
These links to content published by NPG are automatically generated.
RESEARCH
Identification of mutations in the arginine vasopressin receptor 2 gene causing nephrogenic diabetes insipidus in Chinese patientsJournal of Human Genetics Original Article
A grid-search algorithm for optimal allocation of sample size in two-stage association studiesJournal of Human Genetics Original Article
Response of Nasopharyngeal Carcinoma Cells to Epstein-Barr Virus Infection In VitroLaboratory Investigation Article Response

