Introduction

Reliable prenatal paternity testing requires direct chorionic villus sampling or amniotic fluid sampling via amniocentesis, both of which increase risk of harm to the mother and fetus. Amniocentesis carries a miscarriage risk of 1 in 300 to 1 in 500 (ref. 1) and a small risk of other complications.2,3 Chorionic villus sampling carries a similar miscarriage risk and a 1/3,000 risk of fetal limb reduction defects, especially when performed before 10 weeks of gestation.4,5 Although the American College of Obstetricians and Gynecologists recommends that these procedures be offered to all pregnant women,1 they are most often used when the mother is at a high risk for genetic defects, such as after a positive first-trimester screen.

The discovery of fetal cell-free DNA (cfDNA) in maternal blood suggested that a noninvasive paternity test could be developed, which would avoid the risks associated with invasive procedures.6 Although fetal cfDNA in maternal circulation was initially reported more than a decade ago,6,7,8 the development of a reliable, noninvasive paternity test using maternal peripheral blood has proved elusive because fetal cfDNA typically comprises <20% of the total cfDNA in maternal plasma and is highly fragmented.9 The limited amount of fetal cfDNA and its heavy dilution by maternal cfDNA present significant technical challenges. These can be overcome by coupling microarrays or high-throughput sequencing, which measure hundreds of thousands to millions of DNA segments, with sophisticated bioinformatics techniques to extract maximal information from the limited fetal genotype data obtained from maternal plasma cfDNA measurements.

We previously reported such a method, called Parental Support, which uses single-nucleotide polymorphism (SNP) microarray measurements from maternal and paternal DNA to improve the fidelity of noisy embryonic DNA genotype measurements.10 Although this method accurately identifies ploidy at all 24 chromosomes in single embryonic cells, it has not been adapted to paternity testing.

Here, we use a version of the Parental Support algorithm to determine whether the fetal components of the cfDNA SNP measurements made on cfDNA isolated from maternal plasma could be due to the alleged father. We report the results of testing this new method on 21 pregnant women, with each sample independently tested against the confirmed father as well as 1,820 unrelated individuals. We identify paternity with 100% accuracy as early as 6 weeks into pregnancy.

Materials and Methods

Inclusion criteria for enrolled women/couples were singleton pregnancies in the first or second trimester with known paternity. Couples were enrolled through in vitro fertilization clinics, obstetric offices, or online advertisements. All women who met the inclusion criteria were enrolled during the period from April to June 2011. Gestational ages ranged from 6 to 21 weeks, and paternity was confirmed in various ways ( Table 1 ). Nine of 21 samples had known parentage through control of fertilization during in vitro fertilization, with paternity reconfirmed through genotype-specific preimplantation genetic diagnosis. The remaining 12 samples had paternity confirmed by independent paternity testing on fetal or postnatal genetic material (DNA Diagnostic Center, Fairfield, OH). Written informed consent was obtained from all participants, and genetic samples were collected under an institutional review board–approved research protocol.

Table 1 Paternity inclusions, including the sample type used in confirmatory testing

Maternal blood samples (~20 ml) were collected into cell-free blood tubes (Streck, Omaha, NE), and paternal blood samples (~5 ml) were collected into EDTA blood collection tubes. On average, 10 ml of plasma was isolated from each maternal sample via double centrifugation (1,600g for 10 min, tube transfer, centrifugation at 16,000g for 10 min). Maternal plasma cfDNA was isolated using the Qiagen (Valencia, CA) circulating nucleic acid kit and eluted in 45 µl buffer according to the manufacturer’s instructions. DNA termini were blunted in a reaction using 20 µl eluate in 1x NEB4 buffer, 0.42 mmol/l dNTP, and 2.5U T4 DNA polymerase (New England Biolabs, Ipswich, MA), incubated at 20 °C for 30 min, and the polymerase was denatured by incubating at 75 °C for 15 min. Three microliters of ligation mixture (0.5 µl 10x NEB 4, 1 µl 10 mmol/l adenosine triphosphate, 1 µl T4 polynucleotide kinase (New England Biolabs), 0.5 µl T4 DNA ligase (New England Biolabs)) was added, and samples were incubated at 16 °C for 24 h and then at 75 °C for 15 min. Test samples were transferred to a standard Illumina Infinium assay (Illumina, San Diego, CA), as were the maternal and alleged paternal genomic DNA samples. Briefly, 24 µl of DNA was whole-genome amplified at 37 °C for 20–24 h followed by fragmentation and precipitation. All subsequent procedures were performed in a Tecan EVO (Männedorf, Switzerland) unless otherwise specified. Precipitate was resuspended in microarray hybridization buffer, heat denatured, and transferred to Cyto12-SNP microarrays. The microarrays were incubated at 48 °C for at least 16 h, X-Stained (Infinium II Chemistry), washed, and scanned.

Microarray intensities were extracted using BeadStudio (Illumina) software. Fetal fraction in maternal plasma was determined by considering autosomal SNPs for which the mother is homozygous. The group of maternal homozygous SNPs was divided into subsets in which the fetus is either very likely or very unlikely to have the same genotype as the mother, on the basis of population frequencies. The fetal fraction is proportional to the difference of observed off-allele intensity between the two subsets.

The genotyping results were analyzed using Parental Support.8 For each mother and alleged father combination, the method generated a test statistic that indicated how well the genotype of that possible father accounted for the fetal component of the genotypic measurements made on maternal plasma cfDNA. The test statistic was calculated separately for the mother with each of 5,000 unrelated males, creating an unrelated-male test statistic distribution for that case ( Figure 1 and Supplementary Figures S1–S19 online). The test statistic was then calculated for the mother and alleged father, and a single-hypothesis rejection test determined whether the alleged father’s test statistic fell under the distribution of test statistics created using the unrelated males. If the alleged father’s test statistic was excluded from the unrelated-male test statistic distribution, then the alleged father was determined to be the biological father. If the alleged father’s test statistic fell within the unrelated-male test statistic distribution, then it was determined that he was not the biological father.

Figure 1
figure 1

The Parental Support method for paternity determination. Paternity test results for two separate women: (a) one with paternity inclusion and (b) one with paternity exclusion. The test statistic distribution is indicated in gray, the region indicating paternity inclusion (alleged fathers with P values <10−4) is indicated in green, the region indicating paternity exclusion (alleged fathers with P values >0.02) is indicated in beige, and the region indicating indeterminate paternity (alleged fathers with P values between 10−4 and 0.02) is indicated in black.

Specifically, a paternity inclusion (confirming paternity) was called when the P value of the alleged father’s test statistic according to the distribution of unrelated males was <10−4. An indeterminate result (no call) was called when the calculated P value fell between 10−4 and 0.02 ( Figure 1 ). A paternity exclusion was called when the calculated P value was >0.02. No determination was made when the fetal DNA fraction in maternal cfDNA was <2%. The set of unrelated individuals used to generate the expected distribution was comprised of individuals from a wide variety of racial backgrounds, and paternity determination was recalculated using different racial sets of unrelated individuals, including the race indicated for the alleged father. Inclusion and exclusion results were automatically generated by the informatics method; no human judgment was necessary.

Results

Twenty-one maternal blood samples with known paternity were tested using Parental Support ( Table 1 ). One sample could not be evaluated due to the low fraction of fetal cfDNA in the maternal plasma (<2%, Table 1 ). For each of the remaining 20 samples with sufficient fetal cfDNA, 1,821 independent paternity tests were run using the confirmed father and a random set of 1,820 unrelated male individuals, for a total of 36,420 paternity tests. We confirmed paternity in 100% (20/20) of the cases, with P values of <10−4 ( Table 1 ). The test accurately identified paternity as early as 6 weeks into pregnancy and with a low fraction of fetal cfDNA.

Of note, 99.95% (36,382/36,400) of the tests correctly excluded paternity using Parental Support. Only 18 (0.05%) were called indeterminate. None of the tested samples had an incorrect paternity determination.

Discussion

We report the first highly accurate, noninvasive, clinically available prenatal paternity test. Traditional postnatal paternity testing involves analysis of single tandem repeat sequences and comparison with alleged fathers. Because large amounts of intact genomic child DNA are available postnatally, measurements are robust, and only 15–20 single tandem repeat sequences are necessary to achieve highly accurate paternity results.11

Options are limited for determining paternity prenatally without amniocentesis or chorionic villus sampling, or very early in the preganancy. Although fetal cfDNA from maternal plasma is reliably measured,12 and various groups have reported the amplification and detection of fetal-specific alleles using polymerase chain reaction,13,14,15 these methods do not include sufficient fetal-specific alleles to measure paternity accurately and are not amenable to scaling of interrogated loci.

The use of cfDNA to correctly identify paternity was recently reported in 30 cases.16 However, that study only compared the known father to one unrelated male. If the actual father is not one of the two samples, this method may identify paternity in an individual who is simply more closely related than the other due to close genetic relationships in, for example, small ethnic subpopulations. This underscores the importance of testing against a large population of unrelated males; in this study, we tested against 5,000 random individuals.

Of greater concern is the fact that they do not report a per-test accuracy, instead reporting the chance of correctly guessing paternity in 30 cases, which is one in a billion—a number calculated by taking the chance of correctly guessing paternity per test (1/2) and raising it to the 30th power (30 tests). This combined statistic may be misleading and is not clinically useful, given that each test need not be highly accurate to provide this impressive number. For example, assuming 99% per-test accuracy (well below the 99.99% industry standard), the overall chance of guessing 30 tests correctly is still 74%.

By comparison, assuming a 99% per-test accuracy using Parental Support, the chance of correctly guessing all 36,402 calls is ~1 in 10158. Assuming 99.99% accuracy, the typical after-birth paternity testing accuracy level, the chance of correctly guessing all 36,402 calls is still only 2.6%. This suggests that the Parental Support method is >99.99% accurate.

It is important to note that a closely related male may return a false paternity inclusion. Preliminary results show that a paternal grandfather or uncle of the child in question returns test statistics that, although smaller than the actual father’s test statistic, have P values of <10−4 (data not shown); for this reason, samples from closely related alleged fathers must be measured for comparison to ensure correct paternity identification.

The method used herein obviates problems with traditional- and cfDNA-based methods by measuring autosomal SNPs using a high-throughput SNP array. A large amount of maternal cfDNA is present in the plasma measurements, which generally results in low signal-to-noise ratio when determining fetal genotype from cfDNA. Identifying the maternal genotype and fetal fraction allows us to predict the amount of maternal signal and adjust accordingly. Whereas each individual SNP measurement is noisy, aggregating data over 300,000 SNPs allows for highly accurate paternity determinations. This test provided accurate paternity confirmation as early as 6 weeks into pregnancy, with as low as 2.3% fetal cfDNA fraction, suggesting that paternity testing can be performed very early in pregnancy when fetal fractions are still very low. This is important when paternity is considered as a reason for termination of pregnancy.

This method is adaptable to other platforms, such as next-generation sequencing, which has recently been used to identify fetal cfDNA in maternal plasma.17 That study used massively parallel sequencing of maternal plasma cfDNA from a single subject, identifying the entire fetal and maternal genomes. Parental genotypes were included to assemble the fetal genome and determine maternal versus paternal SNP inheritance, and not in a statistical paternity test; the method lacks informatics-based analysis that contributes to the accuracy with which we identify and exclude paternity, and is more labor intensive, thereby not lending itself well to clinical applications. In addition, the maternal haplotype was inferred from chorionic villi sampling, thus negating the advantage of a noninvasive test. We are currently attempting to validate the efficacy of Parental Support using next-generation sequencing.

In summary, we report a highly accurate, noninvasive, clinically available prenatal paternity test. A larger study is warranted to accurately determine how early in pregnancy this method identifies paternity and to confirm efficacy in a larger cohort.

Disclosure

All authors are employees of Natera and/or hold stock or have options to hold stock in the company.