A nonlinear mixed-effect mixture model for functional mapping of dynamic traits

Hou, W; Li, H; Zhang, B; Huang, M; Wu, R

doi:10.1038/hdy.2008.53

Original Article
Published: 09 July 2008

Heredity volume 101, pages 321–328 (2008)

Abstract

Functional mapping has emerged as a next-generation statistical tool for mapping quantitative trait loci (QTL) that affect complex dynamic traits. In this article, we incorporate nonlinear mixed-effect (NLME) models into the mixture-based framework of functional mapping, with the aim of broadening the range of applications of functional mapping. NLME-based functional mapping, implemented with a linearization algorithm based on the first-order Taylor expansion, can provide reasonable estimates of QTL genotype-specific curve parameters (fixed effects) and of the between-individual variation in these parameters (random effects). Results from simulation studies suggest that the NLME-based model is more general than traditional functional mapping. The new model can be useful for identifying the ontogenetic patterns of QTL genetic effects over a time course.

Introduction

Dynamic traits change their phenotypes with time or other independent variables. A profound understanding of the genetic control of a dynamic trait should include the timing with which the underlying genes turn on and off over a time course, the duration of genetic main and interaction effects, the pleiotropic effects of the genes on various developmental events, and the sensitivity of the genes to environmental signals. Functional mapping, emerging as a next-generation statistical method for genetic mapping, has proven to be powerful for addressing these issues by mapping ontogenetic quantitative trait loci (QTL) for complex dynamic traits (Ma et al., 2002; Wu et al., 2003a, 2003b, 2004a, 2004b, 2004c; reviewed in Wu and Lin, 2006; Yang et al., 2006; Yang and Xu, 2007). The fundamental idea of functional mapping is to jointly model the mean–covariance structure within a mixture model framework for dynamic traits measured longitudinally at different time points, using parametric or nonparametric approaches. If biologically meaningful mathematical equations exist for the longitudinal curves, such as the growth equation (West et al., 2001), the biexponential curve for HIV dynamics (Ho et al., 1995), the Fourier series approximation for the cell cycle (Spellman et al., 1998) and the power equation for allometric scaling (West et al., 1997), parametric approaches can be implemented to estimate the mathematical parameters that define the shape of the curve for each QTL genotype, expressed as a mixture component, instead of directly estimating the QTL genotypic means at every time point.

As a type of time series data, longitudinal traits exhibit strong autocorrelation between successive time points. Structuring such a time-dependent covariance matrix by a stationary or nonstationary approach can increase the model's stability, robustness and statistical power to detect QTL. The approaches for modeling the covariance structure in functional mapping have been based on autoregressive (AR; Ma et al., 2002) or antedependence models (Zhao et al., 2005). In all such modeling work, repeated measurements are assumed to be independent among different subjects and, thus, only within-subject covariance structures have been considered. In a general setting of longitudinal data analysis, three components of random variability should be distinguished in the modeling process: the random effects that stem from heterogeneity between individual profiles, the serial correlation between observations within a sampling unit, and measurement error (Davidian and Giltinan, 1995, 2003; Diggle et al., 2002). Thus, although approximating the covariance structure by serial correlations alone, as in current functional mapping, is parsimonious, it may have serious limitations that prevent a wide application of functional mapping. These limitations arise in the following respects.

First, the mathematical parameters for individual longitudinal curves with the same QTL genotype may not be independent among subjects. Ignoring among-subject dependence in the curve parameters would lead to overestimation of the genetic effect of a QTL on longitudinal trajectories. To draw valid statistical inferences from longitudinal data, random effects that capture heterogeneity among subjects should be considered in conjunction with direct modeling of the within-subject correlation (Chi and Reinsel, 1989; Schabenberger, 1995). Second, the curve parameters may be affected by an array of biological or demographic covariates, such as age, sex, race and body weight. These biological covariates may be under the control of genetic systems that are the same as, or different from, those for the longitudinal traits under consideration. Testing whether different traits or processes share the same genetic control presents an interesting and challenging genetic issue (Lynch and Walsh, 1998).

In this article, nonlinear mixed-effect (NLME) models, or hierarchical nonlinear models, will be incorporated into the context of functional mapping based on a mixture model, aimed to circumvent the above-mentioned limitations of current functional mapping strategies. Since their first emergence in the early 1980s (Beal and Sheiner, 1982; reviewed in Davidian and Giltinan, 1995), NLME models have quickly become a popular statistical method for studying longitudinal data (Lindstrom and Bates, 1990). More recently, a number of extensions and modifications to better suit new challenges have been developed (Davidian and Giltinan, 1995; Vonesh et al., 2002; Wu, 2002, 2004a, 2004b). The major advantage of NLME models lies in their capacity and flexibility to model various structures of covariance matrices. Also, they display a unique ability to accommodate a general intraindividual covariance structure for unbalanced data where measurements are sparse for some subjects and different subjects receive different measurement patterns.

The application of NLME models is a promising approach for improving parameter estimation and drawing valid inferences from longitudinal data, including data on pharmacokinetics and HIV dynamics. Mixed-effect models may also be appealing for genetic studies because they increase the flexibility of QTL mapping for response curves (Rodriguez-Zas et al., 2002; Malosetti et al., 2006). However, because the statistical properties of this technique have not been explored, its application lacks a sound justification. Also, although these published works can model interindividual variation in curve parameters, they have limited flexibility to model the common genetic basis shared by biologically meaningful curve parameters and other biological variables. The purpose of this study is to develop NLME models for estimating the ontogenetic pattern of the genetic control of complex dynamic traits and to examine the statistical behavior of this technique through extensive simulation studies. We integrate NLME models and mixture models within the framework of functional mapping to broaden the scope of this mapping method.

Functional mapping

Nonlinear mixed-effects model

The purpose for the development of functional mapping is to map the temporal effects of QTL on longitudinal traits. Consider a mapping population of n individuals, in which a total of J QTL genotypes at different loci are segregating to affect time-dependent phenotypes of a trait. All the individuals are genotyped for multiple polymorphic markers that construct a genetic linkage map and phenotyped for a longitudinal trait measured at a finite set of time points.

Let $t_i = \{t_{i\tau}\}_{\tau=1}^{T_i}$ be the vector of times for individual i measured at T_i time points and $y_i = \{y_i(t_{i\tau})\}_{\tau=1}^{T_i}$ be the vector of longitudinal phenotypic measurements of individual i. The time points may be unbalanced among individuals and unequally spaced. The phenotypic value of the trait for individual i affected by the putative QTL can be described by a two-stage NLME model, expressed as:

Stage 1 (individual-level model): The response value of individual i across different time points is described by

where ξ_ij is the indicator variable defined as 1 if individual i carries QTL genotype j and 0 otherwise, g is a nonlinear function of β_ij and t_i, β_ij is a (q × 1) vector of individual-specific unknown curve parameters and ɛ_i is a (1 × T_i) error term, usually assumed to have a normal distribution with mean vector 0 and within-individual covariance matrix Σ_i. Note that Σ_i is a (T_i × T_i) serial covariance matrix, which can be structured by a set of parameters (Diggle et al., 2002).
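The Stage 1 display equation, which later text refers to as model (1), does not appear in this copy. A form consistent with the definitions just given, offered as a reconstruction and not as the authors' exact notation, is:

```latex
y_i \;=\; \sum_{j=1}^{J} \xi_{ij}\, g(\beta_{ij},\, t_i) \;+\; \varepsilon_i ,
\qquad \varepsilon_i \sim N(0,\ \Sigma_i)
```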

Stage 2 (population level): The parameters that define the curve shape of individual i with QTL genotype j can be expressed as

where β_j is a (p × 1) vector of the unknown population parameters for QTL genotype j, b_ij is a (k × 1) vector of the random effects, assumed to be normally distributed with mean vector 0 and (k × k) between-individual covariance matrices D_j, and A_i and B_i are design matrices of size q × p and q × k for β_j and b_ij, respectively. This stage captures the interindividual systematic and random variation. This model in a general form can handle any kinds of nonlinear function and the design matrices A_i and B_i can vary for different groups, covariates or even for different individuals.
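The Stage 2 display equation, which later text refers to as model (2), is likewise missing from this copy. Given the stated dimensions of A_i (q × p), B_i (q × k), β_j (p × 1) and b_ij (k × 1), a form consistent with the text would be the following; it is a reconstruction under those definitions and may differ in notation from the published equation:

```latex
\beta_{ij} \;=\; A_i\, \beta_j \;+\; B_i\, b_{ij} ,
\qquad b_{ij} \sim N(0,\ D_j)
```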

Mixture model-based likelihood

The statistical foundation for QTL mapping with molecular markers is a finite mixture model. According to the mixture model, the trait value of an individual is assumed to have arisen from one (and only one) of J QTL genotype groups or mixture components, each component with a relative proportion and being modeled by a normal distribution density.

The likelihood of unknown parameters given the longitudinal measurements (y) and marker information (M) for the mapping population is formulated, in terms of a mixture model, as

where $\omega = \{\omega_{j|i}\}_{j=1}^{J}$ are the QTL genotype frequencies, which are constrained to be nonnegative and to sum to unity; $\beta = \{\beta_j\}_{j=1}^{J}$ and $b_i = \{b_{ij}\}_{j=1}^{J}$ are the component-specific (that is, QTL genotype-specific) parameters, with β_j and b_ij being specific to QTL genotype j; and θ denotes the parameters common to all QTL genotypes, namely the set of unknown parameters that construct D_j and Σ_i.
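To make the structure of a mixture likelihood concrete, the following minimal Python sketch evaluates the log-likelihood as the sum over individuals of log Σ_j ω_j|i f_j(y_i). It assumes the log-densities log f_j(y_i) have already been computed by some other routine (for the NLME model this would involve integrating out or approximating the random effects); the function name and input layout are illustrative choices, not taken from the article.

```python
import math

def mixture_log_likelihood(weights, component_log_densities):
    """Return sum_i log( sum_j w[i][j] * f_j(y_i) ).

    weights[i][j]                 -- mixture proportion of genotype j for individual i
    component_log_densities[i][j] -- log f_j(y_i), computed elsewhere (assumption)
    """
    total = 0.0
    for w_i, logf_i in zip(weights, component_log_densities):
        # Work on the log scale; skip components with zero weight.
        terms = [math.log(w) + lf for w, lf in zip(w_i, logf_i) if w > 0.0]
        m = max(terms)  # log-sum-exp shift for numerical stability
        total += m + math.log(sum(math.exp(t - m) for t in terms))
    return total
```

The log-sum-exp shift matters in practice because the density of a whole longitudinal record can underflow when evaluated on the raw scale.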

The mixture proportion or QTL genotype frequency ω_j∣i depends on the type of mapping population, such as the backcross, recombinant inbred lines, F₂ or natural population. The frequencies of QTL genotypes can be inferred by observed marker genotypes because markers and QTL are assumed to be cosegregating in the mapping population. Assume that a putative QTL is located between two flanking markers that bracket the QTL. Thus, the mixture proportions, ω_j∣i, can be expressed as the conditional probabilities of QTL genotypes given the flanking marker genotype of individual i. The conditional probability can be derived in terms of recombination fractions between the QTL and each of the two markers and between the two markers.
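For a backcross, the conditional probabilities described above have a simple closed form. The sketch below is a minimal illustration that assumes no crossover interference and the Haldane map function, neither of which is specified in the text; the function names and the 1/0 genotype coding are hypothetical.

```python
import math

def haldane_recombination(distance_cm):
    """Haldane map function: map distance in cM -> recombination fraction."""
    return 0.5 * (1.0 - math.exp(-2.0 * distance_cm / 100.0))

def backcross_qtl_probs(d_left_cm, d_right_cm):
    """P(QTL genotype Qq | flanking marker genotypes) in a backcross.

    d_left_cm / d_right_cm are the (positive) distances from the QTL to the
    left and right flanking markers. Marker genotypes are coded 1
    (heterozygous) or 0 (homozygous). Returns {(left, right): P(Qq)};
    P(qq) is one minus that value.
    """
    r1 = haldane_recombination(d_left_cm)
    r2 = haldane_recombination(d_right_cm)
    r = r1 + r2 - 2.0 * r1 * r2  # recombination between the two markers
    return {
        (1, 1): (1 - r1) * (1 - r2) / (1 - r),
        (1, 0): (1 - r1) * r2 / r,
        (0, 1): r1 * (1 - r2) / r,
        (0, 0): r1 * r2 / (1 - r),
    }
```

Each individual's flanking marker genotype then selects one entry of this table as its mixture proportion ω_j|i at the scanned position.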

Computational algorithm

Three types of parameters define the likelihood (3): the mixing proportions of QTL genotypes conditional on marker genotypes (ω), the QTL genotype-specific curve parameters (β, b_i) and the covariance-structuring parameters (θ). The mixing proportions (ω) are expressed in terms of the recombination fractions between the markers and the QTL and, therefore, of the genomic location of the QTL (converted by the map function). In practice, ω, that is, the QTL location, can be treated as a constant because a putative QTL can be searched for at every 1 or 2 cM within each interval of two flanking markers throughout the entire linkage group. The log-likelihood ratio (LR) test statistics are plotted against the linkage map distance. The map position corresponding to the peak of the log-LR plot is taken as the maximum-likelihood estimate (MLE) of the QTL location. Thus, at each scanning location of a QTL, the mixture likelihood depends only on β_j, b_ij and θ. This grid approach is computationally simple, but cannot provide a confidence interval for the estimate of the QTL location. Chen (2005) derived an algorithm for simultaneously estimating the standard errors and confidence intervals of the estimates of QTL effects and locations within the mixture model framework.
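The grid search described above can be sketched in a few lines of Python. The sketch assumes a callable that returns the LR statistic at a given map position (in the real analysis this would involve fitting the full and reduced models at that position); `scan_for_peak` and the toy LR profile are hypothetical names and values used only for illustration.

```python
def scan_for_peak(positions_cm, lr_at):
    """Evaluate an LR statistic on a grid of map positions.

    Returns (best position, best LR, full profile as a list of pairs).
    """
    profile = [(pos, lr_at(pos)) for pos in positions_cm]
    best_pos, best_lr = max(profile, key=lambda item: item[1])
    return best_pos, best_lr, profile

# Hypothetical LR profile peaking at 12 cM, used only to exercise the scan.
def toy_lr(pos):
    return 30.0 - 0.05 * (pos - 12.0) ** 2

grid = [2.0 * k for k in range(0, 21)]  # 0, 2, ..., 40 cM
best_pos, best_lr, _ = scan_for_peak(grid, toy_lr)
```

Because the location is fixed at each grid point, the inner optimization only has to handle the curve and covariance parameters, which is what keeps the scan simple.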

From the likelihood (3), the estimates of β_j, b_ij and θ will need to jointly maximize the posterior distribution function f_j(y_i|β_j, b_ij, θ) f(b_ij|D) weighted by ω_j∣i. But an inference based on the maximization of this distribution is difficult because its expectation is not linear for these unknowns. A few statistical approaches have been developed to obtain the MLEs of β_j, b_ij and θ, and they include numerical evaluation of the integral (Davidian and Giltinan, 1995, 2003), Monte Carlo expectation maximization (EM) algorithm (Wu, 2002, 2004a) and approximations to the nonlinear likelihood function (Tierney and Kadane, 1986; Lindstrom and Bates, 1990; Wolfinger, 1993). Here, we will use a linearization approximation method by using the first-order Taylor expansion to approximate the nonlinear expectation function (Beal and Sheiner, 1982; Lindstrom and Bates, 1990).

For individual i, the mixture-based NLME models (1) and (2) are rewritten into a single equation, expressed in matrix notation as

By taking the first-order Taylor expansion of g(β_j, b_ij; t_i), Equation (4) is linearized to become a linear mixed-effect (LME) model expressed as

where

with the W_i and Z_i composed of time-dependent elements:

and

According to Laird and Ware (1982), the estimates of b_ij and β_j under the LME model are approximated by

Also, QTL genotype-specific curve parameters β_j can be estimated, along with covariance matrix parameters θ, by maximizing the approximate-likelihood function expressed as

where

The simplex algorithm, as implemented in the MATLAB function fminsearch, can be used to obtain the MLEs of β_j and θ (Lagarias et al., 1998).

Hypotheses

A significant advantage of functional mapping is that it allows a number of biologically meaningful hypotheses to be tested on the basis of the mathematical model of the longitudinal curves. Most importantly, the existence of a QTL that affects the overall growth curve should be tested first, and this can be formulated as

H₀: β_1 = β_2 = … = β_J versus H₁: at least one of the equalities above does not hold, where H₀ corresponds to the reduced model, in which the data can be fit by a single mathematical curve, and H₁ corresponds to the full model, in which different longitudinal curves are needed to fit the data. The log-likelihood values L₀ and L₁ under H₀ and H₁ are calculated. The test is performed with the log-LR statistic LR = −2(L₀ − L₁).

To determine the significance of the LR test, we use the critical threshold generated by permutation tests (Churchill and Doerge, 1994). By repeatedly shuffling the relationships between marker genotypes and phenotypes, a series of maximum log-LRs is calculated, and the critical threshold is obtained from their distribution. The LR statistic is plotted against test locations for all the linkage groups. A location at which the LR profile peaks above the threshold is taken as the estimated position of a QTL.
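The permutation procedure can be sketched as follows. This is a minimal illustration, not the authors' code: the statistic is passed in as a function, and the toy statistic used here (a squared difference of group means) merely stands in for the log-LR, which would require refitting the models on every shuffle. The function names, the default of 200 permutations and the 5% level are assumptions.

```python
import random

def permutation_threshold(phenotypes, genotypes_by_locus, statistic,
                          n_perm=200, alpha=0.05, seed=0):
    """Empirical genome-wide threshold: the (1 - alpha) quantile of the
    maximum statistic over loci, taken across shuffles of the phenotypes."""
    rng = random.Random(seed)
    shuffled = list(phenotypes)
    maxima = []
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # breaks the phenotype-genotype association
        maxima.append(max(statistic(shuffled, g) for g in genotypes_by_locus))
    maxima.sort()
    index = min(n_perm - 1, int((1.0 - alpha) * n_perm))
    return maxima[index]

def squared_mean_difference(y, g):
    """Toy stand-in for the log-LR: squared difference of genotype-group means."""
    ones = [v for v, k in zip(y, g) if k == 1]
    zeros = [v for v, k in zip(y, g) if k == 0]
    return (sum(ones) / len(ones) - sum(zeros) / len(zeros)) ** 2
```

Taking the maximum over all loci within each shuffle is what makes the resulting threshold genome-wide instead of pointwise. For longitudinal data, each element of `phenotypes` would be an individual's whole record, so that curves are shuffled intact.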

In addition, the hypothesis test for the time at which the detected QTL turns on or off its effect on longitudinal trajectories can be performed, by comparing the difference of the expected means between different genotypes at various time points. Within the functional mapping framework, the effect of the QTL on a period of time course and its interaction with age can also be tested (Wu et al., 2004a).

A worked example

Mapping population

Here we reanalyzed a published data set for QTL mapping of growth trajectories (Ma et al., 2002) to demonstrate the use of NLME-incorporated functional mapping. The plant materials used were derived from the interspecific hybridization (F₁) between Eastern Cottonwood (Populus deltoides) and Canadian poplar (P. euroamericana). Unlike inbred lines, which need an advanced-generation design for mapping, outcrossing species like trees can make use of a controlled F₁ cross, in which genes segregate in different patterns because the parents are heterozygous. Grattapaglia and Sederoff (1994) proposed a pseudo-testcross design to perform QTL mapping in such an F₁ cross for outcrossing species. This design capitalizes on the so-called testcross markers that are segregating in one parent but null in the second parent. Thus, two different linkage maps can be constructed for an outbred cross, each derived from a different heterozygous parent.

The hybrid poplars for QTL mapping were planted at a spacing of 4 × 5 m at a forest farm near Xuzhou City, Jiangsu Province, China. Total stem heights and diameters measured at the end of each of 11 growing seasons are used in this example. A subset of 90 hybrid trees randomly selected from the original population was used to construct two parent-specific genetic linkage maps with random amplified polymorphic DNAs, amplified fragment length polymorphisms and inter-simple sequence repeats (Yin et al., 2002). Using NLME-based functional mapping, we attempt to locate QTL affecting stem diameter growth trajectories on the linkage map derived from the P. deltoides parent. Individuals with missing joint genotypes for a given pair of markers were excluded from our analysis.

Model formulation

The growth of the stem diameter can be well fit by a logistic equation expressed as

where a is the asymptotic or limiting value of g when t → ∞, a/(1+b) is the initial value of g when t=0 and r is the relative rate of growth (von Bertalanffy, 1957). Given this growth equation, we express the growth of individual i by
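The logistic equation itself is not reproduced in this copy. The properties stated above (limiting value a as t → ∞, initial value a/(1+b) at t = 0, rate r) are satisfied by the standard form g(t) = a / (1 + b·e^(−rt)), which the short Python sketch below evaluates. The parameter values in the usage lines are hypothetical and chosen only to check the two limiting properties.

```python
import math

def logistic_growth(t, a, b, r):
    """Logistic growth curve g(t) = a / (1 + b * exp(-r * t))."""
    return a / (1.0 + b * math.exp(-r * t))

# Hypothetical parameter values, for illustration only.
initial_value = logistic_growth(0.0, a=20.0, b=9.0, r=0.5)     # a / (1 + b)
late_value = logistic_growth(1000.0, a=20.0, b=9.0, r=0.5)     # approaches a
```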

where indicator ξ equals 1 or 0 for QTL genotype Qq and qq, respectively. Growth parameters (a, b, r) are QTL genotype-specific, subscripted by the genotype notation. For simplicity, we only model interindividual variation for parameters a and b by a simple linear regression

where random effects b_i=(b_ai, b_bi) are assumed to be genotype-invariant, normally distributed with mean vector zero and diagonal covariance matrix

In this analysis, ɛ_i(t) is assumed to display a normal distribution with mean vector zero and the first-order AR (AR(1)) covariance matrix specified by two parameters ρ and σ² (Ma et al., 2002). The AR(1) model assumes that the variance (σ²) is time-invariant and correlation decays in a proportion ρ with time lag. These two assumptions can be relaxed by introducing more complicated nonstationary models (Zhao et al., 2005).
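Under the two AR(1) assumptions just described, the covariance between the residuals at occasions s and t is σ²ρ^|s−t|. The following minimal Python sketch builds that matrix; it assumes equally spaced measurement occasions, and the function name is illustrative.

```python
def ar1_covariance(n_times, rho, sigma2):
    """Return the n_times x n_times AR(1) covariance matrix.

    Entry (s, t) is sigma2 * rho**|s - t|: constant variance on the
    diagonal and correlation that decays geometrically with time lag.
    """
    return [[sigma2 * rho ** abs(s - t) for t in range(n_times)]
            for s in range(n_times)]
```

Only two parameters are needed regardless of the number of time points, which is the parsimony that makes this structure attractive for longitudinal traits.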

QTL scanning and estimation

The NLME-based mapping model is used to scan the genome for QTL and to determine their existence and chromosomal distribution. We detect two QTL, on linkage groups 9 and 10, that affect diameter growth trajectories in poplar trees. Figure 1 illustrates a plot of the LRs between the full model (there is a QTL) and the reduced model (there is no QTL) across all the linkage groups. The two detected QTL are located at 111.1 cM from the first left marker on linkage group D9 and at 12 cM from the first left marker on linkage group D10, where the LR peaks (34.77 and 33.33) exceed the genome-wide critical threshold (31.64). Permutation tests were performed to determine the empirical threshold for declaring the genome-wide existence of QTL throughout all the linkage groups.

The MLEs of the curve parameters for each of the two QTL genotypes, Qq and qq, and of the parameters that model the structure of the covariance matrix are tabulated in Table 1, along with the approximate standard errors of these estimates obtained from Fisher's information matrix. All the parameters can be estimated with reasonable precision. The MLEs of the curve parameters in Table 1 were used to draw growth curves at each QTL for diameter growth (Figure 2). The pattern of differentiation in growth curves between the two QTL genotypes at each QTL suggests that the two detected QTL do not affect growth at an early stage of tree development, but are activated at age 5–6 years and remain active afterwards. The timing at which the QTL are switched on seems to coincide with the age at which competition among trees for resources emerges. These results broadly support those obtained from traditional functional mapping (TFM; Ma et al., 2002).

Table 1 MLEs of QTL genotype-specific parameters that define stem diameter growth trajectories in poplar trees from the NLME model

Monte Carlo simulation

In order to examine the statistical properties of the NLME model for QTL mapping, two different Monte Carlo simulation strategies were performed. The simulation studies mimic the example of poplar trees with two sample sizes (80 and 200). For the first strategy, data are simulated according to the NLME model, whereas, for the second strategy, data are simulated according to TFM by Ma et al. (2002). In both cases, only serial correlations are modeled with the AR(1) process. The simulated data sets under different strategies are analyzed, respectively, by the NLME and TFM models. Such reciprocal designs are thought to be helpful for the methodological comparison of QTL mapping.
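A data set of the kind used in the first simulation strategy can be generated along the following lines. This is a minimal sketch under stated assumptions, not the authors' simulation code: it assumes a backcross-type 1:1 segregation of two QTL genotypes, a logistic mean curve, independent normal random effects on a and b, AR(1) residuals at equally spaced times, and entirely hypothetical parameter values. Marker data and the marker-QTL linkage are omitted for brevity.

```python
import math
import random

def simulate_backcross_curves(n, times, beta_by_genotype, sd_a, sd_b,
                              rho, sigma2, seed=0):
    """Simulate logistic growth curves for n individuals.

    beta_by_genotype maps genotype (1 = Qq, 0 = qq) to (a, b, r).
    Random effects are added to a and b; residuals follow a stationary
    AR(1) process. Assumes sd_b is small relative to b, so b_i stays positive.
    """
    rng = random.Random(seed)
    sigma = math.sqrt(sigma2)
    data = []
    for _ in range(n):
        genotype = rng.randint(0, 1)          # 1:1 segregation (assumption)
        a, b, r = beta_by_genotype[genotype]
        a_i = a + rng.gauss(0.0, sd_a)        # individual-specific a
        b_i = b + rng.gauss(0.0, sd_b)        # individual-specific b
        curve = []
        e = rng.gauss(0.0, sigma)             # stationary start of AR(1)
        for k, t in enumerate(times):
            if k > 0:
                e = rho * e + math.sqrt(1.0 - rho ** 2) * rng.gauss(0.0, sigma)
            curve.append(a_i / (1.0 + b_i * math.exp(-r * t)) + e)
        data.append((genotype, curve))
    return data

# Hypothetical settings: 80 individuals, 11 yearly measurements.
params = {1: (25.0, 10.0, 0.6), 0: (20.0, 10.0, 0.6)}
sim = simulate_backcross_curves(80, list(range(1, 12)), params,
                                sd_a=1.0, sd_b=0.5, rho=0.8, sigma2=0.5, seed=1)
```

Setting `sd_a` and `sd_b` to zero gives data with serial correlation only, which corresponds in spirit to the second strategy in which data are generated under the TFM model.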

As expected, if the data are simulated by the NLME model, the NLME model displays better estimation accuracy and precision of parameters than does the TFM model (Table 2). The NLME model can precisely estimate the QTL location, but the TFM fails to do so. Also, compared to the TFM model, the NLME model is more advantageous for convergence under the same convergence criterion. For the data simulated under the TFM model, the two analytical models, NLME and TFM, perform similarly in the precision of parameter estimation and power (Table 3). The estimates of heritability by the two models are consistent with the true value. Tables 2 and 3 give the results for a sample size of 80. Increased sample sizes tend to blur the difference between the two models (results not shown). In general, it can be suggested that the NLME model covers the TFM model and, thus, can be used in a broader range of data types than the TFM model.

Table 2 MLEs of parameters for data set simulated by NLME model with a sample size of 80 obtained from NLME-incorporated and traditional functional mapping

Table 3 MLEs of parameters for data set simulated by the TFM model with a sample size of 80 obtained from NLME-incorporated and traditional functional mapping

The two models are similar in computational efficiency. On a desktop computer (2.4 GHz CPU and 512 MB of memory), both models take about 20 min per simulation round for the data simulated with the NLME model. The TFM model, however, takes longer when it has a convergence problem. For the data simulated with the TFM model, the NLME model still takes about 20 min, but the TFM model is faster (about 15 min).

Discussion

In all organisms, the development of morphological, anatomical and physiological traits takes place in characteristic ontogenetic periods. Effective modeling of the genetic control of particular physiological alterations emerging in the course of the developmental process (from their early onset until their late consequences) requires the use of adequate statistical models. Some basic statistical models for the genetic study of developmental dynamics have been proposed, in an attempt to identify the ontogenetic genetic factors or QTL that control the structure and function of a developmental system (Wu et al., 1999, 2003a, 2003b, 2004a, 2004b, 2004c; Ma et al., 2002; Zhao et al., 2005; Wu and Lin, 2006; Yang et al., 2006; Yang and Xu, 2007). These so-called functional mapping models have been extended to various genetic fields related to the biomedical sciences, such as cancer growth (Liu et al., 2005), HIV dynamics (Wang and Wu, 2004; Wang et al., 2006) and drug response (Lin and Wu, 2005).

The central idea of functional mapping is to model the mean vector and covariance matrix structure by parametric or nonparametric approaches. Previous functional mapping approaches have modeled the structure of the covariance matrix by considering autocorrelation components, but have ignored other sources that also affect the covariance structure, such as random effects and measurement errors (Diggle et al., 2002). The study presented in this article aims to generalize functional mapping by modeling the influence of random effects on parameter estimation and on the relevant hypothesis tests, thus broadening the applicability of functional mapping. The incorporation of random effects into functional mapping based on NLME models (Beal and Sheiner, 1982; Lindstrom and Bates, 1990; Davidian and Giltinan, 1995, 2003; Vonesh et al., 2002; Wu, 2002, 2004a, 2004b) is robust in that it can provide sufficient power to detect ontogenetic QTL for longitudinal data measured at unevenly spaced times that differ among subjects.

The NLME-incorporated functional mapping model has been used to analyze a published growth data set in poplar trees. Compared with the earlier, simpler traditional functional mapping (TFM) (Ma et al., 2002), the new model produces consistent results for the detection of QTL, their chromosomal locations and their ontogenetic effects during a time course. However, simulation studies based on reciprocal designs, that is, with data simulated under each of the NLME and TFM models and then analyzed by both, suggest that whereas QTL contained in the TFM-simulated data can be detected by both models, QTL in the NLME-simulated data can be detected well only by the NLME model. All this implies that the NLME model is more general and can be used more widely in practice than the TFM model.

Perhaps, the most significant advantage of NLME-based functional mapping is its flexibility to extend the idea of functional mapping to a broad spectrum of biological and biomedical areas (see also Malosetti et al., 2006). NLME models include two-stage hierarchical characterization of intra- and intersubject variation. In the first stage, any form of parametric models can be incorporated that are defined by biologically meaningful mathematical parameters; for example, growth rate parameter in the growth equation (West et al., 2001) is related to the developmental status of an organism in a time period. These mathematical parameters may be correlated with other physiological variables or expressed differently under different environmental conditions or genetic backgrounds. The genetic control of these biological phenomena can be integrated into the second stage of the NLME model at which specific underlying QTL can be modeled, estimated and tested.

Statistical inference for longitudinal measurements based on the NLME model has received considerable attention in recent years because of its flexibility to incorporate the correlation within repeated measurements, between-individual variation and covariates (Vonesh et al., 2002; Wu, 2002, 2004a, 2004b; Davidian and Giltinan, 2003). The NLME model has recently been extended to take into account censoring and covariates measured with error (Wu, 2002), missing covariates (Wu, 2004a) and nonignorable dropouts (Wu, 2004b). In addition, to describe the NLME model clearly, we constructed our model framework in the context of interval mapping. More recently, Xu and colleagues have developed a series of shrinkage models that allow a genome-wide search for all possible QTL (Xu, 2003, 2007; Wang et al., 2005). These multiple-QTL models, which take into account epistatic interactions between different QTL, can be incorporated into the NLME model. All these statistical and genetic extensions can be incorporated into functional mapping, which will provide a powerful means for characterizing the developmental machinery of the genetic control of complex traits at the interplay between trait formation and progression and the environment in which the organism is grown. The computer code for the statistical method proposed in this article is available from the corresponding author.

References

Beal SL, Sheiner LB (1982). Estimating population kinetics. Crit Rev Biomed Eng 8: 195–222.
CAS PubMed Google Scholar
Chen Z (2005). The full EM algorithm for the MLEs of QTL effects and positions and their estimated variances in multiple interval mapping. Biometrics 61: 474–480.
Article PubMed Google Scholar
Chi EM, Reinsel GC (1989). Models for longitudinal data with random effects and AR (1) errors. J Am Stat Assoc 84: 452–459.
Article Google Scholar
Churchill GA, Doerge RW (1994). Empirical threshold values for quantitative trait mapping. Genetics 138: 963–971.
CAS PubMed PubMed Central Google Scholar
Davidian M, Giltinan D (1995). Nonlinear Models for Repeated Measurement Data. Chapman and Hall: New York.
Google Scholar
Davidian M, Giltinan DM (2003). Nonlinear models for repeated measurements: an overview and update. J Agric Biol Environ Stat 8: 387–419.
Article Google Scholar
Diggle PJ, Heagerty P, Liang KY, Zeger SL (2002). Analysis of Longitudinal Data. Oxford University Press: Oxford, UK.
Google Scholar
Grattapaglia D, Sederoff RR (1994). Genetic linkage maps of Eucalyptus grandis and Eucalyptus urophylla using a pseudo-testcross: mapping strategy and RAPD markers. Genetics 137: 1121–1137.
CAS PubMed PubMed Central Google Scholar
Ho DD, Neumann AU, Perelson AS, Chen W, Leonard JM, Markowitz M (1995). Rapid turnover of plasma virions and CD4 lymphocytes in HIV infection. Nature 373: 123–126.
Article CAS PubMed Google Scholar
Lagarius JC, Reeds JA, Wright MH, Wright PE (1998). Convergence properties of the Neler–Mead simplex method in low dimensions. SIAM J Optim 9: 112–147.
Article Google Scholar
Laird NM, Ware JH (1982). Random effects models for longitudinal data. Biometrics 38: 963–974.
Article CAS PubMed Google Scholar
Lin M, Wu RL (2005). Theoretical basis for the identification of allelic variants that encode drug efficacy and toxicity. Genetics 170: 919–928.
Article CAS PubMed PubMed Central Google Scholar
Lindstrom MJ, Bates DM (1990). Nonlinear mixed effects models for repeated measures data. Biometrics 46: 673–687.
Article CAS PubMed Google Scholar
Liu T, Zhao W, Tian LL, Wu RL (2005). An algorithm for molecular dissection of tumor progression. J Math Biol 50: 336–354.
Article PubMed Google Scholar
Lynch M, Walsh B (1998). Genetics and Analysis of Quantitative Traits. Sinauer: Sunderland, MA, USA.
Google Scholar
Ma C-X, Casella G, Wu RL (2002). Functional mapping of quantitative trait loci underlying the character process: a theoretical framework. Genetics 161: 1751–1762.
PubMed PubMed Central Google Scholar
Malosetti M, Visser RGF, Celis-Gamboa C, van Eeuwijk FA (2006). QTL methodology for response curves on the basis of non-linear mixed models, with an illustration to senescence in potato. Theor Appl Genet 113: 288–300.
Article CAS PubMed Google Scholar
Rodriguez-Zas SL, Southney BR, Heyen DW, Lewin HA (2002). Detection of quantitative trait loci influencing dairy traits using a model for longitudinal data. J Dairy Sci 85: 2681–2691.
Article CAS PubMed Google Scholar
Schabenberger O (1995). The use of ordinal response methodology in forestry. Forest Sci 41: 321–336.
Spellman PT, Sherlock G, Zhang MQ, Iyer VR, Anders K, Eisen MB et al. (1998). Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. Mol Biol Cell 9: 3273–3297.
Tierney L, Kadane JB (1986). Accurate approximations for posterior moments and marginal densities. J Am Stat Assoc 81: 82–86.
von Bertalanffy L (1957). Quantitative laws in metabolism and growth. Q Rev Biol 32: 217–231.
Vonesh EF, Wang H, Nie L, Majumdar D (2002). Conditional second-order generalized estimating equations for generalized linear and nonlinear mixed-effects models. J Am Stat Assoc 97: 271–283.
Wang ZH, Hou W, Wu RL (2006). A statistical model to analyze quantitative trait locus interactions for HIV dynamics from the virus and human genomes. Stat Med 25: 495–511.
Wang ZH, Wu RL (2004). A statistical model for high-resolution mapping of quantitative trait loci determining human HIV-1 dynamics. Stat Med 23: 3033–3051.
Wang H, Zhang Y-M, Li X, Masinde GL, Mohan S, Baylink DJ et al. (2005). Bayesian shrinkage estimation of quantitative trait loci parameters. Genetics 170: 465–480.
West GB, Brown JH, Enquist BJ (1997). A general model for the origin of allometric scaling laws in biology. Science 276: 122–126.
West GB, Brown JH, Enquist BJ (2001). A general model for ontogenetic growth. Nature 413: 628–631.
Wolfinger RD (1993). Laplace's approximation for nonlinear mixed models. Biometrika 80: 791–795.
Wu L (2002). A joint model for nonlinear mixed-effects models with censoring and covariates measured with error, with application to AIDS studies. J Am Stat Assoc 97: 955–964.
Wu L (2004a). Exact and approximate inferences for nonlinear mixed-effects models with missing covariates. J Am Stat Assoc 99: 700–709.
Wu L (2004b). Nonlinear mixed-effects models with nonignorably missing covariates. Can J Stat 32: 27–37.
Wu RL, Lin M (2006). Functional mapping – how to map and study the genetic architecture of dynamic complex traits. Nat Rev Genet 7: 229–237.
Wu RL, Ma C-X, Lin M, Casella G (2004a). A general framework for analyzing the genetic architecture of developmental characteristics. Genetics 166: 1541–1551.
Wu RL, Ma C-X, Lin M, Wang ZH, Casella G (2004b). Functional mapping of quantitative trait loci underlying growth trajectories using a transform-both-sides logistic model. Biometrics 60: 729–738.
Wu RL, Ma C-X, Lou X-Y, Casella G (2003a). Molecular dissection of allometry, ontogeny, and plasticity: a genomic view of developmental biology. Bioscience 53: 1041–1047.
Wu RL, Ma C-X, Zhao W, Casella G (2003b). Functional mapping of quantitative trait loci underlying growth rates: a parametric model. Physiol Genomics 14: 241–249.
Wu RL, Wang ZH, Zhao W, Cheverud JM (2004c). A mechanistic model for genetic machinery of ontogenetic growth. Genetics 168: 2383–2394.
Wu W-R, Li W-M, Tang D-Z, Lu H-R, Worland AJ (1999). Time-related mapping of quantitative trait loci underlying tiller number in rice. Genetics 151: 297–303.
Xu S (2003). Estimating polygenic effects using markers of the entire genome. Genetics 163: 789–801.
Xu S (2007). Derivation of the shrinkage estimates of quantitative trait locus effects. Genetics 177: 1255–1258.
Yang RQ, Tian Q, Xu S (2006). Mapping quantitative trait loci for longitudinal traits in line crosses. Genetics 173: 2339–2356.
Yang RQ, Xu S (2007). Bayesian shrinkage analysis of quantitative trait loci for dynamic traits. Genetics 176: 1169–1185.
Yin TM, Zhang XY, Huang MR, Wang MX, Zhuge Q, Zhu LH et al. (2002). Molecular linkage maps of the Populus genome. Genome 45: 541–555.
Zhao W, Chen YQ, Casella G, Cheverud JM, Wu RL (2005). A nonstationary model for functional mapping of complex traits. Bioinformatics 21: 2469–2477.

Acknowledgements

We thank the Associate Editor, Dr Shizhong Xu, and the three anonymous referees for their constructive comments on the manuscript. The preparation of this manuscript was partially supported by grants from the NSF (0540745) and the National Natural Science Foundation of China (09-95671 and 30230300).

Author information

Authors and Affiliations

Department of Statistics, University of Florida, Gainesville, FL, USA
W Hou, H Li & R Wu
The Key Laboratory of Forest Genetics and Gene Engineering, Nanjing Forestry University, Nanjing, Jiangsu, People's Republic of China
B Zhang & R Wu
UF Genetics Institute, University of Florida, Gainesville, FL, USA
M Huang & R Wu

Corresponding author

Correspondence to R Wu.

About this article

Cite this article

Hou, W., Li, H., Zhang, B. et al. A nonlinear mixed-effect mixture model for functional mapping of dynamic traits. Heredity 101, 321–328 (2008). https://doi.org/10.1038/hdy.2008.53

Received: 10 March 2008
Revised: 25 April 2008
Accepted: 30 April 2008
Published: 09 July 2008
Issue Date: October 2008
DOI: https://doi.org/10.1038/hdy.2008.53