A mechanistic mathematical model of initiation and malignant transformation in sporadic vestibular schwannoma

Paterson, Chay; Bozic, Ivana; Smith, Miriam J.; Hoad, Xanthe; Evans, D. Gareth R.

doi:10.1038/s41416-022-01955-8

Download PDF

Article
Open access
Published: 12 September 2022

Genetics and Genomics

A mechanistic mathematical model of initiation and malignant transformation in sporadic vestibular schwannoma

British Journal of Cancer volume 127, pages 1843–1857 (2022)Cite this article

1660 Accesses
1 Citations
12 Altmetric
Metrics details

Subjects

Abstract

Background

A vestibular schwannoma (VS) is a relatively rare, benign tumour of the eighth cranial nerve, often involving alterations to the gene NF2. Previous mathematical models of schwannoma incidence have not attempted to account for alterations in specific genes, and could not distinguish between nonsense mutations and loss of heterozygosity (LOH).

Methods

Here, we present a mechanistic approach to modelling initiation and malignant transformation in schwannoma. Each parameter is associated with a specific gene or mechanism operative in Schwann cells, and can be determined by combining incidence data with empirical frequencies of pathogenic variants and LOH.

Results

This results in new estimates for the base-pair mutation rate u = 4.48 × 10⁻¹⁰ and the rate of LOH = 2.03 × 10⁻⁶/yr in Schwann cells. In addition to new parameter estimates, we extend the approach to estimate the risk of both spontaneous and radiation-induced malignant transformation.

Discussion

We conclude that radiotherapy is likely to have a negligible excess risk of malignancy for sporadic VS, with a possible exception of rapidly growing tumours.

Prevalence and natural history of schwannomas in neurofibromatosis type 2 (NF2): the influence of pathogenic variants

Article 24 January 2022

Pathomechanisms in schwannoma development and progression

Article Open access 02 July 2020

Cellular mechanisms of heterogeneity in NF2-mutant schwannoma

Article Open access 21 March 2023

Background

Vestibular schwannoma is a benign tumour of Schwann cells on the eighth cranial nerve. It is a relatively rare disease, with a lifetime risk of around 1 in 1000 [1, 2]. The majority of vestibular schwannoma cases carry pathogenic variants on the gene NF2 and at least one other genetic hit [3]. Age-related risk is described relatively well by a multistage model involving three alterations [4]. This is consistent with a picture in which most cases involve two genetic hits to NF2, and one additional genetic insult [3, 5].

In contrast to many other neoplasms, vestibular schwannoma is almost always benign when detected [1, 2]; detailed measurements of tumour growth are possible [6]; and detailed histories of benign disease are often available in cases of malignant transformation [7]. Vestibular schwannomas are very rarely undergo malignant transformation: only around 0.2% of cases develop into malignancy [2, 7]. A recent study on familial neurofibromatosis type II patients (in which pathogenic variants of NF2 are inherited) suggested an association between radiotherapy and malignant transformation [8]. From such a small sample size, it was impossible to give a quantitative estimate for the excess risk associated with irradiation [9].

Due to the rarity of malignant schwannoma, its pathogenesis is unclear, and the relationships between genomic features, clinical course, and epidemiology are poorly understood [10]. The benign disease is better understood: researchers have been able to use sequencing studies to investigate what genetic alterations have occurred in individual cases [3, 11, 12]. A mathematical model that could connect genomic data to clinical observables would therefore be very useful. Proposed models should pay close attention to specific, plausible mechanisms; as well as summarising the experimental literature in a precise, quantitative form.

One recent example of a mechanistic model connects every parameter to either a population of cells, or the rate of a specific biological process: for example, the mutation rate of individual genes, or the rate of cell division in precursor cells [13]. Several key parameters can be determined directly from the sequences of implicated genes, with no statistical fitting. The main result is an incidence curve for a specific molecular subtype involving alterations to three genes, with the additional advantage that every parameter has a clear interpretation in terms of a specific mechanism [4, 13, 14].

Here, we extend this approach to meet three main goals: first, to describe the incidence of sporadic vestibular schwannoma with a mechanistic model. In this process, we find a new use for existing experimental data in the section “Parameter estimation”, to derive new estimates for biological parameters of interest. Secondly, to use these parameter estimates in a new model for the lifetime risk of malignant transformation. Our work in the section “Modelling sporadic malignant transformation in schwannoma” may help to constrain the genes responsible. Thirdly, to model the excess risk of malignancy following a dose of ionising radiation. At each stage, by relating model parameters to potential observables like variant allele frequencies, we are able draw new inferences from existing data, and suggest informative future experiments.

Methods

Modelling incidence of sporadic vestibular schwannoma

The incidence of sporadic vestibular schwannoma is relatively well-described by a three-hit model [4]. Our model is also a three-hit process, but is distinguished from previous efforts by a focus on specific genes; by allowing for hits to occur in any order; and by tying all parameters to underlying mechanisms. For simplicity, we will not study familial or germline NF2 variants, and focus only on sporadic VS involving somatic NF2 variants.

NF2 displays somatic pathogenic variants in at least 85% of all sporadic vestibular schwannomas according to current estimates, a majority [3, 5]. To reduce the number of unknowns, we consider only cases of VS associated with somatic NF2 loss-of-function. In addition to mutations on NF2, which may consist of single-nucleotide variants (SNVs) or indels, loss of heterozygosity (LOH) is also commonly found on chromosome 22 [3, 11]. Loss-of-function of NF2 can only account for two hits: at least one other gene must be involved. A natural hypothesis is that the third hit is an oncogene (or one of several possible proto-oncogenes). However, the third hit may also be a tumour suppressor on chromosome 22. If both NF2 and this other tumour suppressor had inactivating mutations on one chromosome, and then LOH removes the other trans allele, both tumour suppressors would then be deactivated by only three events. At least two such tumour suppressors are known, SMARCB1 and LZTR1 (see Fig. 1) [15, 16]. We will only consider SMARCB1 here, and leave the inclusion of LZTR1 to future work.

**Fig. 1: A map of chromosome 22 showing the approximate locations of *NF2* and two other tumour-suppressor genes of possible interest, *SMARCB1* and *LZTR1* [15, 16].**

We will first define our sporadic incidence model, before showing how to infer its parameters in the section “Parameter estimation”. Our goal will be to derive improved estimates for the underlying parameters from experimental data. These parameters will be critical to our models of malignancy in sections “Modelling sporadic malignant transformation in schwannoma” and “Modelling excess risk of malignancy following irradiation”.

The model has four basic types of parameter: the base population N₀ of progenitor cells from which the tumour derives; the nonsense mutation rates μ_gene for each gene, and the rate r_LOH (22q) of LOH on chromosome 22; the selective advantages s_i associated with mutant variants; and patient age t [13]. For modelling purposes, a mutation consists of a nonsense mutation caused by either an SNV or an indel. We do not model e.g., frameshift mutations. There is a lack of evidence for an exponential phase in the incidence of benign VS [4]. As a result, we will assume that the three initiating mutations are neutral (s_i ≈ 0) [4, 14]. This reduces the parameters to the initial population N₀, the relevant mutation rates μ_gene and r_LOH (22q), and age t, the only independent variable [13].

The three alterations in this model consist of two hits to NF2, and one hit to either SMARCB1 or a hypothetical oncogene we will call GFX, for “gain-of-function X”. Additional tumour suppressors not on chromosome 22 will be excluded from our discussion of sporadic VS, as this would be a four-event model rather than a three-event model. The model explicitly accounts for the different orders in which the hits can occur. There are three distinct sets of mutations that could result in a schwannoma, and each set may occur in many possible orders:

1.
single-copy inactivation of NF2 → LOH on 22q → gain of GFX (3! = 6 different orders)
2.
single-copy inactivation of NF2 → mutation on second NF2 allele → gain of GFX ($\left( 3 \atop 2 \right) = 3$ different orders)
3.
single-copy inactivation of NF2 → LOH on 22q → SMARCB1 single-copy inactivation (3! = 6 different orders).

Around 65% of the second events in schwannoma tumorigenesis are large chromosomal losses on 22q, overlapping the NF2 locus. Around 20% are known to be SNVs, with the remainder being unclear [17].

In principle, a complete model should also include hits to LZTR1. Due to a lack of available data, we are leaving this to future work.

Each order of events can be represented as a pathway through a network (see Fig. 2): the end state of any of these pathways is a population of tumour cells. These neoplastic mutants are labelled 1, 2 and 3 in Fig. 2. These end states correspond to subtypes of schwannoma with distinct alterations present: LOH will be found in states 1 and 3; pathogenic variants of SMARCB1 will only be present in state 3; and NF2 will be doubly mutated in state 2. These different orderings imply the existence of many distinct subpopulations of cells. These intermediate steps, between wild-type progenitor cells and neoplastic tumour cells, are labelled in Fig. 2.

**Fig. 2: The network structure of two multi-stage models of tumour initiation.**

At age zero, the entire population of N₀ progenitor cells are assumed to start in the wild-type state, N_WT. Random mutations result in intermediate mutants successively emerging, until a single neoplastic cell results. Rather than treat each subpopulation as a discrete random variable, we will employ a “mean field” or “fluid” approximation, and only track the mean subpopulation in each state [13, 14, 18].

Neoplastic cells emerge from the pre-neoplastic cells in a Poisson process. The rate of this process is the rate with which the relevant mutations appear in the subpopulations N_e to N_j (see Fig. 2). The subpopulations N_m labelled in Fig. 2 follow a system of differential equations which can be summarised:

$$\frac{{d\vec N}}{{dt}} = M \cdot \vec N$$

(1)

where the vector $\vec N = \left( {N_{WT},N_a, \ldots ,N_j} \right)$. The matrix M depends on the parameters {μ_NF2, μ_SMARCB1, μ_GFX, r_LOH}, and encodes the allowed transitions in Fig. 2. An explicit form of Eq. (1) is given in Appendix A. The mutation rate μ_gene for a given gene is determined by the error rate u per replication per base pair, the cell replication rate b, and the number of “sensitive locations” on that gene n_gene, [13]:

$$\mu _{gene} = n_{gene}ub$$

and r_LOH is the rate of loss of heterozygosity on chromosome 22. In system (1), the subscript μ_GFX stands for the mutation rate of GFX, the hypothetical oncogene. The populations in system (1) have the initial conditions N_WT (t = 0) = N₀, N_(othertypes) (t = 0) = 0.

The probabilities of developing a schwannoma of subtype 1, 2 or 3 by age t follow

$$\frac{{dP_1}}{{dt}} = \left( {\frac{1}{2}\mu _{NF2}N_e + \frac{1}{2}r_{LOH}N_f + \mu _{GFX}N_g} \right)\left( {1 - P_1} \right) \\ \frac{{dP_2}}{{dt}} = \left( {\frac{1}{2}\mu _{NF2}N_f + \mu _{GFX}N_j} \right)\left( {1 - P_2} \right) \\ \frac{{dP_3}}{{dt}} = \left({\frac{1}{2}\mu _{SMARCB1}N_g + \frac{1}{2}\mu _{NF2}N_h + \frac{1}{2}r_{LOH}N_i} \right)\left( {1 - P_3} \right)$$

(2)

where P_j is the probability for end-node j to be reached. These end nodes are labelled in Fig. 2.

Tumours of the vestibular nerve often consist of multiple independent tumour foci on the same nerve, apparently polyclonal, that form a larger mass [19, 20]. This contrasts with models of tumour initiation in other diseases, where each tumour is assumed to be monoclonal [13, 21, 22]. For VS, the emergence of tumours of subtypes 1, 2 or 3 should therefore be treated as independent events, and not mutually exclusive events.

After solving systems (1) and (2) for P₁, P₂ and P₃, we can use the independence of these events to find the probability to develop a schwannoma with LOH,

$$Pr\left( {LOH,t} \right) = 1 - \left( {1 - P_1\left( t \right)} \right)\left( {1 - P_3\left( t \right)} \right)$$

the probability to develop a schwannoma in which SMARCB1 has been inactivated,

$$Pr\left( {SMARCB1^{ - / - }tumour,t} \right) = P_3\left( t \right)$$

and the probability to develop a schwannoma with any of the above alterations,

$$Pr\left( {tumour,t} \right) = 1 - \left( {1 - P_1\left( t \right)} \right)\left( {1 - P_2\left( t \right)} \right)\left( {1 - P_3\left( t \right)} \right)$$

Vestibular schwannoma is a rare disease, so all the final probabilities P_type must be small. When this is the case, we can make the approximation that quadratic and higher order terms (like P₁ P₂ or P₁ P₂ P₃) are zero. The relevant formulae simplify considerably:

$$\begin{array}{*{20}{c}} {Pr\left( {LOH,t} \right) \approx P_1\left( t \right) + P_3\left( t \right)} \\ {Pr\left( {SMARCB1^{ - / - }tumour,t} \right) = P_3\left( t \right)} \\ {Pr\left( {tumour,t} \right) \approx P_1\left( t \right) + P_2\left( t \right) + P_3\left( t \right)} \end{array}$$

System (1) is linear in the intermediate populations N_m, and can be readily solved by a variety of methods. Several underlying parameters are presently unknown, so we will first give a symbolic solution, and then determine numerical values of these parameters in section “Parameter estimation”.

The intermediate populations of pre-neoplastic cells must also be small. When this is the case, each probability is approximately a power of age, P_type ≈ Ct^k. This can be seen from a power series solution of (1)–(2) around t = 0 [13, 14] (see Appendix A). This is analogous to the classic “log-log” behaviour of many cancers [14, 23]. For sporadic vestibular schwannoma, this should be a good approximation at all ages (see Appendix A for a detailed analysis). The solutions for P₁, P₂ and P₃ read

$$P_1\left( t \right) \approx\; \frac{1}{2}N_0n_{NF2}n_{GFX}r_{LOH}u^{2}b^{2}t^{3} \\ P_2\left( t \right) \approx\; \frac{1}{4}N_0n_{NF2}^{2}n_{GFX}u^{3}b^{3}t^{3} \\ P_3\left( t \right) \approx\; \frac{1}{4}N_0n_{NF2}n_{SMARCB1}r_{LOH}u^{2}b^{2}t^{3}$$

(3)

From Eq. (3), we can find the probability P(tumour,t) that a tumour has been initiated by age t:

$$Pr\left( {tumour,t} \right) = \; P_{1} + P_{2} + P_{3} \approx \frac{1}{4}N_0n_{NF2}\big( {n_{GFX}\left( {2r_{LOH} + n_{NF2}ub} \right)} \\ + {n_{SMARCB1}r_{LOH}} \big)u^{2}b^{2}t^{3} = At^{3}$$

(4)

The constant A can be directly related to incidence data and other models, as shown in the section “Parameter estimation” [4]. Also of interest are the probability to develop a tumour in which SMARCB1 has been inactivated,

$$Pr\left( {SMARCB1^{ - / - }tumour,t} \right) = P_3\left( t \right) \approx \frac{1}{4}N_0n_{NF2}n_{SMARCB1}r_{LOH}u^2b^2t^3$$

(5)

and the probability to develop a tumour that displays LOH on chromosome 22,

$$Pr\left( {LOH,t} \right) = P_1\left( t \right) + P_3\left( t \right) \approx \frac{1}{4}N_0n_{NF2}\left( {2n_{GFX} + n_{SMARCB1}} \right)r_{LOH}u^2b^2t^3$$

(6)

Equations (5) and (6) are crucial in determining the model parameters in section “Parameter estimation”, as they can be related to empirical frequencies of pathogenic variants. The resulting curves (4), (5) and (6) are plotted in Fig. 3.

**Fig. 3: The theoretical probability of developing sporadic vestibular schwannoma (black, dashed curves) and empirical cumulative incidence (solid grey line) as a function of age.**

Parameter estimation

The model in section “Modelling incidence of sporadic vestibular schwannoma” has seven parameters: the initial wild-type population N₀; the cell turnover rate b; the mutation rate per base pair u; the sensitivity parameters n_NF2, n_SMARCB1, and n_GFX; and the rate of LOH r_LOH. These must now be estimated.

Precursor cell population and division rate

The base population of precursor cells N₀ can be estimated from anatomical considerations. There are two vestibular nerves, with around 19,000 axons in each [4, 24]. The region on the vestibular nerve from which schwannomas usually originate is ~6 mm long [4], and the approximate spacing between Schwann cells is ~0.5 mm [4]. Thus, the base population in the normal tissue pool N₀ is approximately

$$N_0 = 2 \times 19,\!000 \times \frac{6}{{0.5}} = 456,\!000$$

(7)

This is higher than R. Woods’ 2003 estimate [4], which we attribute to having a more recent estimate for the number of axons in each vestibular nerve [24].

The rate of cell division b in the precursor cell population can be estimated from an experiment on Schwann cell proliferation following injury [25]. Their observed rate of Schwann cell proliferation of 7%/day corresponds to a value of b of 25.5/year.

Sensitive sites

The effective numbers of sensitive locations n_NF2 and n_SMARCB1 can be calculated from reference sequences [13, 26, 27]. A site is “sensitive” if a mutation there results in a nonsense codon, truncating the gene. Only sites on the longest open-reading frame are considered. Express n_gene as

$$n_{gene} = l_{gene} + m_{gene}$$

(8)

where n_gene is the overall number of sensitive sites, l_gene is the number of substitution-sensitive sites, and m_gene is the number of indel-sensitive sites.

To calculate the number of substitution-sensitive sites l_gene, count the number of places where a base substitution could result in a nonsense codon. If two possible substitutions can result in a stop codon, count this base twice. Finally, multiply this by 1/3 to account for the fact that only one of the three possible substitutions will result in a nonsense codon. This set of assumptions is equivalent to the Jukes–Cantor model of mutation processes [28].

Several other models of molecular substitution were considered, like Kimura 1980, Tamura and Nei 1993, and Tavaré 1986 [29,30,31,32]. In full generality, different rates of transitions/transversions could be represented by redefining

$$\mu _{gene} = \mathop {\sum }\limits_i l_iu_ib + m_{gene}u_{indel}b$$

where i runs over the different substitutions, i = C > A, C > T, C > G etc., u_i represents the rate of that substitution per replication, u_indel is the mean rate of indels, and l_i represents the number of ways to produce a stop codon with the substitution i. Due to a lack of schwannoma-specific data on substitution rates, it was decided not to use more complex models to avoid overfitting, a point elaborated upon in “Discussion”.

Strictly speaking, truncating mutations near the start of the gene might also be expected to be more deleterious than truncating mutations near the end. Something similar is likely to hold for frameshifts. It has also been shown that truncating variants at the very beginning of SMARCB1 lead to reinitiation of translation [33]. The same is thought to be true for NF2, since very early truncating variants cause less severe disease [34]. But to avoid making the model too granular, we have decided to ignore such positional effects, and leave their inclusion to future work.

To calculate m_gene, the empirical approximation

$$m_{gene} \approx 0.74l_{gene}$$

(9)

from Bozic et al. 2010 can be used [35]. However, a more mechanistic way of calculating m_gene is desirable, as this will be an important parameter in the section “Modelling excess risk of malignancy following irradiation”. As a first approximation, we will only count sites as sensitive if a deletion there is next to or overlaps a nonsense codon and assume this always causes a loss of function. We can now count the number of sites m(k) where a deletion of length k results in a stop codon: TAG, TAA or TGA. Small insertions and deletions have a distribution of possible lengths k, with k = 1 occurring more often than any other length [36,37,38].

Calling the length of an indel k, with k > 0 corresponding to deletions, and k < 0 to insertions; denote its frequency of occurrence f(k). For each possible length k, there is a corresponding number of sensitive sites m(k) on the gene. This is calculated in a similar way for each k: for each site on the gene, check if deleting the base pair there, and k – 1 bases afterwards, results in a nonsense codon. The number of indel-sensitive sites, m_gene, is then the average of m(k) given the length distribution f(k):

$$m_{gene} = \frac{{\mathop {\sum }\nolimits_{k = - \infty }^\infty m\left( k \right)f\left( k \right)}}{{\mathop {\sum }\nolimits_{k = - \infty }^\infty f\left( k \right)}}$$

(10)

In the following, we will assume that insertions (with k > 0) are just as likely as deletions (k < 0), and f(k) is symmetric. The k < 0 terms will therefore drop out of the sum in Eq. (10).

Given a sequence, we can calculate m(k) explicitly for all values of k. Then, given a length distribution f(k) such as

$$f\left( k \right) = Aq^{k \vee }$$

m_gene can be computed explicitly given a sequence. We will use q = 0.53 for consistency [36], Supplementary Table 5A. Our results were largely insensitive to the exact choice of q. Supplemental Python code that computes l_gene and m_gene from EMBL sequences has been provided.

Using reference sequences for NF2 and SMARCB1 from the European Nucleotide Archive [26, 27], we find:

$$n_{NF2} =\; 251 \times \frac{1}{3} + 44.27 = 135 \\ \approx 251 \times \frac{1}{3} \times 1.74 = 142 \\ n_{SMARCB1} =\; 143 \times \frac{1}{3} + 37 = 85 \\ \approx 143 \times \frac{1}{3} \times 1.74 = 83$$

Which shows that Eq. (9) is a good approximation of the results of Eq. (10) for the genes we consider.

Base-pair mutation rates, rates of LOH and n _GFX

The number of sensitive sites n_GFX on GFX cannot be estimated by a similar counting argument, because the identity of this gene is unknown. Direct measurements of the rate r_LOH of loss of heterozygosity in Schwann cells, and of the per-base mutation rate u, are also unavailable.

However, all three parameters can be constrained using measurements of the empirical probabilities f_LOH of LOH on 22q, and f_SMARCB1 of pathogenic variants of SMARCB1. These can be computed from experimental studies [5, 15, 17, 39]. In a sample size of n patients, if k_LOH patients are found to have LOH on 22q, then the relative frequency f_LOH can be estimated as $f_{LOH} = \frac{{k_{LOH} + \frac{1}{2}}}{{n + 1}}$ (see Appendix B for a discussion of additive smoothing). A similar calculation can be done for f_SMARCB1, the frequency of pathogenic variants of SMARCB1.

The significance of this is that f_LOH and f_SMARCB1 are both predicted by our model. From Eqs. (4–6), theoretical predictions for f_LOH and f_SMARCB1 can be found:

$$f_{LOH} = \frac{{Pr\left( {3hitwithLOH} \right)}}{{Pr\left( {3hit} \right)}} = \frac{{P_1 + P_3}}{{P_1 + P_2 + P_3}} \\ \approx \frac{{2n_{GFX}r_{LOH} + n_{SMARCB1}r_{LOH}}}{{2n_{GFX}r_{LOH} + n_{SMARCB1}r_{LOH} + n_{GFX}n_{NF2}ub}}$$

(11)

and

$$f_{SMARCB1} =\, \frac{{Pr\left( {3hitwithSMARCB1} \right)}}{{Pr\left( {3hit} \right)}} = \frac{{P_3}}{{P_1 + P_2 + P_3}} \\ \approx \frac{{n_{SMARCB1}r_{LOH}}}{{2n_{GFX}r_{LOH} + n_{SMARCB1}r_{LOH} + n_{GFX}n_{NF2}ub}}$$

(12)

f_LOH and f_SMARCB1 are algebraically independent. So, if f_LOH and f_SMARCB1 are known experimentally, then two parameters of the model can be fixed. n_GFX and r_LOH can be expressed explicitly in terms of f_LOH and f_SMARCB1:

$$n_{GFX} = \frac{1}{2}n_{SMARCB1}\frac{{f_{LOH} - f_{SMARCB1}}}{{f_{SMARCB1}}}$$

(13)

$$r_{LOH} = \frac{{f_{LOH} - f_{SMARCB1}}}{{1 - f_{LOH}}}\frac{1}{2}n_{NF2}ub$$

(14)

so that if f_LOH and f_SMARCB1 are both known, then the only remaining free parameter is u.

A recent study of NF2 status and LOH in sporadic vestibular schwannoma found that out of 23 patients, 17 had some form of LOH [5]. So, f_LOH = (17 + 0.5)/(23 + 1) = 73%. This includes copy-neutral LOH, by which we understand any reduction in genetic dose at some locus with an otherwise normal karyotype. The primary mechanism of copy-neutral LOH observed in VS is mitotic recombination [11]. Other hypothetical possibilities are trisomy rescue, gain of one chromosome and followed by a loss of or large deletion on the other; and monosomy rescue, a copy number loss followed by subsequent gain [40, 41].

Relevant measurements of SMARCB1 status were not available, so we performed our own sequencing experiments to determine f_SMARCB1. Sequences from 32 sporadic vestibular schwannomas were sourced from the NHS DNA archive in the Manchester Centre for Genomic Medicine, and pathogenic variants of SMARCB1 were searched for using Sanger sequencing.

Sanger sequencing of SMARCB1 involved initial amplification of each coding exon, including 50–100 bp flanking intronic sequence per exon, using GoTaq G2 (Promega, Madison, WI, USA). The products were purified using Ampure cleanup beads, and the purified products were used as a template for sequencing PCRs using BigDye Terminator v3.1 (Life Technologies, Paisley, UK). Sequencing PCR products were analysed on an ABI3730xl DNA Analyser (Life Technologies, Paisley, UK). Sequencing chromatograms were aligned to the reference sequence to identify variants.

Our experimental results found no pathogenic variants of SMARCB1 in a sample of 32 sporadic vestibular schwannomas. The naive estimate of f_SMARCB1 is therefore k/n = 0/32 = zero. This presents a problem, as our estimate for n_GFX diverges when this is the case. As discussed above, we use additive smoothing to regularise f_LOH, with a pseudocount of $\alpha = \frac{1}{2}$ (see Appendix B for a further discussion) [42]. Thus,

$$f_{SMARCB1} = \frac{{k + \frac{1}{2}}}{{n + 1}} = \frac{{\frac{1}{2}}}{{32 + 1}} \approx 1.5\%$$

or f_SMARCB1 = 1.5%. Together with n_SMARCB1 = 85, and f_LOH = 73% imply that

$$n_{GFX} = \frac{1}{2} \times 85 \times \frac{{0.73 - 0.015}}{{0.015}} = 2002$$

(15)

from (13), and

$$r_{LOH} = \frac{{0.73 - 0.015}}{{1 - 0.73}} \times \frac{1}{2} \times 135 \times u \times 25.5 = 4.5 \times 10^3uyear^{ - 1}$$

(16)

from (14). The only remaining degree of freedom is now u, the error rate per base pair per division. We determine u first by fitting a power law,

$$Pr\left( {tumour,t} \right) = At^3$$

(17)

to incidence data from Evans et al. using non-linear least squares [1]. The coefficient A is related to the model parameters by Eq. (4),

$$A = \frac{1}{4}N_0n_{NF2}\left( {n_{GFX}\left( {2r_{LOH} + n_{NF2}ub} \right) + n_{SMARCB1}r_{LOH}} \right)u^2b^2$$

(18)

The probability to develop a tumour by age t, P(tumour,t), is the cumulative incidence by age t, after correcting for survival. Before fitting, the empirical incidence was therefore corrected for mortality using life tables for the period and region of the study [43].

The dataset was also truncated above the age of 80. This is because the relevant population size at ages greater than 80 was very small, and the incidence displays an old-age “plateau” that we suspect is an observational artifact. This amounts to excluding 8 patients from the original dataset of 417, so only 2% of the data has been trimmed [1]. In addition to these modifications to the dataset, our model only accounts for VS associated with somatic NF2 loss. This occurs in (at least) 85% of cases, so the cumulative incidence data is also rescaled by 85% [3, 5].

Although the results of Carlson et al. suggest that NF2 may be implicated in 100% of cases, we use AL Håvik et al.’s estimate of 85% as their sample size was larger (n = 46 vs. n = 23) and both authors’ methods should have a comparable resolution [3, 5].

Equation (17) was fit to the data from Evans et al. using non-linear least squares regression and standard errors obtained from the Hessian matrix [1]. The resulting best-fit value was A = 2.26 × 10⁻¹¹/yr³, with a standard error of σ(A) = ±2.95 × 10⁻¹³. This was not an important source of uncertainty, and was dwarfed by the uncertainty in n_GFX and r_LOH resulting from the small sample sizes of the experiments used to determine f_LOH on 22q and SMARCB1 alteration frequencies (see the section “Uncertainties in parameters”).

This three-hit model displays a high goodness of fit, with R² = 0.989. However, prior work had already established that a three-hit model provided a good fit to this dataset, so replicating this finding with high confidence is not statistically remarkable [4]. The purpose of this procedure was primarily to provide new parameter estimates. These new estimates, in particular u and r_LOH, will be useful in our study of malignant transformation (section “Modelling sporadic malignant transformation in schwannoma”).

Together with Eq. (18), this value for A implies that

$$u = 4.48 \times 10^{ - 10}$$

per base pair per division. This is similar in order of magnitude to the value of u ≈10⁻⁹ estimated by Tomasetti and Bozic, and similar to the range reported by Keogh et al. for brain tissue [44, 45]. This estimate for u now allows a concrete numerical value to be placed on r_LOH:

$$r_{LOH} = 2.03 \times 10^{ - 6} \; yr^{ - 1}$$

(19)

This is rather low in comparison to Paterson et al.’ estimate for r_LOH = 1.2 × 10⁻⁴/yr in colorectal cancer [13]. Possible explanations for this difference are discussed in the section “Sporadic vestibular schwannoma”.

Uncertainties in parameters

The sample sizes used to estimate frequencies of f_LOH and f_SMARCB1 were relatively small, both with N = 20–30 patients. As a result, n_GFX, u, and r_LOH have important sources of uncertainty that are difficult to quantify with standard methods—i.e., propagation of uncertainty typically assumes normally distributed errors, which is not a valid assumption when sample sizes are small [46].

To address this, distributions for these parameter values were bootstrapped. Randomly resampling regularised datasets for LOH and SMARCB1 with replacement generates a distribution of estimated values for n_GFX, u, and r_LOH (see Appendix B) [5, 15, 47]. A 95% confidence interval can then be determined for each parameter from the resulting distribution [48]. These confidence intervals read:

$${n_{GFX}} \in {\left[ {315,2442} \right]} \\ u \in {\left[ {3.36 \, \times 10^{ - 10},8.74 \, \times 10^{ - 10}} \right]} \\ {r_{LOH}} \in {\left[ {1.29 \, \times 10^{ - 6},4.93 \, \times 10^{ - 6}} \right] \, yr^{ - 1}}$$

(20)

In the context of bootstrapped distributions, the “best estimates” from section “Parameter estimation” can be interpreted as modal estimates. Supplemental Python code that implements the bootstrapping procedure has been provided. All parameter estimates and intervals are presented in Table 1 and Fig. 4. The input f_LOH and f_SMARCB1 estimates are presented in Table 2.

Table 1 Estimates of model parameters.

Full size table

**Fig. 4: Table of parameter estimates.**

Table 2 Frequencies of LOH and pathogenic alleles.

Full size table

Modelling sporadic malignant transformation in schwannoma

Additional mutations may occur as the tumour expands. When the “right” mutations occur together, a cell within the tumour gains a malignant phenotype [49]. In the case of malignant schwannoma, it is not currently clear whether these additional hits are due to oncogenes, tumour suppressors, or a combination of the two. We investigated several possibilities.

It was found during preliminary work that if malignant schwannoma were caused by a single oncogene, then the chances of its emerging from any benign schwannoma would be extremely high, on the order of 100%. A similar conclusion was also reached for any tumour suppressor on chromosome 22. As malignant schwannoma is a rare disease, it is far more likely that malignancy occurs after multiple hits—the simplest hypothesis is that a tumour-suppressor gene that is not on chromosome 22 is responsible.

In the following, we call this hypothetical tumour-suppressor gene “TSX”. We should bear in mind that there may be several such tumour suppressors, presumably associated with distinct molecular subtypes of malignant schwannoma. But for simplicity, we will develop the simplest model that is consistent with experimental observations, and make the operational assumption that there is only one such gene. Models that fully account for genetic diversity and multiple tumour suppressors will be left to future work. If the gene TSX needs to be extremely large for the model to make realistic predictions, this would be an early warning sign that we have missed some of this genetic diversity.

Once again, we will adopt a mean-field approach, and model the mean of intermediate populations [13]. Unlike in our approach in the section “Modelling incidence of sporadic vestibular schwannoma”, the underlying “wild-type” population is not constant, but expanding as the tumour grows [14, 50].

We will make the approximation that there is no cell death in the tumour, and all growth occurs at the outermost edge [50]. This is likely to be an underestimate of the real rate of cell division and turnover. But it allows us to eliminate dependence on the time t elapsed since tumour initiation, which is not directly observable. This obviates any assumptions about the tumour’s growth curve.

In each cell division event, there is some probability to create a mutant daughter cell. This can occur through either a mutation (i.e., SNV or indel) on TSX with probability n_TSX u, or through LOH on the relevant chromosome with probability p_LOH. The subpopulations N_k, N_l, N_m in the expanding tumour (see Fig. 5) obey the following differential equations:

$$\frac{{dN_k}}{{dt}} =\, g\left( t \right) \\ \frac{{dN_l}}{{dt}} =\, n_{TSX}u\frac{{dN_k}}{{dt}} + s_lN_l \\ \frac{{dN_m}}{{dt}} =\, p_{LOH}\frac{{dN_k}}{{dt}} + s_mN_m$$

(21)

where the growth rate g(t) of the tumour is some smooth function of time with g(0) = 0, and s_l and s_m are fitnesses of the mutant clones. System (21) has the initial conditions: N_k (t = 0) = g(0) = 0, N_l (t = 0) = 0, so that N_k is just

$$N_k = {\int}_{t^{\prime} = 0}^t {g\left( {t{^\prime}} \right)dt{^\prime}}$$

**Fig. 5: Our 2-hit model of tumour-suppressor inactivation, accounting for all three orders of appearance.**

Note that the populations are easier to measure than tumour age t.

As in the model from the section “Modelling incidence of sporadic vestibular schwannoma”, we will assume that as soon as a malignant cell emerges in the benign VS tumour, it survives and establishes a growing malignant lineage. The probabilities for a malignant subclone to emerge in a benign VS with and without LOH on the locus of TSX follow

$$\frac{{dP\left( {malignancywithLOH} \right)}}{{dt}} = \left( {\frac{1}{2}n_{TSX}u\frac{{dN_m}}{{dt}} + \frac{1}{2}p_{LOH}\frac{{dN_l}}{{dt}}} \right)\left( {1 - P\left( {malignancywithLOH} \right)} \right) \\ \frac{{dP\left( {malignancywithoutLOH} \right)}}{{dt}} = \left( {\frac{1}{2}n_{TSX}u\frac{{dN_l}}{{dt}}} \right)\left( {1 - P\left( {malignancywithoutLOH} \right)} \right)$$

(22)

respectively, where p_LOH is the probability for LOH to occur during a single division, and n_TSX is the number of sensitive locations on TSX. For the parameters u and p_LOH, we will use the value of u = 4.48 × 10⁻¹⁰ from section “Modelling incidence of sporadic vestibular schwannoma”; for p_LOH, we will use a value consistent with our previous r_LOH estimate:

$$r_{LOH} = p_{LOH}b \\ \Rightarrow p_{LOH} = \frac{{r_{LOH}}}{b} = 7.97 \times 10^{ - 8}$$

(23)

In the absence of better experimental constraints, we will take p_LOH to be the same for all chromosomes. As TSX is currently unidentified, n_TSX cannot be determined a priori from a reference sequence like in section “Parameter estimation”. We constrain n_TSX in the section “Parameter estimation”.

General solutions to system Eq. (21) can be written:

$$\begin{array}{*{20}{c}} {N_l = n_{TSX}uN_k + s_le^{s_lt}{\int}_{t{^\prime} = 0}^t {e^{ - s_lt{^\prime}}n_{TSX}uN_k\left( {t{^\prime}} \right)dt{^\prime}} } \\ {N_m = p_{LOH}N_k + s_me^{s_mt}{\int}_{t{^\prime} = 0}^t {e^{ - s_mt{^\prime}}p_{LOH}N_k\left( {t{^\prime}} \right)dt{^\prime}} } \end{array}$$

(24)

The fitnesses s_l and s_m relative to the “wild-type” tumour cells N_k are not known. Since both subpopulations l and m still have one active copy of TSX, we will assume these mutations are neutral, s_l = s_m = 0. This amounts to assuming that TSX is haplosufficient. In this limit, Eq. (24) becomes

$$\begin{array}{*{20}{c}} {N_l = n_{TSX}uN_k} \\ {N_m = p_{LOH}N_k} \end{array}$$

(25)

Note that all explicit dependence on t has disappeared, and these depend purely on N_k. The risks of different types of malignancy are then given by the solutions of Eq. (22),

$$\begin{array}{*{20}{c}} {P\left( {malignancywithLOH} \right) = 1 - {{{{{{{\mathrm{exp}}}}}}}}\left( { - n_{TSX}up_{LOH}N_k} \right)} \\ {P\left( {malignancywithoutLOH} \right) = 1 - {{{{{{{\mathrm{exp}}}}}}}}\left( { - \frac{1}{2}\left( {n_{TSX}u} \right)^2N_k} \right)} \end{array}$$

(26)

The overall risk of malignancy can be calculated from Eq. (26) using the relation

$$1 - P\left( {malignancy} \right) = \left( {1 - P\left( {malignancywithLOH} \right)} \right)\left( {1 - P\left( {malignancywithoutLOH} \right)} \right)$$

from which it follows that

$$P\left( {malignancy} \right) = 1 - {{{{{{{\mathrm{exp}}}}}}}}\left( { - \frac{1}{2}n_{TSX}u\left( {n_{TSX}u + 2p_{LOH}} \right)N_k} \right)$$

We can also note that both n_TSX u and p_LOH are small, on the order of 10⁻⁷. The tumour population N and subpopulation N_k must therefore be similar, to within one part in 10⁻⁷:

$$N = N_k + N_l + N_m = \left( {1 + n_{TSX}u + p_{LOH}} \right)N_k \approx N_k$$

so as a result,

$$P\left( {malignancy} \right) \approx 1 - {{{{{{{\mathrm{exp}}}}}}}}\left( { - \frac{1}{2}n_{TSX}u\left( {n_{TSX}u + 2p_{LOH}} \right)N} \right)$$

(27)

Finally, we can notice once again that because malignant schwannoma is rare [2], P (malignancy) must be very small, P ≈ 0.2% ≪ 1, and Eq. (27) simplifies to

$$P\left( {malignancy} \right) \approx \frac{1}{2}n_{TSX}u\left( {n_{TSX}u + 2p_{LOH}} \right)N$$

(28)

To connect P(malignancy) to the observed volume V of the tumour, the population N must be related to V:

$$N = \frac{{f_{SC}V}}{{V_{SC}}}$$

(29)

where f_SC is the fraction of cells in the tumour that are Schwann cells, V_SC is the volume of an individual Schwann cell (in mm³), and V the volume of the tumour (also in mm³). The majority of cells in schwannomas are not Schwann cells, but macrophages: the best available estimates of f_SC are in the range 0.3–0.5 [51]. Of these, we choose the upper value of f_SC = 0.5. A representative value for the volume V_SC can be set at 1.6 × 10⁻⁶ mm³ (from ref. [52]).

The final expression for the risk of malignancy in terms of tumour volume is thus

$$P\left( {malignancy} \right) \approx \frac{1}{2}n_{TSX}u\left( {n_{TSX}u + 2p_{LOH}} \right)\frac{{f_{SC}V}}{{V_{SC}}}$$

(30)

The tumour volume V can be observed, in principle, on MRI scans [53]. Notably, the risk of malignancy is simply proportional to tumour volume V. We constrain the number of sensitive sites n_TSX in section “Parameter estimation”, and explore a range of plausible values in Fig. 6.

**Fig. 6: Risk of spontaneous malignant transformation in a benign schwannoma as a function of diameter d in mm, as modelled by Eq. (30).**

Parameter estimation

The main unknown in this model of malignant transformation is n_TSX, the number of sensitive sites on TSX. As the identity of this gene is unknown, n_TSX cannot be calculated using a reference sequence. An estimate could help to constrain its identity.

From the SEER study, we know that the lifetime risk of malignant schwannoma ≈0.2% [2]. The model in the section “Modelling sporadic malignant transformation in schwannoma” avoided using “tumour age” as an independent variable, which is usually unobservable: the main result, Eq. (30), is formulated in terms of tumour volume.

If a typical tumour diameter on surgery is 40 mm, the tumour will contain roughly N ≈10¹⁰ cells [54, 55]. If P(malignancy) ≈0.2% [2], then

$$P = \frac{1}{2}n_{TSX}u\left( {n_{TSX}u + 2p_{LOH}} \right)N \approx 0.2\%$$

(31)

from Eq. (27). Rearranging (31) for n_TSX is elementary. Given that u = 4.48 × 10⁻¹⁰, N ≈ 10¹⁰, and p_LOH = 7.97 × 10⁻⁸ p_LOH = 7.97 × 10⁻⁸, n_TSX should then be

$$n_{TSX} \approx \frac{{p_{LOH}}}{u}\left( {\sqrt {1 + \frac{{2P}}{{p_{LOH}^2N}}} - 1} \right) \approx 1245$$

(32)

While this is a high figure, it is not impossible for TSX to be one gene: one may compare n_APC = 604 from ref. [13]. It nonetheless seems more likely that several tumour suppressors are involved, and this high estimate for n_TSX indicates a degree of genetic diversity.

This estimate must be qualified by the uncertainties in u and r_LOH. We can place confidence intervals on n_TSX by a similar bootstrapping procedure to that in the section “Uncertainties in parameters”. Using Eq. (32), one n_TSX estimate can be determined for every pair of u and r_LOH values. A distribution for n_TSX can therefore be generated at the same time as those for u and r_LOH. The resulting 95% confidence interval is

$$n_{TSX} \in \left[ {604,1515} \right]$$

The full bootstrapped distribution for n_TSX is shown in Fig. 4. Supplemental Python code has also been provided.

LOH in malignant schwannoma

Another testable prediction is the proportion of malignant tumours that should display LOH. Considered as a subtype of schwannoma associated with a mutation on TSX, levels of LOH on chromosome 22 should be similar to that of benign tumours. If malignant transformation involves an additional tumour-suppressor gene, and it is not on chromosome 22, we should also expect to see LOH elsewhere.

This “excess” LOH should be found on the chromosome that carries TSX, the hypothetical tumour suppressor from the section “Modelling sporadic malignant transformation in schwannoma”. Studying LOH, especially in the form of copy number changes, has already been widely used to search for tumour suppressors, although it only gives limited information about the precise locus [56]. In this section, we estimate the frequency f_LOH of LOH aggregated across all chromosomes, relate this to n_TSX, and estimate a minimum useful sample size for a study.

The parameter n_TSX can be mathematically related to f_LOH by a similar method to that used for n_GFX in the section “Parameter estimation”. This will be useful for two reasons. Firstly, so that future measurements of f_LOH can be used to estimate n_TSX [56]. Secondly, f_LOH can be estimated in advance to judge whether or not such an experiment would be worthwhile.

From Eqs. (25) and (27), it follows that

$$f_{LOH} = \frac{{P\left( {malignancywithLOH} \right)}}{{P\left( {malignancy} \right)}} \\ = \frac{{1 - {{{{{{{\mathrm{exp}}}}}}}}\left( { - n_{TSX}up_{LOH}N} \right)}}{{1 - {{{{{{{\mathrm{exp}}}}}}}}\left( { - \frac{1}{2}n_{TSX}u\left( {n_{TSX}u + 2p_{LOH}} \right)N} \right)}} $$

(33)

where f_LOH is the excess LOH found at the locus of the unknown tumour suppressor. Again, we know that malignancy is very rare, with a lifetime risk of around 0.2% [2]. In the limit that P(malignancy) is small, the product n_TSX up_LOH N must also be small. Taylor expansion of Eq. (33) with respect to N yields

$$f_{LOH} \approx \frac{{2p_{LOH}}}{{2p_{LOH} + n_{TSX}u}} + O\left( {n_{TSX}up_{LOH}N} \right)$$

(34)

that is, f_LOH should approach a constant value when N is sufficiently small. Substituting our estimates of u, p_LOH and n_TSX into (28), one can show that this approximation holds when N ≪ 10¹² cells. A tumour with 10¹² cells would be 18 cm in diameter. This is a hundred times more massive than the majority of schwannomas [54, 55]: it should not even be possible for a tumour this large to form inside the cerebellopontine angle. Equation (34) should therefore be a good approximation.

Because f_LOH is insensitive to tumour size N, it should be possible to estimate n_TSX from experimental f_LOH figures even when the sizes and ages of the sampled tumours are unknown. Equation (34) implies:

$$n_{TSX} \approx \frac{{2p_{LOH}}}{u}\frac{{1 - f_{LOH}}}{{f_{LOH}}}$$

(35)

Without an LOH survey of malignant schwannoma samples, this formula cannot yet be used to provide a new estimate of n_TSX. It is presented here for future work.

Returning to the second point about whether or not this experiment would be realistic, we should now estimate f_LOH. Substituting the estimates p_LOH = 7.97 × 10⁻⁸, u = 4.48 × 10⁻¹⁰ and n_TSX ≈1245 into Eq. (34) yields

$$f_{LOH} \approx \frac{{2p_{LOH}}}{{2p_{LOH} + n_{TSX}u}} = 22\%$$

This is a large enough value that a pilot study with only 15 samples is likely to detect something: the probability that no samples display excess LOH, assuming the above estimates, is (1–0.22)¹⁵ = 2.4%.

It is notable that (34) is independent of tumour size N. A study of biopsies from malignant VS tumours should therefore not detect an association between LOH and tumour size. It should be remembered that this assumes selective neutrality of s_l and s_m—i.e., TSX is haplosufficient.

Modelling excess risk of malignancy following irradiation

The main mechanism of mutagenesis following radiation is the induction and misrepair of double-strand breaks (DSBs) [57,58,59,60]. Our model of radiation mutagenesis can be summarised in simple terms: following a large dose of radiation D, a DSB occurs at a given base pair with some small probability p_DSB (D). The DSB is then repaired, with a probability ϵ of faulty repair. There are m_TSX such sites on gene TSX where misrepair results in TSX being deactivated. If a DSB occurs on a gene, and the repair was faulty, and it was on one of the sensitive sites, then one copy of TSX will be inactivated. These are taken to be independent events. The probability P(TSXlost) to inactivate one copy of TSX in a given cell is therefore

$$P\left( {TSXlost} \right) = p_{DSB}\left( D \right) \times P\left( {misrepair} \right) \times \left( {number\;of\;sensitivesites} \right) = p_{DSB}\left( D \right){\it{\epsilon }}m_{TSX}$$

(36)

The repair error rate ϵ, probability of DSB induction p_DSB, and number of radiosensitive sites m_TSX now need to be specified.

There are two main mechanisms of repair following irradiation and the induction of double-strand breaks. These are non-homologous end joining (NHEJ), and homologous repair (HR) [58,59,60]. NHEJ is thought to have a high error rate, and HR a low (but nonzero) one [58]. In addition to NHEJ, there are several other more mutagenic repair mechanisms that take over when HR is suppressed [57]. At low-dose rates, HR is supposed to be the primary repair mechanism, with NHEJ and alternative mechanisms taking over at higher dose rates [57, 58, 61]. Misrepair is much more common at large dose rates of radiation, in excess of 0.5 cGy/min [61,62,63].

DSB misrepair is associated with the introduction of small insertions and deletions (“indels”) [58]. To model the number of sensitive sites, we will use the parameter m_TSX from the section “Modelling incidence of sporadic vestibular schwannoma”, because this was the number of indel-sensitive locations. From Eq. (9), m_TSX ≈ 0.74l_TSX, or in terms of n_TSX, the total number of sensitive sites on TSX,

$$m_{TSX} = \frac{{0.74}}{{1.74}}n_{TSX} = 0.42n_{TSX}$$

(37)

The therapies we are modelling have high dose rates, on the order of several Grays per minute: this is far outside the low-dose rate regime studied by Stenerlöw et al. [63]. The repair in this regime will therefore be relatively error-prone, similar to that studied by Rothkamm et al. In this high dose rate regime, as many as 50% of the repairs may be faulty [62]. Strictly speaking, the error rate should depend on both dose D and dose rate $\dot D$: i.e., ${\it{\epsilon }}\left( {D,\dot D} \right)$. In the absence of sufficiently well-developed models, we will instead take ϵ = 50% = constant [62], and leave more detailed models of DSB repair to future work. The figure of ϵ = 50% is a pessimistic upper bound: this section should then establish an upper limit on the associated risk.

The only term in the above model that has not been fixed is p_DSB (D), the probability for a DSB to be induced. This has been measured directly, at the same energies and doses of gamma rays that are used in therapy [64]. In the 0–50 Gray regime, the results are well-described by a simple linear interpolation,

$$p_{DSB}\left( D \right) \approx kD$$

(38)

with k = 3.90 × 10⁻⁷ per Gray per base pair. Over the entire genome, this corresponds to about 300 induced DSBs per Gray of radiation. This is substantially higher than older measurements of mutagenesis using X-rays—this is consistent with the understanding that gamma radiation is more mutagenic [61, 63, 64].

To result in malignancy, the cell carrying the new mutation has to survive. Following a dose of ionising radiation, a fraction of any population of cells will fail to replicate, and die. We will assume that all cells in the tumour have the same probability S(D) to survive the dose D administered. Following current models in radiobiology, we will use a “linear-quadratic” form for S(D) [65, 66]:

$$S\left( D \right) = {{{{{{{\mathrm{exp}}}}}}}}\left( { - \alpha D - \beta D^2} \right)$$

(39)

and take α = 0.77 Gy⁻¹ and β = 0.31 Gy⁻² [65, 66].

Combining Eqs. (36), (38), and (39), the probability that after a dose of radiation, a given cell loses one copy of TSX and survives is

$$P\left( {TSXlostandcellsurvives} \right) = kD{\it{\epsilon }}m_{TSX}S\left( D \right)$$

(40)

Because kD will be on the order of 10⁻⁵, and ϵ and S both on the order of 1, we can only expect this probability to be very large when m_TSX ≈10⁵. The largest known gene in the human genome, dystrophin, is 2.5 Mb long—the majority are much smaller [67]. It should be safe to assume that any candidate TSX will be short enough that the probability Eq. (40) is much smaller than 1.

Not all cells will become cancerous when hit by a dose of radiation. Cells that still have two copies of TSX can lose one copy and still retain one functional copy. Cells that have already lost one copy of TSX may become malignant if they lose the other. The only subpopulations that will be sensitive to irradiation are therefore N_l and N_m from Fig. 5.

The probability that a tumour gains a new malignant clone following irradiation should therefore be:

$$P\left( {TSXlostinatleast1cell} \right) = 1 - \left( {1 - kD{\it{\epsilon }}m_{TSX}S\left( D \right)} \right)^{N_l + N_m} \\ \approx 1 - {{{{{{{\mathrm{exp}}}}}}}}\left( { - k{\it{\epsilon }}m_{TSX}DS\left( D \right)\left( {N_l + N_m} \right)} \right) \\ \approx 1 - {{{{{{{\mathrm{exp}}}}}}}}\left( { - k{\it{\epsilon }}m_{TSX}\left( {n_{TSX}u + p_{LOH}} \right)DS\left( D \right)N_k} \right) \\ \approx 1 - {{{{{{{\mathrm{exp}}}}}}}}\left( { - k{\it{\epsilon }}m_{TSX}\left( {n_{TSX}u + p_{LOH}} \right)DS\left( D \right)N} \right)$$

(41)

where we have used the solutions (25) and the approximation N_k ≈ N. The total number of cells in the tumour N can again be related to the volume of the tumour using Eq. (29).

Equation (41) gives the probability that the gene TSX will be lost. This can only result in malignancy if it has not been lost already. The probability to have developed a malignancy after a dose D should be

$$Pr\left( {malignancy} \right) = \; P\left( {TSXlostinatleast1cell} \right)Pr\left( {nomutantclonebefore} \right) \\ + Pr\left( {mutantclonebefore} \right) \\ = \; P\left( {TSXlostinatleast1cell} \right)\left( {1 - Pr\left( {mutantclonebefore} \right)} \right) \\ + Pr\left( {mutantclonebefore} \right)$$

(42)

Current practice defines the excess risk E.R. of malignant transformation as the absolute risk difference [68]: the difference between the probability Pr(malignancyafterirradiationdoseD) and the probability of a malignancy having occurred spontaneously beforehand, Pr(mutantclonebefore):

$$E.R.\left( D \right) = Pr\left( {malignancyafterirradiationdoseD} \right) - Pr\left( {mutantclonebefore} \right)$$

(43)

From Eqs. (42) and (43), it should be clear that

$$E.R.\left( D \right) = \left( {1 - Pr\left( {mutantclonebefore} \right)} \right)Pr\left( {TSXlostinatleast1cell} \right)$$

and finally, substituting Eq. (27) for Pr(mutantclonebefore) into the above yields the main result,

$$E.R.\left( D \right) = \left( {1 - {{{{{{{\mathrm{exp}}}}}}}}\left( { - k{\it{\epsilon }}m_{TSX}\left( {n_{TSX}u + p_{LOH}} \right)DS\left( D \right)N} \right)} \right) \\ { \times {{{{{{{\mathrm{exp}}}}}}}}\left( { - \frac{1}{2}n_{TSX}u\left( {n_{TSX}u + 2p_{LOH}} \right)N} \right)}$$

(44)

Some initial observations can be made. E.R.(D) initially increases linearly in dose D, with no lower threshold. At some point, the exponential fall-off of S(D) will cause E.R. to peak, then decrease rapidly. The stochastic effect of radiation-induced mutations is eventually overcome by the deterministic effect of necrotic cell death. It is remarkable that the linear dependence at low doses, below around 2 Gray, is consistent with the linear no threshold model [65].

Dose fractionation

Equation (44) assumes that the dose D is delivered in a single fraction. However, it is more common to deliver treatments in multiple fractions. This has been demonstrated to provide effective tumour control whilst also reducing side effects in healthy tissue [69, 70]. Fractionation schedules need to be chosen in order to balance what are typically referred to as the “5 R’s of radiotherapy”: repopulation, repair, redistribution, reoxygenation and radiosensitivity. Fractionation is beneficial from the perspective of tumour control, as it allows for the redistribution and reoxygenation of tumour cells during the course of treatment, whilst also allowing healthy cells a chance to repair and repopulate.

Fractionated doses should be administered so that tumour cells have no or little time to recover. Delivered in rapid enough succession, the total dose D will be what matters, and a tiny proportion of tumour cells will survive due to the rapid fall-off in S(D). How closely doses need to be spaced to prevent tumour cell recovery likely depends on the cell-cycle time. This is the rationale behind scheduling fractions to be delivered every day, or every 2 days.

A typical cell-cycle time for Schwann precursor cells is not known to a high degree of precision. The b = 25.5/yr estimated in section “Parameter estimation” corresponds to a cell-cycle time of around 14 days, but this was for wild-type precursor cells, not tumour cells. We can estimate the cell-cycle time for tumour cells using tumour expansion speed c and Schwann cell volume V_SC via a scaling argument:

$$c \approx V_{SC}^{1/3}b \\ \Rightarrow b \approx \frac{c}{{V_{SC}^{1/3}}} \\ \approx 90 - 272yr^{ - 1}$$

(45)

with V_SC = 1.6 × 10⁻⁶ mm³ from ref. [52], as in Eq. (29); and c = 1–3 mm/yr from ref. [6]. This rough estimate should be consistent with estimates from both Fisher–Kolmogorov and Eden model approaches to solid tumour growth [71, 72]. Note that this estimate is faster than the precursor cell division rate, which is consistent with tumour cells having a small selective advantage.

The surface growth rate b estimated in this way corresponds to a cell division timescale of 1–4 days. As long as fractionated doses are spaced more closely than this, they should have the same effect on tumour cells as a single large dose, and the excess risk will be governed by Eq. (44), and not Eq. (47), below.

However, enough is unknown about the dynamics of Schwann cells in vivo that we should also consider a pessimistic scenario, in which tumour cells manage to fully recover between fractionated doses. This should give us an upper bound—the worst-case scenario—on the risk of properly fractionated therapy, with Eq. (44) giving the lower bound—the best case scenario.

In this worst-case scenario, we treat the risk of mutations being induced following each fractional dose as completely independent events:

$$P\left( {TSXlostafter\;Fdoses} \right) = 1 - \left( {1 - P\left( {TSXlost,D/F} \right)} \right)^F$$

which implies

$$P\left( {TSXlostafterFdoses} \right) = 1 - \left( {1 - P\left( {TSXlost,D/F} \right)} \right)^F \\ = 1 - {{{{{{{\mathrm{exp}}}}}}}}\left( { - k{\it{\epsilon }}m_{TSX}\left( {n_xu + p_{LOH}} \right)N\frac{D}{F}S\left( {D/F} \right)} \right)^F \\ = 1 - {{{{{{{\mathrm{exp}}}}}}}}\left( { - k{\it{\epsilon }}m_{TSX}\left( {n_xu + p_{LOH}} \right)NDS\left( {D/F} \right)} \right)$$

(46)

and the excess risk associated with a dose D delivered in F fractions is

$$\begin{array}{*{20}{c}} {E.R.\left( {D,F} \right) = \left( {1 - {{{{{{{\mathrm{exp}}}}}}}}\left( { - k{\it{\epsilon }}m_{TSX}\left( {n_{TSX}u + p_{LOH}} \right)NDS\left( {D/F} \right)} \right)} \right)} \\ { \times {{{{{{{\mathrm{exp}}}}}}}}\left( { - \frac{1}{2}n_{TSX}u\left( {n_{TSX}u + 2p_{LOH}} \right)N} \right)} \end{array}$$

(47)

The likelihood of inducing a new mutation is essentially unchanged, but the likelihood of its survival S(D/F) now depends strongly on the number of fractions F. This has important implications in section “Excess risk of malignancy following radiotherapy”.

Results

Sporadic vestibular schwannoma

The answer to our first research question, on modelling the risk of benign vestibular schwannoma, is that observed incidence can be explained well with a mechanistic, three-hit model involving NF2, SMARCB1, and a hypothetical oncogene GFX. A comparison of the model’s fit to empirical cumulative incidence curves from Evans et al. is shown in Fig. 3 [1]. The simple model in section “Modelling incidence of sporadic vestibular schwannoma” has a clear interpretation for each of its parameters, which allowed us to find new estimates for the mutation rate u of SNVs and indels, and the rate r_LOH of LOH on 22q, which improve on current estimates for Schwann cells (see Fig. 1). The technique of estimating parameters and bootstrapping uncertainties from measurements of the relative frequencies f_LOH and f_SMARCB1, derived from experimental data, should also be applicable to other neoplasias (see section “Uncertainties in parameters” and Fig. 1).

The estimation technique for r_LOH is very sensitive to the experimental frequency f_LOH of LOH events. If an experiment can’t detect all forms of LOH present, r_LOH will be underestimated. We have tried to anticipate this by drawing on studies that explicitly account for copy-neutral LOH [5]. Nonetheless, it is natural to ask why our estimate of r_LOH is almost 70 times lower than a previous estimate for colorectal cancer [13]. First, the cell division rate b is much slower in glia (b = 25.5/yr [25]) than in the colonic crypt (b = 73/yr [73]), so the values of r_LOH cannot be directly compared. We should instead compare p_LOH = r_LOH/b, the probability of an LOH event per division, and account for uncertainties in both estimates of p_LOH.

Performing a similar bootstrapping procedure for the estimates of p_LOH for colorectal crypt cells by resampling data from Huang et al. and using the same calculation as Paterson et al. to determine p_LOH confirms that there is still a statistically significant difference—a factor of 23 (see Fig. 7) [13, 74]. This stark difference suggests that rates of LOH are simply different between glial cells and colonic crypt cells, even controlling for cell division frequency, just as mutation rates vary substantially between tissues and tumour types [35, 73, 75]. Herrero-Jimenez et al. reported that p_LOH may vary by a factor of 30 between colonic crypt and hematopoietic stem cells, so a difference of a factor of 23 between different tissues is quite possible [76]. Such striking differences in p_LOH and mutation rates with tissue may suggest a coupling of mechanisms of DNA (mis)repair to cell differentiation [77].

**Fig. 7: Relative frequency p_LOH of LOH in Schwann cells and colonic crypt cells.**

As a result of the efforts of the Pan-Cancer Analysis of Whole Genomes Consortium (PCAWGC), a wealth of gene- and tumour-specific data on the prevalence of LOH is now available [75, 78]. While neither study seems to contain vestibular schwannoma, MPNST, or related tumours such as meningioma or oligodendroglioma, Gerstung et al. do report several interesting copy number alterations and hits to TP53 in glioblastoma multiforme (GBM). Copy number losses on 17p and coding mutations in TP53 occur with a similar frequency, f_LOH (17p) ≈27% ([75] Fig. 3g). As such, our finding of a comparable rate of LOH r_LOH ≈ 2.03 × 10⁻⁶/yr on 22q and mutation rate μ_NF2 ≈1.5 × 10⁻⁶/yr (see Table 1) of NF2 might hold more generally for tumours derived from glial precursor cells: while it is not possible to say with certainty without additional modelling, μ_TP53 and r_LOH (17p) (on the same loci) must at least have similar orders of magnitude for alterations to occur with such similar prevalence. Furthermore, the same study observed copy number losses on 22q in 44% of cases, which is consistent with a role for NF2 in GBM [79]. Conversely, loss-of-function variants in TP53 are unheard of in vestibular schwannoma [80].

This same work of the PCAWGC found that these hits to TP53 must occur “early” rather than “late”, and LOH on 22q must occur “late” in GBM [75]. If there is a role for NF2 in GBM, it is therefore likely to be a relatively “late” driver. In contrast, our model for malignant transformation in vestibular schwannoma suggests that the last alterations before malignancy should be in an unidentified tumour suppressor (or suppressors) TSX, with NF2 already being inactivated or lost in the benign tumour. If there is any role for TP53 in malignant schwannoma, it can therefore be expected to be “late”, with NF2 being “early”. This is the reverse of the case in GBM [75]. Loss-of-function of TP53 has been speculated to be involved in the malignant transformation of VS [7].

The number of sensitive sites on the third hit GFX, n_GFX, was found to be 2002. This high value may indicate that there are several oncogenes that contribute to sporadic incidence, representing a high level of genetic diversity. This is also likely to be a slight overestimate of the real figure, and may be revised once LZTR1 is taken into account.

The posterior distribution for n_GFX was found to be bimodal, with two peaks around 700 and 2000 (see Fig. 4). We believe this is due to the low rate of SMARCB1-positive samples in our experiments, which implies a high uncertainty in Eq. (13). A larger sample size for SMARCB1 levels would be very helpful to better constrain n_GFX.

It is also interesting to note that the theory developed in section “Modelling incidence of sporadic vestibular schwannoma” predicts that there should be no association between the relative frequency f_LOH of LOH and patient age. The same holds for f_SMARCB1. This is a consequence of neutrality: a change in allele frequency over time would indicate selection for one of the intermediate genotypes in Fig. 2 [81]. Previous work also suggests that indirect knowledge of selective advantages may be contained in the orders of appearance of mutations, even when the full dynamics over time are not available [13].

Malignant transformation

Our second goal was to establish the risk of sporadic malignant transformation in an untreated schwannoma. The model in the section “Modelling sporadic malignant transformation in schwannoma” culminates in Eq. (30): a simple linear relationship between risk of malignancy and tumour volume V. The most interesting finding here is that most of the observed lifetime risk of malignancy can be explained by sporadic incidence (see Fig. 6) [2]. If malignant transformation is associated with the loss-of-function of an unidentified tumour suppressor TSX, then plausible values for n_TSX and tumour volume can explain much of the observed lifetime risk (see Fig. 6).

This should be qualified by the uncertainty involved. The tumour suppressor TSX is currently unidentified, so a wide range of estimates for risk can be produced by adjusting n_TSX. Furthermore, our early estimates of n_TSX suggest a value on the order of 1245, which while not impossible is relatively high. This may suggest that there is a degree of genetic diversity that our modelling approach has not yet captured. On the other hand, there are tumour suppressors implicated in other cancers known to have higher n_gene values, so there is some chance that one gene explains most of the incidence [13].

To try and locate the genes involved in malignancy, we sketch an experimental study in the section “LOH in malignant schwannoma”. Our calculations suggest that “excess” LOH should be a frequent enough occurrence to detect by profiling at least 15 tumours. This is probably achievable by looking for gross copy number changes and SNP arrays—but higher resolution approaches like next-generation sequencing would be much more sensitive to copy-neutral LOH and smaller deletions [82,83,84,85]. While not a detailed experimental design, this sketch suggests that such an experiment should be feasible.

As was the case with the model of sporadic incidence, these levels of LOH should not be associated with tumour size in any significant way. This is a consequence of neutrality: an association with tumour size would indicate selection for one of the intermediate mutants [81], and thus point to haploinsufficiency of the implicated gene.

Excess risk of malignancy following radiotherapy

Our final goal was to develop an a priori model of the excess risk of malignant transformation following irradiation. For both unfractionated radiosurgery and for fractionated radiotherapy under ideal conditions, the excess risk of malignancy was found to be totally negligible for realistic doses. As can be seen in Fig. 8, the excess risk of malignant transformation for a total dose of >10 Gray is remote, less than 1 in a billion. This should hold for a wide range of tumour sizes and values of n_TSX. This is consistent with previous studies that found radiotherapy had no detectable association with new malignancies [3, 86].

**Fig. 8: Our model for the excess risk of malignancy associated with irradiation, from Eq. (44).**

For lower doses, on the order of 1–4 Gray, the risk is much greater. These doses are much lower than therapeutic targets, so current best practice should have a minimal risk. However, for cases where there are many diffuse tumours in the same region, it may not be possible to target individual tumours with optimal dosing. The relative risk for these cases should be revisited as a priority, ideally combining the above model with medical imaging data.

It must be emphasised that our conclusions about radiotherapy are theoretical, and an indirect extrapolation from experiments and incidence studies. We have tried to account for the inevitable uncertainty in this by erring on the side of pessimism: for example, with regards to the DSB repair error rate ϵ in section “Modelling excess risk of malignancy following irradiation”. This is so that we can estimate a reasonable upper bound on risk, despite the unknowns.

If doses are spaced so that fewer than one cell-cycle passes for the tumour cells, so that the cells “see” the fractionated doses as a single, large dose, then the risk of malignancy should be negligible. We estimate the cell division timescale to be on the order of 1–4 days (section “Dose fractionation”). If at least 4 Gray can be delivered each cell cycle, the excess risk of malignancy should be less than the expected lifetime risk, even for pessimistic estimates of n_TSX (see Fig. 8).

Slow-growing schwannomas can probably be treated safely at a minimum dose rate of 1 Gray per day. Faster growing tumours could justify more aggressive treatment: the 3 mm/yr cases from Paldor et al. 2016 might require 4 Gray per day to ensure that we are on the right side of the peak risk from Fig. 8 [6]. It would be interesting to learn how this difference in growth rates is correlated with specific genetic alterations.

This does assume that solid growth can be reliably distinguished from inflammation and “pseudo-progression”, which may not always be possible.

For completeness, we also studied the “worst case scenario” in which fractionated therapy is delivered in independent doses several days apart. This would maximise the chance that tumour cells “recover” from the deterministic effects of cell death, and go on to divide again. This should not be the case in practice [69].

As the number of fractions increases, the proportion of mutant cells that survive the therapy also increases. As a result, poorly administered hyperfractionated therapy shows a much higher risk for realistic doses and fractions. This risk overtakes the lifetime risk at F ≈10 fractions, and may grow as high as ≈10% (see Fig. 9). This enhancement of risk is attributable to the increased survival of mutants following therapy, and not due to enhanced mutagenicity as such (see Fig. 9).

Fig. 9: Worst-case scenario excess risk of radiation-associated malignant transformation for tumours of diameters 20mm (+ signs), 30 mm (× signs) and 40 mm (◦ signs) each receiving a dose of 50 Gray, if the two hits are a hypothetical gene with m_TSX = 100.

These results must be qualified by the uncertainties in the identity of the hypothetical tumour suppressor (or suppressors) TSX, and also in the dynamics of cell division, DNA repair and survival. The assumption that the dose is homogeneous may also be violated for multifocal tumour clusters commonly observed in NF2 patients [20]. In addition, our model for the cell survival curves S(D) could be improved upon. This may affect our conclusions regarding the “worst-case scenario”, but probably not best practice.

Discussion

In addition to improved estimates of the underlying mutation rates and dynamics of LOH in Schwann cells, our mechanistic approach to modelling schwannomas has enabled new theoretical estimates of the excess risk of malignant transformation in radiosurgery. This shows that the modelling approach of branching processes on graphs generalises to other neoplasias, enabling new connections and inferences to be made [13]. Furthermore, the uncertainties in this model have raised new questions, creating opportunities for future modelling and experimental work.

Firstly, a better picture of genetic diversity in sporadic VS may be achievable with a more detailed model. This more detailed mechanistic model could include LZTR1 as one of the first hits, as well as explicitly studying multiple candidate third-hit oncogenes. This could be coupled with a comprehensive study of copy number alterations in VS biopsies, as well as the status of SMARCB1, NF2 and LZTR1. This might be achieved by coupling a copy number analysis with an SNP array, or a higher resolution approach leveraging next-generation sequencing [82,83,84,85]. A more complex model would require new experiments to determine its parameters, but the method for doing so should be essentially similar to the method in the section “Parameter estimation”: for each gene included in the new model, measure the proportion of cases where pathogenic variant alleles were detected, and compare the frequencies f_gene to those predicted by the model.

One important limitation of our approach is that the mutation rate u does not distinguish between transitions and transversions, and the underlying model assumes all substitutions are equally likely [28, 29]. Different base-pair substitutions are known to have different frequencies in different cancers, but data specific to schwannoma does not yet seem to be available [87]. In the absence of sequence-specific data on transition/transversion frequencies in schwannoma, multi-parameter models of substitutions will run into problems of overfitting. However, a recent study of meningioma, a related tumour, suggests that data on transition and transversion frequencies could be derived in a future study of schwannoma with existing technology and methods [88]. This would allow much more detailed inferences about substitution rates than the single parameter u reported here.

When applied to malignant transformation, our approach showed that the observed lifetime risk can be explained by a simple two-hit model in the growing tumour (see Fig. 6). This theory also predicts excess LOH on the locus of the tumour suppressor (or suppressors) responsible, TSX. Our estimates of n_TSX in section “Parameter estimation” suggest that a promising experimental approach could be to look for LOH in at least ten samples of malignant schwannomas. This might uncover new risk factors, new roles for risk factors known from other tumours such as TP53, as well as markers of malignant transformation. The rarity of malignancy in vestibular schwannoma could also provide an opportunity to observe malignant transformation “in slow motion”, with a large and definite reference population of benign tumours.

The extension of this modelling approach to radiotherapy suggested that at tumour sizes and radiation doses typical of therapy, the excess risk of malignancy for sporadic schwannoma is negligible. Cells that receive high doses, and thus have the highest likelihood of induced mutations, will be the least likely to survive. Stochastic effects, which include malignancy, should peak at intermediate doses of around 1 Gray. Above about 4 Gray, deterministic effects dominate, and cell survival falls off very rapidly. This dose is well below recommended prescriptions [70, 89].

The excess risk should also be very small for fractionated therapies, as long as dose rates are higher than 4 Gray over the course of one tumour cell cycle, which we estimate at around 4 days (see section “Dose fractionation”). This is broadly in line with current recommendations. Current clinical practice is therefore expected to have a negligible excess risk of radiation-induced malignancy, consistent with previous empirical studies [3, 9, 86].

The highly pessimistic worst-case scenario in which tumour cells fully recover in between doses should be completely avoidable if the most rapidly growing tumours are treated more aggressively (3mm/year expansion = 4 Gray/day) than more common cases. This pessimistic scenario, detailed in section “Dose fractionation”, is unlikely. Nonetheless, it may be interesting to investigate whether there is an association between malignancy and failure to complete treatment. It should also be cautioned that we do not consider cases of familial NF2.

These findings need to be qualified by the uncertainties both in the identity of the relevant genes. These are questions for which experimental answers are urgently needed. As well as revealing new connections between genomics, epidemiology, and microscopic mechanisms, this work underscores the critical need to identify the genes responsible for both tumour initiation and malignant transformation.

Data availability

The datasets generated and analysed during this study are available from the corresponding author on reasonable request.

Code availability

All relevant Python code is available in the supplemental material, alongside instructions regarding versioning and dependencies.

References

Evans DGR, Moran A, King A, Saeed S, Gurusinghe N, Ramsden R. Incidence of vestibular schwannoma and neurofibromatosis 2 in the North West of England over a 10-year period: higher incidence than previously thought. Otol Neurotol. 2005;26:93–97.
Kshettry VR, Hsieh JK, Ostrom QT, Kruchko C, Barnholtz-Sloan JS. Incidence of vestibular schwannomas in the United States. J neuro-Oncol. 2015;124:223–8.
Article Google Scholar
Håvik AL, Bruland O, Myrseth E, Miletic H, Aarhus M, Knappskog PM, et al. Genetic landscape of sporadic vestibular schwannoma. J Neurosurg JNS. 2017;128:911–22.
Article Google Scholar
Woods R, Friedman J, Evans DGR, Baser ME, Joe H. Exploring the “two-hit hypothesis” in NF2: tests of two-hit and three-hit models of vestibular schwannoma development. Genet Epidemiol. 2003;24:265–72.
Article PubMed Google Scholar
Carlson ML, Smadbeck JB, Link MJ, Klee EW, Vasmatzis G, Schimmenti LA. Next generation sequencing of sporadic vestibular schwannoma: necessity of biallelic NF2 inactivation and implications of accessory non-NF2 variants. Otol Neurotol. 2018;39:e860–e871.
Article PubMed Google Scholar
Paldor I, Chen AS, Kaye AH. Growth rate of vestibular schwannoma. J Clin Neurosci. 2016;32:1–8.
Article PubMed Google Scholar
Demetriades AK, Saunders N, Rose P, Fisher C, Rowe J, Tranter R, et al. Malignant transformation of acoustic neuroma/vestibular schwannoma 10 years after gamma knife stereotactic radiosurgery. Skull Base. 2010;20:381–7.
Article PubMed PubMed Central Google Scholar
King AT, Rutherford SA, Hammerbeck-Ward C, Lloyd SK, Freeman SR, Pathmanaban ON, et al. Malignant peripheral nerve sheath tumors are not a feature of neurofibromatosis type 2 in the unirradiated patient. Neurosurgery. 2017;83:38–42.
Goldbrunner R, Weller M, Regis J, Lund-Johansen M, Stavrinou P, Reuss D, et al. EANO guideline on the diagnosis and treatment of vestibular schwannoma. Neuro-Oncol. 2020;22:31–45.
Article PubMed Google Scholar
Carlson ML, Jacob JT, Habermann EB, Glasgow AE, Raghunathan A, Link MJ. Malignant peripheral nerve sheath tumors of the eighth cranial nerve arising without prior irradiation. J Neurosurg JNS. 2016;125:1120–9.
Article Google Scholar
Hadfield KD, Smith MJ, Urquhart JE, Wallace AJ, Bowers NL, King AT, et al. Rates of loss of heterozygosity and mitotic recombination in NF2 schwannomas, sporadic vestibular schwannomas and schwannomatosis schwannomas. Oncogene. 2010;29:6216–21.
Sughrue ME, Yeung AH, Rutkowski MJ, Cheung SW, Parsa AT. Molecular biology of familial and sporadic vestibular schwannomas: implications for novel therapeutics. J Neurosurg. 2011;114:359–66.
Article CAS PubMed Google Scholar
Paterson C, Bozic I, Clevers H. A mathematical model of colorectal cancer initiation. Proc Natl Acad Sci USA. 2020;117:20681–88.
Article CAS PubMed PubMed Central Google Scholar
Moolgavkar SH, Luebeck EG. Multistage carcinogenesis: population-based model for colon cancer. JNCI: J Natl Cancer Inst. 1992;84:610–8.
Article CAS PubMed Google Scholar
Hadfield KD, Newman WG, Bowers NL, Wallace A, Bolger C, Colley A, et al. Molecular characterisation of SMARCB1 and NF2 in familial and sporadic schwannomatosis. J Med Genet. 2008;45:332–9.
Article CAS PubMed Google Scholar
Paganini I, Capone GL, Vitte J, Sestini R, Putignano AL, Giovannini M, et al. Double somatic SMARCB1 and NF2 mutations in sporadic spinal schwannoma. J Neuro-Oncol. 2018;137:33–38.
Article CAS Google Scholar
Sadler KV, Bowers NL, Hartley C, Smith PT, Tobi S, Wallace AJ, et al. Sporadic vestibular schwannoma: a molecular testing summary. J Med Genet. 2021;58:227–33.
Article CAS PubMed Google Scholar
Kurtz TG. Limit theorems for sequences of Jump Markov processes approximating ordinary differential processes. J Appl Probab. 1971;8:344–56.
Article Google Scholar
Stivaros SM, Stemmer-Rachamimov AO, Alston R, Plotkin SR, Nadol JB, Quesnel A, et al. Multiple synchronous sites of origin of vestibular schwannomas in neurofibromatosis Type 2. J Med Genet. 2015;52:557–62.
Article CAS PubMed Google Scholar
Dewan R, Pemov A, Kim HJ, Morgan KL, Vasquez RA, Chittiboina P, et al. Evidence of polyclonality in neurofibromatosis type 2–associated multilobulated vestibular schwannomas. Neuro-Oncol. 2015;17:566–73.
Article CAS PubMed Google Scholar
Nowell PC. The clonal evolution of tumor cell populations. Science. 1976;194:23–28.
Article CAS PubMed Google Scholar
Fearon ER, Hamilton SR, Vogelstein B. Clonal analysis of human colorectal tumors. Science. 1987;238:193–7.
Article CAS PubMed Google Scholar
Armitage P, Doll R. The age distribution of cancer and a multi-stage theory of carcinogenesis. Br J Cancer. 1954;8:1.
Article CAS PubMed PubMed Central Google Scholar
Naguib NNN, Hey C, Shaaban MS, Elabd AM, Hassan HHM, Gruber-Rouh T, et al. Assessment of the cochlear nerve to facial nerve size ratio using MR multiplanar reconstruction of the internal auditory canal in patients presenting with acquired long-standing hearing loss. Br J Radiol. 2017;90:20160870 https://doi.org/10.1259/bjr.20160870.
Article PubMed PubMed Central Google Scholar
Gupta R, Steward O. Chronic nerve compression induces concurrent apoptosis and proliferation of Schwann cells. J Compar Neurol. 2003;461:174–86.
ENA Archive: Sequence CAA76992. https://www.ebi.ac.uk/ena/browser/view/CAA76992.
ENA Archive: Sequence CR456581. https://www.ebi.ac.uk/ena/browser/view/CR456581.
Jukes TH, Cantor CR. Evolution of protein molecules. Mamm Protein Metab. 1969;3:21–132.
Article CAS Google Scholar
Posada D, Crandall KA. Selecting models of nucleotide substitution: an application to human immunodeficiency virus 1 (HIV-1). Mol Biol Evol. 2001;18:897–906.
Article CAS PubMed Google Scholar
Kimura M. A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J Mol Evol. 1980;16:111–20.
Article CAS PubMed Google Scholar
Tamura K, Nei M. Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Mol Biol Evol. 1993;10:512–26.
CAS PubMed Google Scholar
Tavaré S. Some probabilistic and statistical problems in the analysis of DNA sequences. Lectures Math Life Sci. 1986;17:57–86.
Google Scholar
Hulsebos TJ, Kenter S, Verhagen WI, Baas F, Flucke U, Wesseling P. Premature termination of SMARCB1 translation may be followed by reinitiation in schwannomatosis-associated schwannomas, but results in absence of SMARCB1 expression in rhabdoid tumors. Acta Neuropathol. 2014;128(Sep):439–48.
Article CAS PubMed Google Scholar
Hexter A, Jones A, Joe H, Heap L, Smith MJ, Wallace AJ, et al. Clinical and molecular predictors of mortality in neurofibromatosis 2: a UK national analysis of 1192 patients. J Med Genet. 2015;52:699–705.
Article CAS PubMed Google Scholar
Bozic I, Antal T, Ohtsuki H, Carter H, Kim D, Chen S, et al. Accumulation of driver and passenger mutations during tumor progression. Proc Natl Acad Sci USA. 2010;107:18545–50.
Article CAS PubMed PubMed Central Google Scholar
Frampton GM, Fichtenholtz A, Otto GA, Wang K, Downing SR, He J, et al. Development and validation of a clinical cancer genomic profiling test based on massively parallel DNA sequencing. Nat Biotechnol. 2013;31:1023–31.
Article CAS PubMed PubMed Central Google Scholar
Lu Y, Soong TD, Elemento O. A novel approach for characterizing microsatellite instability in cancer cells. PLoS ONE. 2013;8:e63056.
Article CAS PubMed PubMed Central Google Scholar
Chen J, Guo JT. Comparative assessments of indel annotations in healthy and cancer genomes with next-generation sequencing data. BMC Med Genomics. 2020;13:1–11.
Article Google Scholar
Lauritzen SL. Lectures on contingency tables. Aalborg University Press; 2002.
Lapunzina P, Monk D. The consequences of uniparental disomy and copy number neutral loss-of-heterozygosity during human development and cancer. Biol Cell. 2011;103:303–17.
Article PubMed Google Scholar
Giannikou K, Malinowska IA, Pugh TJ, Yan R, Tseng YY, Oh C, et al. Whole exome sequencing identifies TSC1/TSC2 biallelic loss as the primary and sufficient driver event for renal angiomyolipoma development. PLoS Genet. 2016;12:e1006242.
Article PubMed PubMed Central Google Scholar
Tuyl F, Gerlach R, Mengersen K. A comparison of Bayes-Laplace, Jeffreys, and other priors: the case of zero events. Am Statistician. 2008;62:40–44.
Article Google Scholar
Morgan E, Rozée S: National life tables 2004-5. Office of National Statistics. https://www.ons.gov.uk/peoplepopulationandcommunity/birthsdeathsandmarriages/lifeexpectancies/datasets/nationallifetablesgreatbritainreferencetables.
Tomasetti C, Bozic I. The (not so) immortal strand hypothesis. Stem Cell Res. 2015;14:238–41. https://doi.org/10.1016/j.scr.2015.01.005.
Article CAS PubMed PubMed Central Google Scholar
Keogh MJ, Wei W, Aryaman J, Walker L, Van Den Ameele J, Coxhead J, et al. High prevalence of focal and multi-focal somatic genetic variants in the human brain. Nat Commun. 2018;9:1–12.
Article CAS Google Scholar
Ku HH, et al. Notes on the use of propagation of error formulas. J Res Natl Bur Stand. 1966;70:263–73.
Google Scholar
Efron B. Bootstrap methods: another look at the Jackknife. Ann Stat. 1979;7:1–26. https://doi.org/10.1214/aos/1176344552.
Article Google Scholar
Hesterberg T. What teachers should know about the bootstrap: resampling in the undergraduate statistics curriculum. arXiv:1411.5279. 2014:42–54.
Hanahan D, Weinberg RA. The Hallmarks of cancer. Cell. 2000;100:57–70.
Article CAS PubMed Google Scholar
Antal T, Krapivsky PL, Nowak MA. Spatial evolution of tumors with successive driver mutations. Phys Rev E. 2015;92:022705.
Article Google Scholar
Lewis D, Roncaroli F, Agushi E, Mosses D, Williams R, Li KL, et al. Inflammation and vascular permeability correlate with growth in sporadic vestibular schwannoma. Neuro-Oncol. 2018;21:314–25. https://doi.org/10.1093/neuonc/noy177.
Article PubMed Central Google Scholar
Conlon I, Raff M. Differences in the way a mammalian cell and yeast cells coordinate cell growth and cell-cycle progression. J Biol. 2003;2:7.
Article PubMed PubMed Central Google Scholar
Crist J, Hodge JR, Frick M, Leung FP, Hsu E, Gi MT, et al. Magnetic resonance imaging appearance of schwannomas from head to toe: a pictorial review. J Clin Imaging Sci. 2017;7. https://doi.org/10.4103/jcis.JCIS_40_17.
Carlson ML, Habermann EB, Wagie AE, Driscoll CL, Gompel JJV, Jacob JT, et al. The changing landscape of vestibular schwannoma management in the United States—a shift toward conservatism. Otolaryngol–Head Neck Surg. 2015;153:440–6. https://doi.org/10.1177/0194599815590105.
Article PubMed Google Scholar
Tanbouzi Husseini S, Piccirillo E, Taibah A, Paties CT, Rizzoli R, Sanna M. Malignancy in vestibular schwannoma after stereotactic radiotherapy: a case report and review of the literature. Laryngoscope. 2011;121:923–8.
Article PubMed Google Scholar
Ryland GL, Doyle MA, Goode D, Boyle SE, Choong DY, Rowley SM, et al. Loss of heterozygosity: what is it good for? BMC Med genomics. 2015;8:1–12.
Article CAS Google Scholar
Ceccaldi R, Rondinelli B, D’Andrea AD. Repair pathway choices and consequences at the double-strand break. Trends Cell Biol. 2016;26:52–64.
Article CAS PubMed Google Scholar
Rodgers K, McVey M. Error-prone repair of DNA double-strand breaks. J Cell Physiol. 2016;231:15–24.
Bétermier M, Bertrand P, Lopez BS. Is non-homologous end-joining really an inherently error-prone process? PLoS Genet. 2014;10:e1004086.
Article PubMed PubMed Central Google Scholar
Zucman-Rossi J, Legoix P, Victor JM, Lopez B, Thomas G. Chromosome translocation based on illegitimate recombination in human tumors. Proc Natl Acad Sci USA. 1998;95:11786–91. https://doi.org/10.1073/pnas.95.20.11786.
Article CAS PubMed PubMed Central Google Scholar
Vilenchik MM, Knudson AG, Endogenous DNA. double-strand breaks: production, fidelity of repair, and induction of cancer. Proc Natl Acad Sci USA. 2003;100:12871–6.
Article CAS PubMed PubMed Central Google Scholar
Rothkamm K, Kühne M, Jeggo PA, Löbrich M. Radiation-induced genomic rearrangements formed by nonhomologous end-joining of DNA double-strand breaks. Cancer Res. 2001;61:3886–93.
CAS PubMed Google Scholar
Stenerlöw B, Karlsson KH, Cooper B, Rydberg B. Measurement of prompt DNA double-strand breaks in mammalian cells without including heat-labile sites: results for cells deficient in nonhomologous end joining. Radiat Res. 2003;159:502–10.
Article PubMed Google Scholar
Obeidat M, McConnell KA, Li X, Bui B, Stathakis S, Papanikolaou N, et al. DNA double-strand breaks as a method of radiation measurements for therapeutic beams. Med Phys. 2018;45:3460–5. https://doi.org/10.1002/mp.12956.
Article CAS PubMed Google Scholar
Podgarsak EB. Radiation oncology physics: a handbook for teachers and students. Vienna, Austria: IAEA; 2005.
Linskey ME. Stereotactic radiosurgery versus stereotactic radiotherapy for patients with vestibular schwannoma: a Leksell Gamma Knife Society 2000 debate. J Neurosurg. 2000;93(Supplement 3):90–92.
Article PubMed Google Scholar
Blake DJ, Weir A, Newey SE, Davies KE. Function and genetics of dystrophin and dystrophin-related proteins in muscle. Physiological Reviews. 2002.
Porta M. A dictionary of epidemiology. Oxford, UK: Oxford University Press; 2016.
Mayles P, Nahum A, Rosenwald JC. Handbook of radiotherapy physics: theory and practice. Boca Raton, Florida: CRC Press; 2007.
Beyzadeoglu M, Ozyigit G, Ebruli C. Basic radiation oncology. Berlin, Germany: Springer Science & Business Media; 2010.
Paterson C. Minimal models of invasion and clonal selection in cancer. PhD Thesis, University of Edinburgh. 2018.
Engwer C, Wenske M. Estimating the extent of glioblastoma invasion. J Math Biol. 2021;82:1–24.
Article Google Scholar
Tomasetti C, Vogelstein B. Variation in cancer risk among tissues can be explained by the number of stem cell divisions. Science. 2015;347:78–81.
Article CAS PubMed PubMed Central Google Scholar
Huang J, Papadopoulos N, McKinley AJ, Farrington SM, Curtis LJ, Wyllie AH, et al. APC mutations in colorectal tumors with mismatch repair deficiency. Proc Natl Acad Sci USA. 1996;93:9049–54.
Article CAS PubMed PubMed Central Google Scholar
Gerstung M, Jolly C, Leshchiner I, Dentro SC, Gonzalez S, Rosebrock D, et al. The evolutionary history of 2,658 cancers. Nature. 2020;578:122–8.
Article CAS PubMed PubMed Central Google Scholar
Herrero-Jimenez P, Tomita-Mitchell A, Furth E, Morgenthaler S, Thilly W. Population risk and physiological rate parameters for colon cancer. The union of an explicit model for carcinogenesis with the public health records of the United States. Mutat Res/Fundamental Mol Mechanisms Mutagenesis. 2000;447:73–116.
Article CAS Google Scholar
Sjakste N, Riekstiņa U. DNA damage and repair in the differentiation of stem cells and cells of connective cell lineages: a trigger or a complication? Eur J Histochemistry: EJH. 2021;65:3236.
Google Scholar
The Pan-Cancer Analysis of Whole Genomes Consortium. Pan-cancer analysis of whole genomes. Nature. 2020;578:82.
Article Google Scholar
Petrilli AM, Fernández-Valle C. Role of Merlin/NF2 inactivation in tumor biology. Oncogene. 2016;35:537–48.
Article CAS PubMed Google Scholar
Chen Y, Wang ZY, Wu H. P14ARF deficiency and its correlation with overexpression of p53/MDM2 in sporadic vestibular schwannomas. Eur Arch Oto-Rhino-Laryngol. 2015;272:2227–34.
Article Google Scholar
Rice SH. Evolutionary theory: mathematical and conceptual foundations. Sunderland, MA: Sinauer Associates; 2004.
Van Loo P, Nordgard SH, Lingjærde OC, Russnes HG, Rye IH, Sun W, et al. Allele-specific copy number analysis of tumors. Proc Natl Acad Sci USA. 2010;107:16910–5.
Article PubMed PubMed Central Google Scholar
Van Loo P, Nilsen G, Nordgard SH, Vollan HKM, Børresen-Dale AL, Kristensen VN, et al. Analyzing cancer samples with SNP arrays. In: Next generation microarray bioinformatics. Wang J, Tan AC, Tian T, Editors. Berlin, Germany: Springer; 2012. pp. 57–72.
Schuster SC. Next-generation sequencing transforms today’s biology. Nat Methods. 2008;5:16–18.
Article CAS PubMed Google Scholar
Shendure J, Balasubramanian S, Church GM, Gilbert W, Rogers J, Schloss JA, et al. DNA sequencing at 40: past, present and future. Nature. 2017;550:345–53.
Article CAS PubMed Google Scholar
Rowe J, Grainger A, Walton L, Silcocks P, Radatz M, Kemeny A. Risk of malignancy after gamma knife stereotactic radiosurgery. Neurosurgery. 2007;60:60–66.
Article PubMed Google Scholar
Alexandrov LB, Nik-Zainal S, Wedge DC, Aparicio SA, Behjati S, Biankin AV, et al. Signatures of mutational processes in human cancer. Nature. 2013;500:415–21.
Article CAS PubMed PubMed Central Google Scholar
Tang M, Wei H, Han L, Deng J, Wang Y, Yang M, et al. Whole-genome sequencing identifies new genetic alterations in meningiomas. Oncotarget. 2017;8:17070.
Article PubMed PubMed Central Google Scholar
Apicella G, Paolini M, Deantonio L, Masini L, Krengli M. Radiotherapy for vestibular schwannoma: review of recent literature results. Rep. Practical Oncol Radiother. 2016;21:399–406.
Article Google Scholar
Evans DGR, Huson SM, Donnai D, Neary W, Blair V, Teare D, et al. A genetic study of type 2 neurofibromatosis in the United Kingdom. I. Prevalence, mutation rate, fitness, and confirmation of maternal transmission effect on severity. J Medical Genet. 1992;29:841–6.
Hanley JA, Lippman-Hand A. If nothing goes wrong, is everything all right? Interpreting zero numerators. J Am Med Assoc. 1983;249:1743–5.
Ince EL. Ordinary differential equations. Longmans, Green and Co.; 1926.
Zwillinger D. Handbook of differential equations. Cambridge, Massachusetts, USA: Academic Press; 1998.
Razzaghi M. On the estimation of binomial success probability with zero occurrence in sample. J Mod Appl Stat Methods. 2002;1:41.
Article Google Scholar
Ramsay JO, Hooker G, Campbell D, Cao J. Parameter estimation for differential equations: a generalized smoothing approach. J R Stat Soc: Ser B (Stat Methodol). 2007;69:741–96.
Article Google Scholar
Saltelli A, Ratto M, Andres T, Campolongo F, Cariboni J, Gatelli D, et al. Global sensitivity analysis: the primer. New York City, USA: John Wiley & Sons; 2008.
Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, et al. The human genome browser at UCSC. Genome Res. 2002;12:996–1006.
Article CAS PubMed PubMed Central Google Scholar
Haupt S, Zeilmann A, Ahadova A, Bläker H, von Knebel Doeberitz M, Kloor M, et al. Mathematical modeling of multiple pathways in colorectal carcinogenesis using dynamical systems with Kronecker structure. PLoS Comput Biol. 2021;17:e1008970.
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors are grateful to everyone who contributed comments and constructive criticism to the unfinished manuscript. Particular thanks are due to Joshua D Hellier for constructive discussions regarding our statistical methods; and to Mohammad Obeidat for confirming that Eq. (38) and the corresponding value for k were consistent with his experiments.

Funding

DGE and MJS were supported by all Manchester NIHR Biomedical Research Centre (IS-BRC-1215-20007). MJS was also supported by a USAMRAA CDMRP Neurofibromatosis Research Program, Investigator-Initiated Research Award (W81XWH1910334).

Author information

Authors and Affiliations

Division of Evolution, Infection and Genomics, School of Biological Sciences, University of Manchester, Manchester, UK
Chay Paterson, Miriam J. Smith & D. Gareth R. Evans
Department of Applied Mathematics, University of Washington, Seattle, WA, USA
Ivana Bozic
Manchester Centre for Genomic Medicine, Manchester University NHS Foundation Trust, Manchester, UK
Miriam J. Smith & D. Gareth R. Evans
Radiation Protection Group, Medical Physics, University Hospital Southampton NHS Foundation Trust, Southampton, UK
Xanthe Hoad

Authors

Chay Paterson
View author publications
You can also search for this author in PubMed Google Scholar
Ivana Bozic
View author publications
You can also search for this author in PubMed Google Scholar
Miriam J. Smith
View author publications
You can also search for this author in PubMed Google Scholar
Xanthe Hoad
View author publications
You can also search for this author in PubMed Google Scholar
D. Gareth R. Evans
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

CP and DGRE conceived the research, CP developed the model and performed the parameter estimation and MJS conducted the experimental tests of SMARCB1 status. All authors wrote the manuscript.

Corresponding author

Correspondence to Chay Paterson.

Ethics declarations

Ethics approval and consent to participate

Genetic testing of schwannoma DNA samples was undertaken in accordance with the ethics protocol, 10/H1008/74, approved by the North West—Greater Manchester Central Research Ethics Committee. The samples were collected and analysed in a manner consistent with the Declaration of Helsinki.

Consent to publish

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

counter.py

bootstrap.py

Mathematical and methodological appendices

Reproducibility checklist

Legends to the supplemental material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Paterson, C., Bozic, I., Smith, M.J. et al. A mechanistic mathematical model of initiation and malignant transformation in sporadic vestibular schwannoma. Br J Cancer 127, 1843–1857 (2022). https://doi.org/10.1038/s41416-022-01955-8

Download citation

Received: 28 October 2021
Revised: 13 July 2022
Accepted: 08 August 2022
Published: 12 September 2022
Issue Date: 09 November 2022
DOI: https://doi.org/10.1038/s41416-022-01955-8

Subjects

Abstract

Background

Methods

Results

Discussion

Similar content being viewed by others

Background

Methods

Modelling incidence of sporadic vestibular schwannoma

Parameter estimation

Precursor cell population and division rate

Sensitive sites

Base-pair mutation rates, rates of LOH and n GFX

Uncertainties in parameters

Modelling sporadic malignant transformation in schwannoma

Parameter estimation

LOH in malignant schwannoma

Modelling excess risk of malignancy following irradiation

Dose fractionation

Results

Sporadic vestibular schwannoma

Malignant transformation

Excess risk of malignancy following radiotherapy

Discussion

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent to publish

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links

Base-pair mutation rates, rates of LOH and n _GFX