SOX2 protein biochemistry in stemness, reprogramming, and cancer: the PI3K/AKT/SOX2 axis and beyond

Article metrics

Abstract

Research of the past view years expanded our understanding of the various physiological functions the cell-fate determining transcription factor SOX2 exerts in ontogenesis, reprogramming, and cancer. However, while scientific reports featuring novel and exciting aspects of SOX2-driven biology are published in near weekly routine, investigations in the underlying protein-biochemical processes that transiently tailor SOX2 activity to situational demand are underrepresented and have not yet been comprehensively summarized. Largely unrecognizable to modern array or sequencing-based technology, various protein secondary modifications and concomitant function modulations have been reported for SOX2. The chemical modifications imposed onto SOX2 are inherently heterogeneous, comprising singular or clustered events of phosphorylation, methylation, acetylation, ubiquitination, SUMOylation, PARPylation, and O-glycosylation that reciprocally affect each other and critically impact SOX2 functionality, often in a tissue and species-specific manner. One recurring regulatory principle though is the canonical PI3K/AKT signaling axis to which SOX2 relates in various entangled, albeit not exclusive ways. Here we provide a comprehensive review of the current knowledge on SOX2 protein modifications, their proposed relationship to the PI3K/AKT pathway, and regulatory influence on SOX2 with regards to stemness, reprogramming, and cancer.

An introduction to SOX2 and underexplored aspects of its molecular regulation

The SOX/Sox (SRY homology box) family of proteins comprises 20 individual members in man and mouse [1], of which SOX2/Sox2 is the most explored. SOX proteins are principally defined by a conserved DNA-binding element, the so-called high mobility group (HMG) that relates to a transcriptional master regulator of virility (i.e., SEX determining factor Y, SRY) and thus functionally qualifies SOX/Sox proteins as DNA-binders [2]. Accounting for HMG-residue conservation, the SOX family branches into eight sub-categories (SoxA-H), of which SoxB further divides into SoxB1 and SoxB2 groups reflecting more recent gene duplication events [3, 4]. While Sox proteins contribute to various cellular functionalities, reprogramming capacity is largely confined to members of the SoxB1 group (i.e., Sox1, Sox2, and Sox3) [5]. Early reporter assays involving designed plasmid-borne promoter-reporter constructs assigned an expression co-modulatory activity to C-proximal parts of SOX2, termed “trans-activating domain” (TAD) [6]. However, while the HMG is an evolutionary conserved folding unit defined by cognate sequence elements and structurally resolved in considerable molecular detail (see PDB deposits 1O4X and 2LE4), the TAD remains a functional concept not yet supported by structural data.

Noteworthy, SOX2 imposes transcription modulatory significance often in conjunction with co-factors [7] such as OCT3/4 in ES cells [8, 9] or PAX6 in the visual system primordia [10]. Accordingly, while the well-resolved N-terminus with its contained HMG-domain and superimposed nucleo-cytoplasmatic transport motifs (nuclear localization sequence (NLS) [11] and nuclear export sequence (NES) [12], respectively) provides for proper subcellular distribution and DNA-binding, the C-terminus is assumed to engage in interactions with co-factors. While this concept is compelling, the SOX2 C-terminus nevertheless lacks secondary structural elements classically associated with protein–protein interactions such as ANK-repeats, SH2-, WW-, Bromo-, or RING-domains (to name but a view). Indeed, despite upward 6900 peer-reviewed scientific contributions featuring various aspects of SOX2 biology (pubmed, status June 2019), the protein’s C-terminus remained structurally enigmatic and eludes classical sequence motif prediction tools.

By contrast, the HMG/DNA-interaction has been resolved in considerable detail and a DNA recognition consensus long been defined for SOX2/Sox2 (i.e., CCCATTGTTC in man and CTTTGTC in mouse) [13, 14]. However, with the central TTGT element being the preferred recognition motif of all SOX/Sox proteins and several thousand copies of the extended consensus in the human genome, SOX2 resembles a rather promiscuous transcription factor. A certain degree of selectivity however is imposed by co-factors (see above), although many of which are likewise shared amongst Sox proteins [15, 16]. In reality therefore, the current loading status of an individual consensus site in a particular SOX2+ cell remains effectively unpredictable. To give an example: While an in silico search of the glioblastoma cancer genome identified 4883 potential SOX2 docking sites in advanced human glioma, altered RNA expression was subsequently confirmed for no more but 489 protein coding genes and 105 pre-miRNAs by Chip-seq [17]. Underscoring these numbers, 699 significantly modulated SOX2 downstream targets were more recently identified in nasopharyngeal cancer cells by cDNA microarray [18]. Nevertheless, also the mere occupation of a binding site without concomitant expression modulation of juxtapositioned genes may be a regulatory function, as secondary effects amongst downstream targets may be suppressed this way. Vice versa, expression changes faithfully assigned to SOX2 must not inevitably relate to its presence at a given locus, but may reflect indirect effects. In any case though, SOX2-driven phenotypes, often laid out to the reader as linear causal narratives, are almost inevitably multifactorial.

Interestingly, also the organization and regulation of the SOX2 gene on chromosome locus 3q26.3-q27 are remarkable and in their functional consequences not fully resolved yet. Not only can SOX2 potentiate the expression of its own gene as part of autoregulatory feed-forward mechanisms [19], the single exon SOX2 moreover falls into the intron of a much greater, superimposed genetic element called SOX2OT (SOX2 overlapping transcript). This peculiar gene arrangement is phylogenetically conserved between man and mouse, where Sox2ot was recently shown to modulate Sox2 expression in cortical neuronal progenitors, indeed [20]. This finding may be of particular relevance, as near every analysis of SOX2/Sox2 involves knock-down or knock-out technology, whereas potential side-effects on SOX2OT are rarely considered. Vice versa, an experimental outcome faithfully assigned to SOX2 may further involve SOX2OT functional contributions.

Functional roles of SOX2

SOX2 is probably best known for its technical application in reprogramming where, in conjunction with co-factors (e.g., OCT4, KLF4, and cMYC [21, 22] or subsets thereof [5, 23]), SOX2 enables the derivation of human or murine induced pluripotent stem cells (iPSCs) from terminally differentiated somatic cells. Although a milestone in present day cell biology and clearly of unprecedented potential for future biomedical application, the enforced overexpression of a subset of exogenously introduced transcription factors in otherwise terminally differentiated body cells remains an artificial intervention, and (more relevant in the given context) remains mechanistically not fully elucidated with regard to specific contributions of SOX2.

More instructive in this sense are murine in vivo analyses that illustrate a pivotal physiological significance of Sox2 throughout the vertebrate life cycle, from early embryogenesis to adult tissue homeostasis [24]. In particular: Deletion of Sox2 in the zygote results in early embryonic lethality with concomitant malformation of the epiblast [25]. Throughout morulation, SOX2 expression coincides with pluripotent embryonic stem (ES) cells that later localize to the inner cell mass of the blastula. Underscoring a critical role in ES cell maintenance, both deletion and overexpression of Sox2 impose ES cell differentiation in vitro [8, 26]. In the developing embryo, Sox2 expression is maintained in lineage committed, multipotent progenitor cells mostly of neuronal or epithelial fate [24]. These ontological paths are probably best resolved for the developing (central) nervous system. Here, Sox2 expression has been traced from the cerebral tubing in early embryogenesis to the ventricular layer of the neuronal cortex (where neural stem/progenitor cells reside at mid-fetal development) and finally to the subventricular zone of the lateral ventricle and the granular zone lining the hippocampus (where Sox2 positive NSCs locate in the adult brain) [27]. Another well-defined ontological branch concerns the developing gastro-intestinal tract, where Sox2 expression associates with the engulfing foregut and derived endodermal structures from which esophagus and anterior stomach evolve [28]. In the developing skin, Sox2 positive cells are first detected in the mesenchymal condensates giving rise to the dermal sheath and the dermal papilla [29], while at adulthood Sox2 marks multipotent skin stem cells with multilineage differentiation capacity [30, 31]. Sox2 is further instrumental for proper dental development [32], while its expression in murine optic cup [33] and retina [34] progenitors proved critical for eye development. Underscoring conserved functions in man, hereditary SOX2 haplo-insufficiency was associated with syndromic microphthalmia type 3 (MCOPS3) [35], a disease characterized by micro- or an-ophthalmia and concomitant cognitive and/or developmental impairment [36]. Noteworthy, a strong dependence on Sox2 expression also persists in adulthood as illustrated by an inducible murine Sox2 knock-out model [37] that shows enforced tissue cell apoptosis and onset of lethality within two weeks of Sox2 depletion.

During the last decade, aberrant SOX2 expression has been further associated with various forms of cancer, particularly of epithelial or neuronal origin [38,39,40,41,42,43,44,45,46], for review see [47, 48]. In some but not all of these SOX2 has been linked to cancer stemness. In patients with breast or ovarian cancer for example, SOX2 induction is already observed at early disease stages and further associates with disease progression, metastasis, and relapse [40, 43, 45, 46]. In glioblastoma as another example, elevated SOX2 expression associates with increased cell motility and tumor spreading and is also detected amongst circulating CSC islets [49,50,51]. Further involving SOX2 in cancer stemness, elevated SOX2 expression associates with chemotherapy-resistance effects [52], induces stemness and endothelial-to-mesenchymal-transition gene signatures [17], and promotes clonogenicity and in vivo tumorigenicity in respective model systems [43, 52]. In sum, various lines of experimental evidence underscore connectivity between SOX2 expression and CSCs in some cancers, although likely not all SOX2-positive cancer cells are CSCs and vice versa. Overall the CSC concept is controversially discussed in solid tumors, and plasticity has been noted between stem and non-stem cells [53].

Depending on disease entity and state, aberrant SOX2 expression may either result from gene duplication events or dysregulated transcription of the endogenous locus [47, 48]. As a consequence, divergent focally restricted to ubiquitous intra-tumoral distribution of SOX2 expression has been reported. In cancers in which SOX2 expression associates with stemness, SOX2 occurs mostly in the absence of gene amplification. By contrast, in squamous cell lung carcinoma, SOX2 expression frequently coincides with amplifications at its chromosomal locus, so that SOX2 stains rather homogenously in tumor biopsies and has been described as a lineage-survival factor [38, 54]. Here, SOX2 obviously cannot be considered a distinguishing marker of the CSC compartment.

What is more, although near univocally described as an oncogene and driver of transformation, in gastric and non-small lung cell carcinoma SOX2 expression was in fact reported as a favorable molecular prognostic factor [55,56,57,58,59]. This remains tissue specific and might relate to even more aggressive phenotypes mediated by alternative pathways or external stimuli. As such, Helicobacter pylori infections was shown to influence SOX2 expression in gastric cancer [60, 61], where SOX2 has been described as a trans-activator of PTEN, thus inhibiting PI3K/AKT-driven cell cycle progression and anti-apoptosis effects [55, 62, 63]. External influences also concern non-small cell lung cancer, where a smoking-induced injury response [64] was shown to trigger a premalignant state characterized by PI3K induction and molecularly uncoupled SOX2 expression in basal epithelial cells [64], while in dysplasia expression of SOX2 may depend on FGF triggered MAPK/ERK signaling [65].

Taken together, SOX2 is a cell-fate determining transcription factor that has been functionally implicated in the induction and maintenance of pluripotent iPS and ES cells, multipotent lineage-committed progenitors, and tissue stem cells of mostly epithelial or neural fate. Moreover, SOX2 has been recognized as a powerful oncogene in various cancer types, where it regulates CSCs and functionally relates to several other hallmarks (see Fig. 1 for schematic overview of SOX2 functionality). While the cell biological relevance of SOX2 thus appears clearly defined, the molecular understanding of SOX2 protein regulation remains surprisingly fragmental and comprises various aspects of effective uncertainty, as can be deduced from the examples given above.

Fig. 1
figure1

The versatile roles of pluripotency factor SOX2 in stemness, reprogramming, and cancer. The transcriptional master regulator SOX2 (SEX determining region (SRY) homology box 2) relates to various aspects of naturally occurring, healthy stem and progenitor cells (green), pluripotency induction amongst terminally defined somatic cells (blue), and cancer stemness and progression (red). With regards to stemness in healthy tissues, the protein sustains pluripotency amongst embryonic stem cells (ESCs) in early embryogenesis, co-segregates with lineage committed stem/progenitor cells of mostly epithelial or neuronal fate in advanced ontogenesis, and finally co-localizes with tissue-resident stem cells that warrant for tissue homeostasis and regeneration in adulthood (green). Moreover, SOX2 is an indispensable reprogramming factor that in conjunction with co-factors drives the conversion of terminally differentiated somatic cells into induced pluripotent stem cell conditions (iPSCs, blue). Finally, SOX2 is a powerful oncogene that functionally relates to cancer stemness and various further hallmark functionalities amongst which clonogenicity, tumorigenicity, endothelial-to-mesenchymal transition (EMT), cancer cell mobility, tumor cell dissemination, metastasis, chemotherapy resistance, and relapse (red)

The PI3K/AKT signaling pathway

Key cell biological features such as growth, proliferation, and survival are functional manifestations of highly intertwined underlying molecular networks probably best illustrated by systems biology approaches [66]. Although a crude underestimation of cellular complexity, such cell-biological features have been historically assigned to a hand-full of major cognate signaling cascades or pathways, amongst which the PI3K/AKT axis [67]. This molecular module converts extracellular stimuli (mostly receptor/ligand interaction or ligand-imposed receptor dimerization events at the plasma membrane) via a phosphorylation cascade into molecular downstream responses, and furthermore relates to upward mTORC2 and downstream mTORC1 complexes as associated regulatory relays. Although a myriad of cell-type specific input and output functions have been assigned to the pathway [67], an evolutionary conserved primordial functionality of the PI3K/AKT axis is that of a nutrient sensor. Triggered by insulin or amino acid availability on the cell surface, the PI3K/AKT axis converts information on the current bioavailability of these metabolites into downstream effector functions within the cell [68,69,70,71,72]. In response to insulin for example, PI3K/AKT modulates cellular glucose uptake (more actually the cell surface expression of individual glucose transporters, GLUT) involving AS160, DGKzeta, and PIP5K proteins [73, 74]. In response to amino acids such as glutamate, PI3K/AKT stimulates mTORC1, the mammalian target of rapamycin complex 1, that induces phospho-activation of ribosomal protein S6 (RPS6) by S6 kinase (S6K1) [75, 76] and a release of eIF4E from its inhibitor 4E-BP1 [77, 78] to launch protein synthesis. Further prominent effector functions to which PI3K/AKT signaling relates, either in direct or indirect terms, comprise cell cycle progression, (anti)-apoptosis, and more [67]. Accordingly, mechanisms that overactivate the PI3K/AKT axis or impair its inactivation (such as mutations in PIK3CA and/or PTEN genes, for example) range amongst the most prominent molecular aberrations in cancer and resemble mutational hot spots for resistance development [79, 80].

Interestingly, cancers of immanent high SOX2 expression often show intrinsically high mutations rates in PI3KCA or PTEN genes as well [50, 64], thus suggesting functional connectivity between PI3K/AKT signaling and stemness-inductory roles of SOX2.

Intertwined relationships between PI3K/AKT signaling and SOX2 in stemness

Cell growth and cell proliferation are two phenotypically entangled but mechanistically rather distinct processes. While growth largely depends on PI3K/AKT/mTORC1-imposed protein synthesis (see above), proliferation is a principle reflection of cell-cycle progression and thus cyclin kinase activities [81, 82]. Although these processes are easily experimentally uncoupled in vitro (e.g., with the mTORC1 selective inhibitor rapamycin or else with cyclin kinase inhibitors), such distinction is much harder to make in vivo. Classical examples however comprise arborization (i.e., the outgrowth of dendritic and axonal connections amongst central nervous cells in early postnatal development) [83], which represents an extreme case of cell mass enrichment (growth) at near persistent cellularity. By contrast, the initial stages of embryogenesis comprise a rapid succession of cell division events that occur essentially without size increase (i.e., the diameter of oocyte and blastula is near invariant) [84]. Accordingly, in cells that constitute the morula or the inner cell mass of the blastula (i.e., ESCs), protein synthesis evidently is attenuated to a degree that neither supports mass enrichment (growth) nor differentiation. These adaptations appear conserved in resting tissue stem cells, where tightly constrained protein synthesis is likewise required to suppress differentiation and thus sustain stemness. In line with these notions, somatic stem cells are generally smaller (with size being an indicator of protein synthesis) and show lower protein formation rates than surrounding tissue cells [85, 86]. Noteworthy, while the PI3K/AKT/mTORC1 axis clearly relates to protein formation, also SOX2 has been implicated in these processes, as a SOX2-imposed translation factor, eIF2A, initiates protein synthesis from unconventional start sites within the normally untranslated 5′-UTR of proto-oncogenes [87].

As growth and differentiation become inevitable, e.g., in the developing embryo or injury settings, protein synthesis must be enforced amongst stem cells or their derived progeny. Consistently, PI3K activity has been further associated also with cellular differentiation (e.g., the conversion between luminal and basal layers of the mammary epithelium) [88]. In cancer finally, such growth regulatory mechanisms may be largely uncoupled and overruled by mutations inducing and sustaining clonal expansion (e.g., involving aberrant PI3K/AKT/mTORC1 signaling). The PI3K/AKT pathway has thus been long recognized as potential target for therapeutic intervention in cancer [80, 89], and very recently, a combination of the PI3K-alpha selective inhibitor BYL719 (alpelisib, Pigray®) with the estrogen receptor antagonist Fulvestrant® was approved for the treatment of advanced mammary carcinoma, as this treatment nearly doubled the median progression-free survival in PIK3CA-positive HR+/HER2+ breast cancer patients (SOLAR-1 trial, FDA approval: May 24th, 2019).

While these recent developments underscore the therapeutic potential of PI3K/AKT inhibition in general, the particular significance of SOX2+ CSCs as a cellular reservoir of disease relapse has not been therapeutically exploited thus far. This is remarkable, as several lines of experimental evidence gathered in healthy and transformed cell matter, clearly underscore an interdependence between PI3K/AKT signaling on the one hand and SOX2-imposed (cancer) stem cell characteristics on the other: Mutations in PTEN for example support the outgrowth of healthy and malignant mammary epithelium derived stem cells, which can be antagonized by the PI3K/AKT cross-reactive inhibitor perifosine [90]. Effectively separating these effects, either inhibition of PI3 kinase or knock-down of downstream AKT1 were later shown to impair the survival of tumor initiating supposed breast carcinoma stem cells [91].

An immediate molecular connectivity between Akt and Sox2 however was first established in mouse where Akt-imposed phosphorylation supports Sox2 driven transcription amongst ESCs [92] and functionally cooperates with Oct4, Klf4, and c-Myc in the induction of iPSCs [93]. These observations were further substantialized in man, where AKT-driven phosphorylation was shown to sustain SOX2 protein stability, nuclear localization, and thereby regulate in vitro clonogenicity and in vivo tumorigenicity of breast carcinoma cells [52]. Similar results were reported for nasopharyngeal cancer cells, albeit with a less well defined AKT-inhibitory reagent, DC120, that blocks AKT in its Thr308 and Ser473 phosphorylated state [94]. Further validating AKT as a conserved upstream regulator of SOX2 expression, AKT pathway inhibition or siRNA mediated knock-down reduced SOX2 levels also in esophageal squamous cells [95, 96]. While one report proposes nitrosylation of AKT as a regulatory principle in SOX2 expression [95], the other describes ubiquitin-ligase UBR5 as a mediator of SOX2 protein turnover in absence of AKT kinase activity [96].

The dependencies between PI3K/AKT and SOX2 however are reciprocal, as probably best exemplified in glioblastoma, where PI3K/AKT-imposed phosphorylation sustains nuclear SOX2 expression as a driving force of tumor cell dissemination, while in turn SOX2 supports PI3KCA gene expression (which actually locates in near juxtaposition of the SOX2 locus on chromosome band 3q26-28) and subsequent downstream activation of AKT [50]. Similarly, SOX2 and KLF4 jointly drive PIK3CA expression in nasopharyngeal carcinoma, thus enhancing PI3K/AKT activity and tumorigenesis, while overexpression of PI3KCA rescues growth inhibitory effects imposed by SOX2 knock-down [18]. In Ewing’s sarcoma finally, the disease characteristic EWS1/FLT1 gene fusion has been identified as an inductor of SOX2 expression, which functionally correlates with enforced cell proliferation, anti-apoptosis, and enhanced PI3K/AKT signaling [97].

Interestingly, an interconnectivity between SOX2 and PI3K/AKT is also evident in reprogramming settings, despite the fact that iPS cell induction involves artificial overexpression of exogenously introduced SOX2/Sox2. While enforced SOX2 expression coincides with elevated AKT activity during reprogramming [98], PI3K/AKT signaling provides critical contributions to iPS cell survival also thereafter [99]. The endogenous Sox2 locus however seems to be the target of regulatory feedback inhibition, as in reprogramming settings AKT1 displaces gene activatory FoxO1 from Sox2, whereas FoxO1 is re-activated and endogenous Sox2 expression restored upon inhibition of AKT [98].

These various interdependencies between PI3K/AKT and SOX2, detected across a spectrum of different malignant and healthy (stem) cell types, have led to the notion of an independent PI3K/AKT/SOX2 signaling branch that diverges from the canonical PI3K/AKT/mTORC1 axis (see refs. [100, 101] and Fig. 2 for schematic illustration). This distinction is underscored by experimental evidence indicating that various AKT or PI3K inhibitors, but not the mTORC1 selective inhibitor rapamycin, affect SOX2 protein homeostasis in several tumor entities [50, 52]. However, inhibition of mTORC1 has also been described to attenuate SOX2 and further downstream SOX9 expression in glioma [102], and both PI3K inhibitors and rapamycin reported to suppress SOX2-driven tumorigenicity from esophageal squamous cells in a murine xenografts [103]. These discrepancies might be explained by the fact that the PI3K/AKT signaling pathway involves various feed-forward and feed-backward mechanisms, so that rapamycin may indeed regulate SOX2, albeit in indirect terms.

Fig. 2
figure2

The PI3K/AKT/SOX2 signaling axis is a conserved module in stemness, reprogramming, and cancer. The PI3K/AKT pathway converges extracellular stimuli conferred via respective receptors on the plasma membrane into downstream effector functions within the cell. Classical PI3K/AKT mediated phenotypes involve adapted sugar metabolism in response to insulin receptor stimulation (maroon) and aggravated protein synthesis in response to amino acid availability, as sensed by e.g., the glutamate receptor and mediated by the mTORC1 complex (black). Noteworthy, both PI3 and AKT kinases relate to various further cell biological features amongst which cell cycle control, (anti)-apoptosis, DNA damage response, nitric oxide signaling, and others (not depicted), while yet another functionality characteristic of SOX2 positive cells has been identified in the PI3K/AKT/SOX2 axis (blue). This signaling branch critically determines SOX2 protein stability, nuclear localization, and DNA regulation, and thus functionally relates to sustained pluripotency amongst naturally occurring embryonic stem cells (ESCs), the conversion of terminally differentiated body cells into induced pluripotent stem cells (iPSCs), and furthermore critically contributes to cancer initiation, progression, and relapse. SOX2 has been identified as a direct target of AKT-imposed phosphorylation, and a functional dependence on PI3K/AKT signaling experimentally validated involving various inhibitory reagents (green) directed against either AKT or further upstream PI3K. By contrast, inhibition of downstream mTORC1 (rapamycin, red) does not immediately affect SOX2 protein characteristics. However, the PIK3/AKT pathway involves various feed-back and feed-forward regulatory loops, amongst which SOX2 imposed transcriptional auto-stimulation and a reciprocal induction of PI3KCA gene expression (dotted blue lines)

Putting these observations in an even greater context, a mass spectrometry based systematic phosphorylation site analysis identified several AKT-imposed modifications not only in SOX2, but also in other pluripotency inducing transcription factors [104]. Intriguingly, many of these sites superimpose with DNA-binding elements or sequence stretches functionally linked to nucleo-cytoplasmatic transport (NLS, NES), thus suggesting AKT-imposed phosphor-modification as a shared regulatory principle of subcellular distribution and chromatin binding amongst pluripotency inducing TFs [104, 105]. Further supporting these notions, a glycoproteomic analysis performed to identify surface molecules associating with pluripotency, retrieved leucine-rich repeat neuronal progenitor 1 (LRRN1) as a factor critically regulating SOX2, OCT4, and NANOG protein localization and turnover in hESCs [106]. Intriguingly, shRNA-mediated knock-down of LRRN1 associated with impaired AKT activity and thereby nuclear depletion and proteasomal degradation of SOX2, OCT4, and NANOG [106]. While these findings underscore AKT-imposed phosphorylation as a common regulatory determinant of pluripotency TFs, they moreover indicate that AKT-dependent regulation of SOX2 protein turnover is a conserved functional feature shared also with healthy (ES) cells.

While the molecular regulation of most other SOX proteins remains poorly defined, further connectivity to the PI3K/AKT axis has been proposed for SOX4. In mammary carcinoma, elevated SOX4 expression promotes PI3K/AKT signaling [107], while glioblastoma cell cycle progression involves SOX4-dependent modulation of AKT and p53 [108]. Interestingly, SOX4 and SOX2 proteins only distantly relate within the SOX family [3], but the two proteins show nearly overlapping occurrence in human cancers [48]. In mammary carcinoma and glioblastoma stem cells specifically SOX4 has been implicated as a regulator of SOX2 [109, 110], suggesting that the observed connectivity of this factor with PI3K/AKT signaling may also reflect an indirect regulatory circuitry further involving SOX2.

Taken together, various lines of evidence elaborated in mouse models, human cancer cells, reprogramming settings, and clinical datasets, exemplify a functional cooperativity of SOX2 with the PI3K/AKT signaling pathway. Although divergent in detail, two recurrent features are noted: (i) nuclear localization and protein stabilization of SOX2 depend on AKT kinase activity, and (ii) transcription of PIK3CA, the gene encoding the catalytic subunit of PI3Kα, is re-enforced by SOX2.

SOX2 secondary modifications

Throughout the past years, various examples of SOX2/Sox2 protein modifications have been identified, but not comprehensively summarized in standard protein data depositories such as UniProt/Swiss-Prot/ExPASy (status June 2019). Protein biochemical modifications reported for SOX2/Sox2 comprise phosphorylation, acetylation, methylation, ubiquitination, SUMOylation, PARPylation, and O-GlcNAcylation events that are successively described in the subsequent sections, systematically listed in Table 1, and their relative localization and functional implications schematically illustrated in Fig. 3.

Table 1 Compendium of the secondary modifications imposed onto SOX2/Sox2
Fig. 3
figure3

SOX2/Sox2 is a molecular target of various secondary modifications. Human (top) and murine (bottom) SOX2/Sox2 protein sequences contain a N-terminal DNA-binding domain (high mobility group, HMG, blue), a C-terminal trans-activation domain (TAD, green), and (due to a slightly deviant glycine repeat near the proteins’ start) differ in length by two amino acids. Note that the TAD is a functional concept, not a sequence- or structure-based domain signature, so that segments of variable length have been annotated as such, with longest forms commencing right after the HMG. Conserved functional elements are indicated: nuclear export signal (NES), nuclear localization signal (NLS, red), AKT-recognition consensus (AKT), and phosphorylation-dependent SUMOylation motif (PDSM). Note that the NLS (red) comprises two separate stretches, of which the latter superimposes with the AKT consensus motif near the C-terminal end of the HMG domain. NES, NLS/AKT, and PDMS sequences are further detailed below with indication of the particular chemical modifications identified thereat: phosphorylation (P); acetylation (Ac); methylation (Me); ubiquitination (UBI); SUMOylation (SUMO); O-GlcNAcylation (Glc). Wherever known, corresponding functional manifestations are indicated (arrows). Note that further putative acylation sites (i.e., residues K37, K60, K67, K89, K97, K105, K111, and K123) have been proposed for Sox2, but not functionally validated thus far. Moreover, SOX2 is a target of PARPylation, while the specific site of this modification remains to be determined

SOX2 protein phosphorylation

Of all secondary modification assigned to SOX2, phosphorylation is by far best documented. This modification primarily concerns residue Thr116 (alias Thr118 in mouse; note that the murine protein is two residues longer than its human orthologue) that is part of an AKT recognition motif (RPRRX-S/T). This sequence stretch superimposes with the nuclear-localization sequence (NLS) near the C-terminal end of the HMG-domain [11], suggesting that phosphorylation of Thr116 may modulate SOX2 subcellular distribution. Interestingly, the RPRRX-T signature is stringently conserved only amongst SoxB1 proteins (i.e., Sox2 and its closest paralogues Sox1 and Sox3) [3]. In other Sox proteins by contrast the motif is either functionally compromised by loss of a hydroxy acceptor chain (SoxF), structurally distorted by insertion of proline residues (SoxB2, SoxG, and SoxH classes), charge modified by arginine to glutamine replacement (SoxE), or further derivatized by exchange of the acceptor threonine for serine (Sox12 and Sox15) thus modifying the steric context of the phosphorylation site.

Underscoring the concept of AKT-imposed SOX2 modification, a physical interaction between AKT/Akt and SOX2/Sox2 proteins was independently demonstrated in murine and human cells [52, 93]. However, while phosphorylation of mouse Thr118 sustains Sox2 driven transcription in ES and iPS cells [93], alanine replacement analysis revealed that Thr116 is not stringently required for the nuclear localization of human SOX2 [52]. What is more, also a mass spectrometry based systematic phosphorylation site analysis involving heterologous expression of SOX2 and AKT in Sf9 cells, did not detect AKT imposed phosphorylation at residue Thr116 [104]. Instead, it uncovered a further, hitherto unrecognized AKT-imposed phosphorylation at Ser83, the functional relevance of which however remains to be determined.

In sum, AKT activity unquestionably sustains SOX2 nuclear localization, DNA binding, and prevents SOX2 from proteasomal decay in the cytosol [52, 96]. However, while a phosphorylation of the indicated residues has been repeatedly linked to AKT (see above), it remains a matter of debate, if AKT imposed phosphorylation indeed regulates SOX2 nuclear import (i.e., the association with a nuclear import adapter or transporter) and if this functionality alone has a decisive significance in these regards. Noteworthy, various further modifications have been associated with the nucleo-cytoplasmatic shuttling of SOX2, and residue Thr116 furthermore linked to SOX2 protein turnover.

Phosphorylation-induced SUMOylation of SOX2

Additional phosphorylation sites described for SOX2 comprise the residues Ser249, Ser250, and Ser251 originally identified in a phospho-proteomic analysis of human ESCs [111]. Of these, Ser251 was reconfirmed in a subsequent independent phospho-proteomic analysis in the same cell type [112]. Proposed as being functionally redundant [111], all three phosphorylation sites relate to nearby Lys245/247 that is a target of protein SUMOylation [113]. While a conservative amino acid replacement at this position (i.e., K247R) has no evident effect on Sox2 protein stability or localization, SUMOylation at Lys247 coincides with reduced Sox2 binding to an Fgf4 enhancer element in the context of Oct3/4 co-expression, so that a transcription modulatory significantly was proposed for this residue modification [113].

Documenting functional connectivity of phosphorylation and nearby SUMOylation events in this segment, a SOX2(S249-251D) mutant mimicking constitutive phosphorylation at the indicated residues strongly induced SUMOylation of SOX2, whereas a phosphorylation incompetent SOX2(S249-251A) mutant impairs this modification. SUMOylation was fully obstructed though only in a SOX2(K245A) mutant, thus reconfirming Lys245 as the target site for SOX2 SUMOylation [111]. These data jointly define a phosphorylation-dependent SUMOylation motif (PDSM, ΨKxExxSP) in SOX2, as previously described also for SOX3, SOX8, and SOX9 [114], albeit in slightly different forms (i.e., SOX2 contains an additional amino acid insertion between SUMOylation and phosphorylation sites, so that the motif was initially overseen).

Noteworthy, although AKT is a stereotype Ser/Thr kinase and the PI3K/AKT pathway indeed activated in ES cells [111], a functional involvement of AKT in PDSM phosphorylation appears unlikely as no corresponding signature peptides were identified in an AKT/SOX2 co-induction MS analysis [104] and none of the above described mutants shown to impair nuclear localization [111, 113]. While a cognate kinase thus remains to be defined, the phosphorylation status of Ser249, Ser250, and Ser251 did not significantly change upon addition of differentiation-inducing BMP (bone morphogenic protein) to hESCs, suggesting that the PDSM mechanism—although a modulator of transcription [113]—may be overruled by even more dominant regulatory elements [111].

Acetylation of the SOX2 NES motif

As p300/CBP-imposed protein acetylation and HDAC3-related deacetylation had been implicated in the nucleo-cytoplasmatic distribution of SRY [115], also the Sox2 protein was scanned for acetylation events in a two-armed MS-based approach [12]. This analysis uncovered p300/CBP-imposed in vitro acetylation on various Sox2 sites including lysine residues K37, K60, K67, K75, K89, K97, K105, K111, K117, K119, and K123 that all match the HMG domain except for K123. While not all of these modifications may occur in vivo—even more unlikely in stringent coupling—residue K75 was noted to fall into the nuclear export sequence (NES) of Sox2, and the functional relevance of this site thus further examined. The Sox2 NES motif was identified on basis of sequence homology to Sry and Sox9 [12], where a nuclear export activity had first been identified and functionally validated by impaired sexual differentiation in mice expressing NES-mutant alleles [116]. Interestingly, just as the NLS motif, also the NES sequence locates to the HMG domain and is phylogenetically conserved between SRY and SOX/Sox proteins [12]. Confirming functional relevance, an interaction with the nuclear export receptor, Crm1, is impaired in NES mutant forms of Sox2 thus causing their subcellular mislocalization [12]. Finally connecting these findings to acetylation events, K75A (acetylation null) and K75Q (acetylation mimicry) alleles were cloned and found to modulate Crm1-docking and Sox2 nucleo-cytoplasmatic distribution, indeed [12]. Moreover, deacetylating reagents retain Sox2 in the nucleus and sustain target gene expression, whereas elevated acetylation enforces nuclear export resulting in increased Sox2 protein decay [12].

Methylation and methylation-imposed ubiquitination of SOX2

A physical association of Sox2 with co-activator associated arginine methyltransferase 1 (CARM1) has been reported for murine embryonal carcinoma P19 cells, and CARM1-imposed Sox2 methylation at residue Arg113 further investigated in breast carcinoma MCF7 cells [117]. A methylation at this site was proposed to enforce Sox2 homo-dimerization and functionally linked to the expression of Sox2 target genes, while chromatin binding blocks methylation of arginine 113 [117].

Moreover, a phosphorylation/methylation switch has been proposed to modulate Sox2 protein stability in murine ESC maintenance and differentiation, and two E3-ligases, WWP2 and UBR5 implicated in the ubiquitination-dependent decay of Sox2/SOX2, albeit in a species- and tissue-selective manner [92, 96]. Jointly, these reports confirm AKT/Akt as a driving force of SOX2/Sox2 nuclear localization and cell-fate determination, and cytosolic proteasomal degradation of SOX2/Sox2 in absence of AKT/Akt activity. While these notions were first elaborated in murine ESCs [93] and human cancer cells (e.g., breast and esophageal carcinoma cell lines) [52, 94], Wong and co-workers embed these findings into following intriguing regulatory concept: The aforementioned Thr116/118 phosphorylation site is flanked by two lysine residues (one on either side, see Fig. 3 for schematic illustration). The C-proximal of these residues (Lys119) is subject to Set7 imposed protein methylation that fosters WWP2-driven ubiquitination and cytosolic proteasomal decay of Sox2. These processes emerge during ESC differentiation, but compete with stemness sustaining Akt1-driven phosphorylation of neighboring Thr118 [92]. While the exact ubiquitination site was initially not resolved, it may have been identified in a more recent publication by Wong and co-workers in human esophageal cancer cells [96]. Here, the authors describe a competition between AKT1 imposed phosphorylation on Thr116 (the human paralogue of murine Thr118) and UBR5-driven ubiquitination of its N-proximal Lys115. While both lines of evidence have been elaborated in different species and tissue/cell backgrounds, they nevertheless merge in proposing residue Thr116/118 as the central mediator of SOX2/Sox2 protein stability and decay.

According to this model, AKT imposed phosphorylation of Thr116/118 stabilizes SOX2/Sox2 by prevention of its ubiquitination, while AKT-enforced nuclear accumulation of SOX2/Sox2 may be a secondary effect. This distinction may indeed help resolve the uncertainties associated with this site (see paragraph: Phosphorylation of SOX2). Moreover, while K115 may be a target of ubiquitination in human SOX2, this very residue has also been associated with acetylation in murine ESCs (i.e., K117ac), similarly to nearby K119 that has been proposed as a target of Set7 imposed methylation [12, 92]. These findings suggest that acetylation, methylation, phosphorylation, and ubiquitination events imposed onto or nearby the AKT recognition consensus (RPRRK-T116/118) in fact compete with each other.

PARPylation of SOX2

Sox2 has furthermore been proposed as a target of PARP1 imposed poly(ADP)ribosylation (so-called PARPylation) [118]. In murine ESCs, a competitive association of either PARP1 or Sox2 with the enhancer element of FGF4, a gene classically associated with Sox2/Oct4 mediated expression regulation, was described. Furthermore, poly(ADP)ribosylation of Sox2—but not of Oct4—was indicated to causally contribute to its molecular displacement from the enhancer [118]. Although a specific poly(ADP)ribosylation site was not identified, a physical association of SOX2 and PARP1 has been independently confirmed in murine ESCs and human glioma cells [119, 120]. More recently, PARP1 was proposed to assist Sox2 in its target site recognition within densely compacted nucleosomal DNA [121]. These genetically silenced regions are accessible only to so-called pioneer transcription factors, amongst which SOX2 [122], but this cooperativity occurs independent of ADP-ribosylation activity [121].

Accordingly, while a connectivity of SOX2 and PARP1 has been repeatedly observed and appears highly plausible in the context of DNA-activity modulation, and while PARP proteins clearly can transfer poly(ADP) moieties onto neighboring factors, the regulatory significance of SOX2 PARPylation awaits more comprehensive examination.

O-Glycosylation of SOX2

Finally, SOX2/Sox2 is subject to protein O-glycosylation (O-GlcNAcylation, more specifically) at residue Ser246/248 as shown in the human pancreatic cancer cell line S2VP10, as well as in murine ESCs and iPSCs [119, 123]. Glycosylation at this site decreased with differentiation [123] along with modulation of PARP1 binding [119]. The enzyme imposing this secondary modification has been identified in O-GlcNAc transferase and an inhibition thereof proposed as a novel mode of therapeutic intervention in pancreatic cancer, where small inhibitory compounds of the glycosyl transferase machinery (e.g., OSMI) were shown to suppress tumorigenesis in xenograft models [113]. As with the aforementioned PI3K/AKT imposed phosphor-regulation, protein O-glycosylation also likely represents a recurring molecular theme amongst pluripotency inducing TFs since related modifications were identified in Oct4 and further potential O-GlcNAcylation sites proposed for Sox2 (i.e., Thr258 and Ser259) [123].

Concluding remarks and perspectives

Taken together, research of the past 25 years near univocally describes SOX2 as a transcriptional modulator that—in association with co-factors—imposes cell-fate determining expression patterns. The molecular understanding of SOX2 nevertheless contains various aspect of effective uncertainty. This involves amongst others how target-site specificity is achieved, relations to the SOX2 overlapping transcript SOX2OT, and finally also, the protein-biochemical fine tuning of SOX2 itself.

Indeed, various secondary modifications have been described for SOX2 (involving phosphorylation, methylation, acetylation, ubiquitination, SUMOylation, PARPylation, and O-GlcNAcylation), but their relevance remains largely underexplored. Identified either in murine or human protein, healthy or transformed cells, native or induced settings, and further obstructed by a somewhat unfortunate residue number shift that distinguishes the otherwise near-identical human and murine orthologues, the various protein-biochemical modifications imposed onto SOX2 were not comprehensively cataloged before.

Our review fills this gap and—as a result of synoptic evaluation—comes to two main conclusions (Fig. 3). Firstly, chemical modification sites described for SOX2 do not randomly scatter throughout the protein’s primary sequence, but cluster in particularly conserved stretches of regulatory significance (i.e., nuclear export sequence (NES), nuclear localization sequence (NLS), and a phosphorylation-dependent SUMOylation motif (PDSM)). Secondly, despite their heterogeneous chemical nature, near all secondary modifications imposed onto SOX2 relate to principally just three functional aspects: DNA-binding, nucleo-cytoplasmatic transport, and protein turnover, which are at the very basis of SOX2 imposed phenotypes. In ignorance of these underlying mechanisms, the biological significance of SOX2 may be described but remains mechanistically unresolved. Even more so as many further but less prominent secondary modifications (e.g., neddylation, urmylation, nitrosylation, or thiolation) have not even been investigated yet, but will complement these concepts with near certainty in the future.

Prospectively, the field would certainly benefit from stringent serial analyses assessing the prevalence of a given secondary modification across multiple cell types on an automated, high-throughput scale. Such endeavor may help assign significance scores to individual site change modifications, define conditions where this regulation is most relevant, and (through the identification of outliers) furthermore indicate compensatory mechanisms. This knowledge could add an important perspective to personalized treatment strategies in patients with SOX2 positive cancers.

Finally, a comprehensive molecular understanding of SOX2 would require a wholistic structural model to enable interpretations of individual covalent side chain modifications in 3-dimensional context, so that steric effects can be assessed and resolved in their functional consequence. Such structural information may guide rational-based drug design against SOX2-driven cancer or improve reprogramming efficacy through molecular fine-tuning of SOX2.

References

  1. 1.

    Schepers GE, Teasdale RD, Koopman P. Twenty pairs of sox: extent, homology, and nomenclature of the mouse and human sox transcription factor gene families. Dev Cell. 2002;3:167–70.

  2. 2.

    Bowles J, Schepers G, Koopman P. Phylogeny of the SOX family of developmental transcription factors based on sequence and structural indicators. Dev Biol. 2000;227:239–55.

  3. 3.

    Kamachi Y, Kondoh H. Sox proteins: regulators of cell fate specification and differentiation. Development. 2013;140:4129–44.

  4. 4.

    Guth SI, Wegner M. Having it both ways: Sox protein function between conservation and innovation. Cell Mol Life Sci. 2008;65:3000–18.

  5. 5.

    Nakagawa M, Koyanagi M, Tanabe K, Takahashi K, Ichisaka T, Aoi T, et al. Generation of induced pluripotent stem cells without Myc from mouse and human fibroblasts. Nat Biotechnol. 2008;26:101–6.

  6. 6.

    Nowling TK, Johnson LR, Wiebe MS, Rizzino A. Identification of the transactivation domain of the transcription factor Sox-2 and an associated co-activator. J Biol Chem. 2000;275:3810–8.

  7. 7.

    Kamachi Y, Uchikawa M, Kondoh H. Pairing SOX off: with partners in the regulation of embryonic development. Trends Genet. 2000;16:182–7.

  8. 8.

    Masui S, Nakatake Y, Toyooka Y, Shimosato D, Yagi R, Takahashi K, et al. Pluripotency governed by Sox2 via regulation of Oct3/4 expression in mouse embryonic stem cells. Nat Cell Biol. 2007;9:625–35.

  9. 9.

    Hori N, Yamane M, Kouno K, Sato K. Induction of DNA demethylation depending on two sets of Sox2 and adjacent Oct3/4 binding sites (Sox-Oct motifs) within the mouse H19/insulin-like growth factor 2 (Igf2) imprinted control region. J Biol Chem. 2012;287:44006–16.

  10. 10.

    Kamachi Y, Uchikawa M, Tanouchi A, Sekido R, Kondoh H. Pax6 and SOX2 form a co-DNA-binding partner complex that regulates initiation of lens development. Genes Dev. 2001;15:1272–86.

  11. 11.

    Li J, Pan G, Cui K, Liu Y, Xu S, Pei D. A dominant-negative form of mouse SOX2 induces trophectoderm differentiation and progressive polyploidy in mouse embryonic stem cells. J Biol Chem. 2007;282:19481–92.

  12. 12.

    Baltus GA, Kowalski MP, Zhai H, Tutter AV, Quinn D, Wall D, et al. Acetylation of sox2 induces its nuclear export in embryonic stem cells. Stem Cells. 2009;27:2175–84.

  13. 13.

    Hou L, Srivastava Y, Jauch R. Molecular basis for the genome engagement by Sox proteins. Semin Cell Dev Biol. 2017;63:2–12.

  14. 14.

    Yesudhas D, Anwar MA, Panneerselvam S, Kim HK, Choi S. Evaluation of Sox2 binding affinities for distinct DNA patterns using steered molecular dynamics simulation. FEBS Open Bio. 2017;7:1750–67.

  15. 15.

    Wilson M, Koopman P. Matching SOX: partner proteins and co-factors of the SOX family of transcriptional regulators. Curr Opin Genet Dev. 2002;12:441–6.

  16. 16.

    Wissmuller S, Kosian T, Wolf M, Finzsch M, Wegner M. The high-mobility-group domain of Sox proteins interacts with DNA-binding domains of many transcription factors. Nucleic Acids Res. 2006;34:1735–44.

  17. 17.

    Fang X, Yoon JG, Li L, Yu W, Shao J, Hua D, et al. The SOX2 response program in glioblastoma multiforme: an integrated ChIP-seq, expression microarray, and microRNA analysis. BMC Genom. 2011;12:11.

  18. 18.

    Tang J, Zhong G, Wu J, Chen H, Jia Y. SOX2 recruits KLF4 to regulate nasopharyngeal carcinoma proliferation via PI3K/AKT signaling. Oncogenesis. 2018;7:61.

  19. 19.

    Boyer LA, Lee TI, Cole MF, Johnstone SE, Levine SS, Zucker JP, et al. Core transcriptional regulatory circuitry in human embryonic stem cells. Cell. 2005;122:947–56.

  20. 20.

    Knauss JL, Miao N, Kim SN, Nie Y, Shi Y, Wu T, et al. Long noncoding RNA Sox2ot and transcription factor YY1 co-regulate the differentiation of cortical neural progenitors by repressing Sox2. Cell Death Dis. 2018;9:799.

  21. 21.

    Takahashi K, Yamanaka S. Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors. Cell. 2006;126:663–76.

  22. 22.

    Takahashi K, Tanabe K, Ohnuki M, Narita M, Ichisaka T, Tomoda K, et al. Induction of pluripotent stem cells from adult human fibroblasts by defined factors. Cell. 2007;131:861–72.

  23. 23.

    Mao J, Zhang Q, Ye X, Liu K, Liu L. Efficient induction of pluripotent stem cells from granulosa cells by Oct4 and Sox2. Stem Cells Dev. 2014;23:779–89.

  24. 24.

    Feng R, Wen J. Overview of the roles of Sox2 in stem cell and development. Biol Chem. 2015;396:883–91.

  25. 25.

    Avilion AA, Nicolis SK, Pevny LH, Perez L, Vivian N, Lovell-Badge R. Multipotent cell lineages in early mouse development depend on SOX2 function. Genes Dev. 2003;17:126–40.

  26. 26.

    Kopp JL, Ormsbee BD, Desler M, Rizzino A. Small increases in the level of Sox2 trigger the differentiation of mouse embryonic stem cells. Stem Cells. 2008;26:903–11.

  27. 27.

    Zhao S, Nichols J, Smith AG, Li M. SoxB transcription factors specify neuroectodermal lineage choice in ES cells. Mol Cell Neurosci. 2004;27:332–42.

  28. 28.

    Que J, Okubo T, Goldenring JR, Nam KT, Kurotani R, Morrisey EE, et al. Multiple dose-dependent roles for Sox2 in the patterning and differentiation of anterior foregut endoderm. Development. 2007;134:2521–31.

  29. 29.

    Driskell RR, Giangreco A, Jensen KB, Mulder KW, Watt FM. Sox2-positive dermal papilla cells specify hair follicle type in mammalian epidermis. Development. 2009;136:2815–23.

  30. 30.

    Fernandes KJ, McKenzie IA, Mill P, Smith KM, Akhavan M, Barnabe-Heider F, et al. A dermal niche for multipotent adult skin-derived precursor cells. Nat Cell Biol. 2004;6:1082–93.

  31. 31.

    Biernaskie J, Paris M, Morozova O, Fagan BM, Marra M, Pevny L, et al. SKPs derive from hair follicle precursors and exhibit properties of adult dermal stem cells. Cell Stem Cell. 2009;5:610–23.

  32. 32.

    Juuri E, Jussila M, Seidel K, Holmes S, Wu P, Richman J, et al. Sox2 marks epithelial competence to generate teeth in mammals and reptiles. Development. 2013;140:1424–32.

  33. 33.

    Matsushima D, Heavner W, Pevny LH. Combinatorial regulation of optic cup progenitor cell fate by SOX2 and PAX6. Development. 2011;138:443–54.

  34. 34.

    Taranova OV, Magness ST, Fagan BM, Wu Y, Surzenko N, Hutton SR, et al. SOX2 is a dose-dependent regulator of retinal neural progenitor competence. Genes Dev. 2006;20:1187–202.

  35. 35.

    Williamson KA, Hever AM, Rainger J, Rogers RC, Magee A, Fiedler Z, et al. Mutations in SOX2 cause anophthalmia-esophageal-genital (AEG) syndrome. Hum Mol Genet. 2006;15:1413–22.

  36. 36.

    Schneider A, Bardakjian T, Reis LM, Tyler RC, Semina EV. Novel SOX2 mutations and genotype-phenotype correlation in anophthalmia and microphthalmia. Am J Med Genet A. 2009;149A:2706–15.

  37. 37.

    Arnold K, Sarkar A, Yram MA, Polo JM, Bronson R, Sengupta S, et al. Sox2(+) adult stem and progenitor cells are important for tissue regeneration and survival of mice. Cell Stem Cell. 2011;9:317–29.

  38. 38.

    Bass AJ, Watanabe H, Mermel CH, Yu S, Perner S, Verhaak RG, et al. SOX2 is an amplified lineage-survival oncogene in lung and esophageal squamous cell carcinomas. Nat Genet. 2009;41:1238–42.

  39. 39.

    Maier S, Wilbertz T, Braun M, Scheble V, Reischl M, Mikut R, et al. SOX2 amplification is a common event in squamous cell carcinomas of different organ sites. Hum Pathol. 2011;42:1078–88.

  40. 40.

    Pham DL, Scheble V, Bareiss P, Fischer A, Beschorner C, Adam A, et al. SOX2 expression and prognostic significance in ovarian carcinoma. Int J Gynecol Pathol. 2013;32:358–67.

  41. 41.

    Schrock A, Bode M, Goke FJ, Bareiss PM, Schairer R, Wang H, et al. Expression and role of the embryonic protein SOX2 in head and neck squamous cell carcinoma. Carcinogenesis. 2014;35:1636–42.

  42. 42.

    Schrock A, Goke F, Wagner P, Bode M, Franzen A, Braun M, et al. Sex determining region Y-box 2 (SOX2) amplification is an independent indicator of disease recurrence in sinonasal cancer. PLoS ONE. 2013;8:e59201.

  43. 43.

    Bareiss PM, Paczulla A, Wang H, Schairer R, Wiehr S, Kohlhofer U, et al. SOX2 expression associates with stem cell state in human ovarian carcinoma. Cancer Res. 2013;73:5544–55.

  44. 44.

    Boumahdi S, Driessens G, Lapouge G, Rorive S, Nassar D, Le Mercier M, et al. SOX2 controls tumour initiation and cancer stem-cell functions in squamous-cell carcinoma. Nature. 2014;511:246–50.

  45. 45.

    Lengerke C, Fehm T, Kurth R, Neubauer H, Scheble V, Muller F, et al. Expression of the embryonic stem cell marker SOX2 in early-stage breast carcinoma. BMC Cancer. 2011;11:42.

  46. 46.

    Leis O, Eguiara A, Lopez-Arribillaga E, Alberdi MJ, Hernandez-Garcia S, Elorriaga K, et al. Sox2 expression in breast tumours and activation in breast cancer stem cells. Oncogene. 2012;31:1354–65.

  47. 47.

    Wuebben EL, Rizzino A. The dark side of SOX2: cancer—a comprehensive overview. Oncotarget. 2017;8:44917–43.

  48. 48.

    Grimm D, Bauer J, Wise P, Kruger M, Simonsen U, Wehland M, et al. The role of SOX family members in solid tumours and metastasis. Semin Cancer Biol. 2019; pii: S1044-579X(18)30141-X. https://doi.org/10.1016/j.semcancer.2019.03.004.

  49. 49.

    Mansouri S, Nejad R, Karabork M, Ekinci C, Solaroglu I, Aldape KD, et al. Sox2: regulation of expression and contribution to brain tumors. CNS Oncol. 2016;5:159–73.

  50. 50.

    Schaefer T, Ramadoss A, Leu S, Tintignac L, Tostado C, Bink A, et al. Regulation of glioma cell invasion by 3q26 gene products PIK3CA, SOX2 and OPA1. Brain Pathol. 2019;29:336–50. https://doi.org/10.1111/bpa.12670.

  51. 51.

    Krol I, Castro-Giner F, Maurer M, Gkountela S, Szczerba BM, Scherrer R, et al. Detection of circulating tumour cell clusters in human glioblastoma. Br J Cancer. 2018;119:487–91.

  52. 52.

    Schaefer T, Wang H, Mir P, Konantz M, Pereboom TC, Paczulla AM, et al. Molecular and functional interactions between AKT and SOX2 in breast carcinoma. Oncotarget. 2015;6:43540–56.

  53. 53.

    Wahl GM, Spike BT. Cell state plasticity, stem cells, EMT, and the generation of intra-tumoral heterogeneity. NPJ Breast Cancer. 2017;3:14.

  54. 54.

    Cancer Genome Atlas Research N. Comprehensive genomic characterization of squamous cell lung cancers. Nature. 2012;489:519–25.

  55. 55.

    Carrasco-Garcia E, Santos JC, Garcia I, Brianti M, Garcia-Puga M, Pedrazzoli J Jr., et al. Paradoxical role of SOX2 in gastric cancer. Am J Cancer Res. 2016;6:701–13.

  56. 56.

    Luo J, Yan R, He X, He J. SOX2 inhibits cell proliferation and metastasis, promotes apoptotic by downregulating CCND1 and PARP in gastric cancer. Am J Transl Res. 2018;10:639–47.

  57. 57.

    Sarkar A, Huebner AJ, Sulahian R, Anselmo A, Xu X, Flattery K, et al. Sox2 suppresses gastric tumorigenesis in mice. Cell Rep. 2016;16:1929–41.

  58. 58.

    Wilbertz T, Wagner P, Petersen K, Stiedl AC, Scheble VJ, Maier S, et al. SOX2 gene amplification and protein overexpression are associated with better outcome in squamous cell lung cancer. Mod Pathol. 2011;24:944–53.

  59. 59.

    Velcheti V, Schalper K, Yao X, Cheng H, Kocoglu M, Dhodapkar K, et al. High SOX2 levels predict better outcome in non-small cell lung carcinomas. PLoS ONE. 2013;8:e61427.

  60. 60.

    Asonuma S, Imatani A, Asano N, Oikawa T, Konishi H, Iijima K, et al. Helicobacter pylori induces gastric mucosal intestinal metaplasia through the inhibition of interleukin-4-mediated HMG box protein Sox2 expression. Am J Physiol Gastrointest Liver Physiol. 2009;297:G312–322.

  61. 61.

    Matsuda K, Yamauchi K, Matsumoto T, Sano K, Yamaoka Y, Ota H. Quantitative analysis of the effect of Helicobacter pylori on the expressions of SOX2, CDX2, MUC2, MUC5AC, MUC6, TFF1, TFF2, and TFF3 mRNAs in human gastric carcinoma cells. Scand J Gastroenterol. 2008;43:25–33.

  62. 62.

    Wang S, Tie J, Wang R, Hu F, Gao L, Wang W, et al. SOX2, a predictor of survival in gastric cancer, inhibits cell proliferation and metastasis by regulating PTEN. Cancer Lett. 2015;358:210–9.

  63. 63.

    Otsubo T, Akiyama Y, Yanagihara K, Yuasa Y. SOX2 is frequently downregulated in gastric cancers and inhibits cell growth through cell-cycle arrest and apoptosis. Br J Cancer. 2008;98:824–31.

  64. 64.

    Kim BR, Van de Laar E, Cabanero M, Tarumi S, Hasenoeder S, Wang D, et al. SOX2 and PI3K cooperate to induce and stabilize a squamous-committed stem cell injury state during lung squamous cell carcinoma pathogenesis. PLoS Biol. 2016;14:e1002581.

  65. 65.

    Wang K, Ji W, Yu Y, Li Z, Niu X, Xia W, et al. FGFR1-ERK1/2-SOX2 axis promotes cell proliferation, epithelial-mesenchymal transition, and metastasis in FGFR1-amplified lung cancer. Oncogene. 2018;37:5340–54.

  66. 66.

    Gunawardena J. Systems biology. Biological systems theory. Science. 2010;328:581–2.

  67. 67.

    Manning BD, Toker A. AKT/PKB signaling: navigating the network. Cell. 2017;169:381–405.

  68. 68.

    Siddle K. Signalling by insulin and IGF receptors: supporting acts and new players. J Mol Endocrinol. 2011;47:R1–10.

  69. 69.

    Boucher J, Kleinridders A, Kahn CR Insulin receptor signaling in normal and insulin-resistant states. Cold Spring Harb Perspect Biol. 2014;6: pii: a009191. https://doi.org/10.1101/cshperspect.a009191.

  70. 70.

    Mackenzie RW, Elliott BT. Akt/PKB activation and insulin signaling: a novel insulin signaling pathway in the treatment of type 2 diabetes. Diabetes Metab Syndr Obes. 2014;7:55–64.

  71. 71.

    Liao XH, Majithia A, Huang X, Kimmel AR. Growth control via TOR kinase signaling, an intracellular sensor of amino acid and energy availability, with crosstalk potential to proline metabolism. Amino Acids. 2008;35:761–70.

  72. 72.

    Nicoletti F, Arcella A, Iacovelli L, Battaglia G, Giangaspero F, Melchiorri D. Metabotropic glutamate receptors: new targets for the control of tumor growth? Trends Pharm Sci. 2007;28:206–13.

  73. 73.

    Miinea CP, Sano H, Kane S, Sano E, Fukuda M, Peranen J, et al. AS160, the Akt substrate regulating GLUT4 translocation, has a functional Rab GTPase-activating protein domain. Biochem J. 2005;391:87–93.

  74. 74.

    Liu T, Yu B, Kakino M, Fujimoto H, Ando Y, Hakuno F, et al. A novel IRS-1-associated protein, DGKzeta regulates GLUT4 translocation in 3T3-L1 adipocytes. Sci Rep. 2016;6:35438.

  75. 75.

    Meyuhas O. Ribosomal protein S6 phosphorylation: Four decades of research. Int Rev Cell Mol Biol. 2015;320:41–73.

  76. 76.

    Ruvinsky I, Meyuhas O. Ribosomal protein S6 phosphorylation: from protein synthesis to cell size. Trends Biochem Sci. 2006;31:342–8.

  77. 77.

    Gingras AC, Kennedy SG, O’Leary MA, Sonenberg N, Hay N. 4E-BP1, a repressor of mRNA translation, is phosphorylated and inactivated by the Akt(PKB) signaling pathway. Genes Dev. 1998;12:502–13.

  78. 78.

    Josse L, Xie J, Proud CG, Smales CM. mTORC1 signalling and eIF4E/4E-BP1 translation initiation factor stoichiometry influence recombinant protein productivity from GS-CHOK1 cells. Biochem J. 2016;473:4651–64.

  79. 79.

    Martini M, De Santis MC, Braccini L, Gulluni F, Hirsch E. PI3K/AKT signaling pathway and cancer: an updated review. Ann Med. 2014;46:372–83.

  80. 80.

    Porta C, Paglino C, Mosca A. Targeting PI3K/Akt/mTOR signaling in cancer. Front Oncol. 2014;4:64.

  81. 81.

    van den Heuvel S, Harlow E. Distinct roles for cyclin-dependent kinases in cell cycle control. Science. 1993;262:2050–4.

  82. 82.

    Grana X, Reddy EP. Cell cycle control in mammalian cells: role of cyclins, cyclin dependent kinases (CDKs), growth suppressor genes and cyclin-dependent kinase inhibitors (CKIs). Oncogene. 1995;11:211–9.

  83. 83.

    Semple BD, Blomgren K, Gimlin K, Ferriero DM, Noble-Haeusslein LJ. Brain development in rodents and humans: Identifying benchmarks of maturation and vulnerability to injury across species. Prog Neurobiol. 2013;106-107:1–16.

  84. 84.

    Foulk RA. From fertilization to implantation. Early Pregnancy. 2001;5:61–62.

  85. 85.

    Signer RA, Magee JA, Salic A, Morrison SJ. Haematopoietic stem cells require a highly regulated protein synthesis rate. Nature. 2014;509:49–54.

  86. 86.

    Blanco S, Bandiera R, Popis M, Hussain S, Lombard P, Aleksic J, et al. Stem cell function and stress response are controlled by protein synthesis. Nature. 2016;534:335–40.

  87. 87.

    Sendoel A, Dunn JG, Rodriguez EH, Naik S, Gomez NC, Hurwitz B, et al. Translation from unconventional 5’ start sites drives tumour initiation. Nature. 2017;541:494–9.

  88. 88.

    Koren S, Reavie L, Couto JP, De Silva D, Stadler MB, Roloff T, et al. PIK3CA(H1047R) induces multipotency and multi-lineage mammary tumours. Nature. 2015;525:114–8.

  89. 89.

    Martelli AM, Evangelisti C, Follo MY, Ramazzotti G, Fini M, Giardino R, et al. Targeting the phosphatidylinositol 3-kinase/Akt/mammalian target of rapamycin signaling network in cancer stem cells. Curr Med Chem. 2011;18:2715–26.

  90. 90.

    Korkaya H, Paulson A, Charafe-Jauffret E, Ginestier C, Brown M, Dutcher J, et al. Regulation of mammary stem/progenitor cells by PTEN/Akt/beta-catenin signaling. PLoS Biol. 2009;7:e1000121.

  91. 91.

    Gargini R, Cerliani JP, Escoll M, Anton IM, Wandosell F. Cancer stem cell-like phenotype and survival are coordinately regulated by Akt/FoxO/Bim pathway. Stem Cells. 2015;33:646–60.

  92. 92.

    Fang L, Zhang L, Wei W, Jin X, Wang P, Tong Y, et al. A methylation-phosphorylation switch determines Sox2 stability and function in ESC maintenance or differentiation. Mol Cell. 2014;55:537–51.

  93. 93.

    Jeong CH, Cho YY, Kim MO, Kim SH, Cho EJ, Lee SY, et al. Phosphorylation of Sox2 cooperates in reprogramming to pluripotent stem cells. Stem Cells. 2010;28:2141–50.

  94. 94.

    Qin J, Ji J, Deng R, Tang J, Yang F, Feng GK, et al. DC120, a novel AKT inhibitor, preferentially suppresses nasopharyngeal carcinoma cancer stem-like cells by downregulating Sox2. Oncotarget. 2015;6:6944–58.

  95. 95.

    Asanuma K, Huo X, Agoston A, Zhang X, Yu C, Cheng E, et al. In oesophageal squamous cells, nitric oxide causes S-nitrosylation of Akt and blocks SOX2 (sex determining region Y-box 2) expression. Gut. 2016;65:1416–26.

  96. 96.

    Wang Z, Kang L, Zhang H, Huang Y, Fang L, Li M, et al. AKT drives SOX2 overexpression and cancer cell stemness in esophageal cancer by protecting SOX2 from UBR5-mediated degradation. Oncogene. 2019;38:5250–64.

  97. 97.

    Ren C, Ren T, Yang K, Wang S, Bao X, Zhang F, et al. Inhibition of SOX2 induces cell apoptosis and G1/S arrest in Ewing’s sarcoma through the PI3K/Akt pathway. J Exp Clin Cancer Res. 2016;35:44.

  98. 98.

    Ormsbee Golden BD, Wuebben EL, Rizzino A. Sox2 expression is regulated by a negative feedback loop in embryonic stem cells that involves AKT signaling and FoxO1. PLoS ONE. 2013;8:e76345.

  99. 99.

    Hossini AM, Quast AS, Plotz M, Grauel K, Exner T, Kuchler J, et al. PI3K/AKT signaling pathway is essential for survival of induced pluripotent stem cells. PLoS ONE. 2016;11:e0154770.

  100. 100.

    Jeong HC, Park SJ, Choi JJ, Go YH, Hong SK, Kwon OS, et al. PRMT8 controls the pluripotency and mesodermal fate of human embryonic stem cells by enhancing the PI3K/AKT/SOX2 axis. Stem Cells. 2017;35:2037–49.

  101. 101.

    Tripathi SC, Fahrmann JF, Celiktas M, Aguilar M, Marini KD, Jolly MK, et al. MCAM mediates chemoresistance in small-cell lung cancer via the PI3K/AKT/SOX2 signaling pathway. Cancer Res. 2017;77:4414–25.

  102. 102.

    Garros-Regulez L, Aldaz P, Arrizabalaga O, Moncho-Amor V, Carrasco-Garcia E, Manterola L, et al. mTOR inhibition decreases SOX2-SOX9 mediated glioma stem cell activity and temozolomide resistance. Exp Opin Ther Targets. 2016;20:393–405.

  103. 103.

    Gen Y, Yasui K, Nishikawa T, Yoshikawa T. SOX2 promotes tumor growth of esophageal squamous cell carcinoma through the AKT/mammalian target of rapamycin complex 1 signaling pathway. Cancer Sci. 2013;104:810–6.

  104. 104.

    Malak PN, Dannenmann B, Hirth A, Rothfuss OC, Schulze-Osthoff K. Novel AKT phosphorylation sites identified in the pluripotency factors OCT4, SOX2 and KLF4. Cell Cycle. 2015;14:3748–54.

  105. 105.

    Schaefer T, Lengerke C. AKT-driven phospho-patterns of pluripotency. Cell Cycle. 2015;14:3784–5.

  106. 106.

    Liao CH, Wang YH, Chang WW, Yang BC, Wu TJ, Liu WL, et al. Leucine-rich repeat neuronal protein 1 regulates differentiation of embryonic stem cells by post-translational modifications of pluripotency factors. Stem Cells. 2018;36:1514–24.

  107. 107.

    Mehta GA, Parker JS, Silva GO, Hoadley KA, Perou CM, Gatza ML. Amplification of SOX4 promotes PI3K/Akt signaling in human breast cancer. Breast Cancer Res Treat. 2017;162:439–50.

  108. 108.

    Zhang J, Jiang H, Shao J, Mao R, Liu J, Ma Y, et al. SOX4 inhibits GBM cell growth and induces G0/G1 cell cycle arrest through Akt-p53 axis. BMC Neurol. 2014;14:207.

  109. 109.

    Zhang J, Liang Q, Lei Y, Yao M, Li L, Gao X, et al. SOX4 induces epithelial-mesenchymal transition and contributes to breast cancer progression. Cancer Res. 2012;72:4597–608.

  110. 110.

    Ikushima H, Todo T, Ino Y, Takahashi M, Saito N, Miyazawa K, et al. Glioma-initiating cells retain their tumorigenicity through integration of the Sox axis and Oct4 protein. J Biol Chem. 2011;286:41434–41.

  111. 111.

    Van Hoof D, Munoz J, Braam SR, Pinkse MW, Linding R, Heck AJ, et al. Phosphorylation dynamics during early differentiation of human embryonic stem cells. Cell Stem Cell. 2009;5:214–26.

  112. 112.

    Rigbolt KT, Prokhorova TA, Akimov V, Henningsen J, Johansen PT, Kratchmarova I, et al. System-wide temporal characterization of the proteome and phosphoproteome of human embryonic stem cell differentiation. Sci Signal. 2011;4:rs3.

  113. 113.

    Tsuruzoe S, Ishihara K, Uchimura Y, Watanabe S, Sekita Y, Aoto T, et al. Inhibition of DNA binding of Sox2 by the SUMO conjugation. Biochem Biophys Res Commun. 2006;351:920–6.

  114. 114.

    Hietakangas V, Anckar J, Blomster HA, Fujimoto M, Palvimo JJ, Nakai A, et al. PDSM, a motif for phosphorylation-dependent SUMO modification. Proc Natl Acad Sci USA. 2006;103:45–50.

  115. 115.

    Thevenet L, Mejean C, Moniot B, Bonneaud N, Galeotti N, Aldrian-Herrada G, et al. Regulation of human SRY subcellular distribution by its acetylation/deacetylation. EMBO J. 2004;23:3336–45.

  116. 116.

    Gasca S, Canizares J, De Santa Barbara P, Mejean C, Poulat F, Berta P, et al. A nuclear export signal within the high mobility group domain regulates the nucleocytoplasmic translocation of SOX9 during sexual determination. Proc Natl Acad Sci USA. 2002;99:11199–204.

  117. 117.

    Zhao HY, Zhang YJ, Dai H, Zhang Y, Shen YF. CARM1 mediates modulation of Sox2. PLoS ONE. 2011;6:e27026.

  118. 118.

    Gao F, Kwon SW, Zhao Y, Jin Y. PARP1 poly(ADP-ribosyl)ates Sox2 to control Sox2 protein levels and FGF4 expression during embryonic stem cell differentiation. J Biol Chem. 2009;284:22263–73.

  119. 119.

    Myers SA, Peddada S, Chatterjee N, Friedrich T, Tomoda K, Krings G, et al. SOX2 O-GlcNAcylation alters its protein–protein interactions and genomic occupancy to modulate gene expression in pluripotent cells. Elife. 2016;5:e10647.

  120. 120.

    Fang X, Yoon JG, Li L, Tsai YS, Zheng S, Hood L, et al. Landscape of the SOX2 protein-protein interactome. Proteomics. 2011;11:921–34.

  121. 121.

    Liu Z, Kraus WL. Catalytic-independent functions of PARP-1 determine Sox2 pioneer activity at intractable genomic loci. Mol Cell. 2017;65:589–603 e589.

  122. 122.

    Gaullier G, Luger K. PARP1 and Sox2: an unlikely team of pioneers to conquer the nucleosome. Mol Cell. 2017;65:581–2.

  123. 123.

    Jang H, Kim TW, Yoon S, Choi SY, Kang TW, Kim SY, et al. O-GlcNAc regulates pluripotency and reprogramming by directly acting on core components of the pluripotency network. Cell Stem Cell. 2012;11:62–74.

  124. 124.

    Sharma NS, Gupta VK, Dauer P, Kesh K, Hadad R, Giri B et al. O-GlcNAc modification of oncogenetic transcription factor Sox2 promotes protein stability and regulates self-renewal in pancreatic cancer.https://doi.org/10.1101/345223 (deposited June 12, 2018)

Download references

Acknowledgements

We acknowledge funding assigned to TS through the University of Basel Forschungsfonds (Grant No. DMS-2351) and apologize to authors whose work could not be cited or laid out in further detail due to reference or space limitations.

Author information

Correspondence to Thorsten Schaefer.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Schaefer, T., Lengerke, C. SOX2 protein biochemistry in stemness, reprogramming, and cancer: the PI3K/AKT/SOX2 axis and beyond. Oncogene (2019) doi:10.1038/s41388-019-0997-x

Download citation