Exploring hematological alterations and genetics linked to SNV rs10974944 in myeloproliferative neoplasms among Amazon patients

BCR::ABL1-negative myeloproliferative neoplasms are hematopoietic disorders characterized by panmyelosis. JAK2 V617F is a frequent variant in these diseases and often occurs in the 46/1 haplotype. The G allele of rs10974944 has been shown to be associated with this variant, specifically its acquisition, correlations with familial cases, and laboratory alterations. This study evaluated the association between the 46/1 haplotype and JAK2 V617F in patients with myeloproliferative neoplasms in a population from the Brazilian Amazon. Clinical, laboratory and molecular sequencing analyses were considered. Carriers of the G allele of rs10974944 with polycythemia vera showed an increase in mean corpuscular volume and mean corpuscular hemoglobin, while in those with essential thrombocythemia, there was an elevation in red blood cells, hematocrit, and hemoglobin. Associations were observed between rs10974944 and the JAK2 V617F, in which the G allele (OR 3.4; p < 0.0001) and GG genotype (OR 4.9; p = 0.0016) were associated with JAK2 V617F + and an increase in variant allele frequency (GG: OR 15.8; p =  < 0.0001; G: OR 6.0; p = 0.0002). These results suggest an association between rs10974944 (G) and a status for JAK2 V617F, JAK2 V617F + _VAF ≥ 50%, and laboratory alterations in the erythroid lineage.

also been associated with more pronounced alterations in laboratory exams, presence of splenomegaly, inflammatory dysregulation, familial cases of MPNs (increasing the risk of developing any myeloproliferative neoplasm by 5 to 7 times) and abnormal methylation of the gene promoter [10][11][12][13] .Therefore, the JAK2 46/1 haplotype confers predisposition to the development of myeloproliferative neoplasms associated with the JAK2 V617F mutation (OR = 3.7; 95% CI = 3.1-4.3)and provides a conceptual framework in which a constitutional genetic component is associated with a substantial increase in the risk of acquiring a specific somatic mutation 14 .
In this study, we performed genetic sequencing of intron 12 of the JAK2 gene to identify the rs10974944 variant (C > G), in strong linkage disequilibrium with the 46/1 haplotype, in 100 patients with BCR::ABL1-negative myeloproliferative neoplasms (polycythemia vera: n = 39; essential thrombocythemia: n = 61) for whom clinical and laboratory information was available for clinical and laboratory characterization.

Characterization of the study population
The study included individuals clinically diagnosed with polycythemia vera (PV) (n = 39) or essential thrombocythemia (ET) (n = 61), whose clinical-laboratory characteristics are presented in the supplementary material.The female gender was more prevalent among individuals diagnosed with ET (n = 48, p = 0.002).The median age of the participants ranged between the fifth and sixth decades of life (p = 0.441).
Regarding hematological results, the medians of overall red blood cell count (RBC), hematocrit (Ht), hemoglobin (Hb), and total white blood cell count (WBC) were significantly higher in the PV group compared to the ET group (p < 0.05) (see Table SI).Other hematological markers, such as mean corpuscular volume (103.9 pg, p < 0.0001), mean corpuscular hemoglobin (33.5 fL, p < 0.0001), and overall platelet count (467,000 × cells/mm 3 , p < 0.0001), were also significantly elevated in the ET group compared to the PV group.Hemorrhagic events were more frequent in patients with ET compared to PV (p = 0.003), while the frequency of splenomegaly and thrombotic events did not differ significantly between PV and ET (p > 0.05) groups.
A greater frequency of patients with ET (95.1%, p < 0.0001) received cytoreductive treatment in comparison to PV patients (66.6%).

Identified genetic variants
Data on the allelic and genotypic frequency of rs10974944 (C > G) are presented in Figs.2c and d.Of all the individuals included in the study, 63% exhibited the rs10974944 variant (G): 26% in homozygosity (GG) and 37% in heterozygosity (CG).The GG genotype of rs10974944 was more prevalent in the PV group (36%), whereas CG was more homogeneous between the groups (33.3% in PV and 39.3% in ET).Regarding allelic frequency, the G allele was more frequent in the PV (53.6%) group, and the wild-type allele proved to be more prevalent in the ET (60.7%) group.
Table 1 presents the hematological data of individuals with polycythemia vera and essential thrombocythemia stratified according to the absence or presence of the rs10974944 (CC and G carriers, respectively).In PV, G carriers showed significantly increased values for MCV and MCH (p = 0.030 and p = 0.041, respectively), while in ET, patients with the variant exhibited elevated indices of RBC, Ht, and Hb with demonstrated statistical significance (p < 0.05).2. A significant correlation was observed between the G allele and thrombotic events in patients with PV (p = 0.041) and a similar trend in ET, however, without significance statistics (p = 0.073).These data suggest that the G allele of rs10974944 may be associated with an increased risk of thrombotic events in patients with PV.

Distribution of variants in patients stratified according to JAK2 V617F status and variant allele frequency
Considering the possible association of rs10974944 with JAK2 V617F, the genotypic frequency analysis of rs10974944 (C > G) was performed according to the positive (+) or negative (−) status of JAK2 V617F and its variant allele frequency (VAF), with data described in Table 3.
Homozygous individuals for rs10974944 (GG) showed a significantly higher frequency of JAK2 V617F + status and a higher likelihood of being positive for this variant when compared to the CC genotype (42.2% vs 12.5%; OR 4.9; 95% CI 1.8-13.9;p = 0.00016) (Table 3).We emphasize the correlation of the rs10974944 G allele with the V617F variant, which demonstrated a 3.4-fold higher probability of being present in JAK2 V617F + individuals compared to individuals carrying the C allele (61.1% vs 38.9%; OR 3.4; 95% CI 1.9-6.2;p < 0.0001).
Additionally, the analyses revealed that individuals with the GG genotype of rs10974944 had a 13.1-fold higher probability of having a VAF greater than 50% when compared to individuals with the CC genotype (75% vs 15%; OR 13.1; 95% CI 1.8-72.3;p = 0.004).Regarding the allele, carriers of the G allele showed a sixfold higher risk of having a VAF of ≥ 50% compared to the wild-type allele (C) (82.5% vs 17.5%; OR 6.0; 95% CI 2.1-14.8;www.nature.com/scientificreports/p = 0.0002).These results demonstrate an association between rs10974944 and the variation in VAF in JAK2 V617F.

Identified haplotypes
The linkage disequilibrium (LD) of rs10974944 and JAK2 V617F (rs77375493) is demonstrated in Fig. 3.The variants identified in the analyzed region were included in the haplotype analysis.When these genetic changes are paired, they give rise to nine haplotypes (Table 4).Haplotype analysis revealed that haplotype 2 (rs10974944G/ rs10815151C/rs1011004A/rs77375493T) was more prevalent in individuals with JAK2 V617F + (46.5%;OR 19.6; 95% CI 3.1-208; p = < 0,0001), which indicates a strong correlation between the variants.This information is in accordance with that contained in supplementary figure I and supplementary table III, where it is possible to

Discussion
Myeloproliferative neoplasms have characteristic alterations in laboratory exams, as well as genetic findings that permit their identification and differentiation.Findings involving genetic alterations in introns are not yet fully understood, but this scenario is becoming of increasing interest for understanding the etiopathogenic aspects and the role of these DNA regions in these diseases.Essential thrombocythemia proved to be the most frequent myeloproliferative neoplasm, which are findings that align with the premises established by Torres 15 , who studied a population with BCR::ABL1-negative myeloproliferative neoplasms in the state of Amazonas (Brazil).Similar data were described by Macedo 16 , who reported a similar scenario in patients from the states of Paraná and São Paulo who had the same hematologic malignancy, and these data converge with descriptions found in other countries 17,18 .
The age range of individuals was between the fifth and seventh decades of life, which is consistent with what is stated in other studies 19,20 .The progressive accumulation of genetic variations in hematopoietic stem cells and the biological machinery of the DNA repair system 21,22 , an increase or decrease in telomeres 23,24 and cumulative exposure to risk factors throughout life, such as smoking and obesity 25,26 , may explain the prevalence of this age group in the context of myeloproliferative neoplasms.
Regarding clinical characteristics, polycythemia vera (PV) showed an equal proportion of men and women, while essential thrombocythemia (ET) revealed a majority of cases involving women, and these data are in line with the literature 27,28 .Some studies have demonstrated that women have an increased risk of developing myeloproliferative neoplasms 29 and a higher likelihood of developing cardiovascular complications and splenomegaly 26 .The reason for this risk is uncertain, but changes in sex chromosomes, hormonal factors and gene expression may be possible contributors to this process 28 .Laboratory data, and thrombotic and hemorrhagic events presented as expected for each neoplasm: PV demonstrated a higher prevalence of increased erythrogram values and ET showed changes in the megakaryocytic series, with a higher risk of hemorrhagic events, as described by the World Health Organization 3 , and in other studies on the subject 27,30 .
Regarding the genetic findings, PV demonstrates a higher prevalence of positive cases for the JAK2 V617F variant, since it is directly associated with the specific pathogenesis of this hematologic malignancy 36 and plays a role in the constitutive activation of the JAK-STAT pathway 5 .It is interesting to note that 58% of our PV population was positive for the variant, which may initially differ from findings commonly described in the literature that point to JAK2 V617F frequencies of over 70% in Brazilian, Korean, Chinese, Japanese, and European patients [31][32][33][34][35] .
Our analysis reveals a notable specificity in our population compared to the data documented in the literature, especially in patients with PV, where 42% of these patients did not present the JAK2 V617F variant or other pathogenic genetic alterations along the coding region of JAK2, as established by WHO diagnostic criteria 3 .This atypical behavior suggests significant gaps in our understanding of the genetic factors underlying the etiopathogenesis of myeloproliferative neoplasms in the Amazonian population.This gap underscores the pressing need for further studies to achieve a more comprehensive understanding of the genetic profile of these diseases and other contributing factors.Therefore, additional studies in our population are recommended, exploring other genes relevant to myelopoiesis and epigenetic regulation, such as DNMT3A (DNA Methyltransferase 3 Alpha), NFE2 (Nuclear factor erythroid 2), SF3B1 (Splicing Factor 3b Subunit 1), TET2 (Tet Methylcytosine Dioxygenase 2), ASXL1 (ASXL Transcriptional Regulator 1) and EZH2 (Enhancer Of Zeste 2 Polycomb Repressive Complex 2 Subunit) 21,52 .Analysis of these genes may provide valuable insights into the genetic behavior of myeloproliferative neoplasms in the Amazonian population and elucidate other factors involved in PV pathophysiology, beyond the known variants in JAK2 V617F and JAK2 exons 12 and 14.
In the literature, the germline haplotype 46/1, identified by the rs10974944 (C > G) variant, has a well-documented association with JAK2 V617F 14,[36][37][38] as also observed in our study.The data regarding the frequency of the minor allele of rs10974944 in the Brazilian population and the Amazon region remain scarce, making this study pioneering in this investigation.The absence of previous studies on this variant in the Amazonian population underscores the importance of the current work in filling this gap in the genetic knowledge of this population.However, the frequency of the 46/1 haplotype, associated with rs10974944, has been linked to a higher prevalence in patients with myeloproliferative neoplasms, especially those harboring JAK2 V617F + .This association has been observed not only in other Brazilian populations as described by Macedo et al. 16 but also in studies conducted across various populations worldwide, including Asian, European, and North American populations, as discussed in one of our previous integrative reviews 13 .Additionally, the ancestral contribution to the Brazilian population, particularly in the Amazon region, is characterized by a mixture of three main ethnic groups: Native Americans (NAM), Europeans (EUR), and Africans (AFR) 39 .Therefore, it is plausible to infer that the genetic behavior of the variant in these populations, as described previously, is similar, thus strengthening the discussion regarding similar behavior in our population.
The high frequency of the G allele of rs10974944 in individuals positive for JAK2 V617F contributes to discussions about the non-random correlation between these two genetic alterations 13,40 This relationship is in line with another finding from our study, haplotype 2 (rs10974944G/rs10815151C/rs1011004A/rs77375493T), which strengthens concepts based on the interaction between rs10974944 (C > G) and JAK2 V617F (rs77375493-G > T).These propositions are in agreement with findings involving haplotype 46/1 in other Brazilian, Taiwanese, European, Chinese, and Japanese populations 16,[32][33][34]41 , indicating that the possible mechanisms preceding the acquisition of JAK2 V617F are not limited to a specific ethnic group; therefore, its evolutionary basis can be considered as a genetic predisposition factor for the disease 8  www.nature.com/scientificreports/ Stdies report a higher risk of individuals with the GG genotype of rs10974944 being positive for JAK2 V617F 14,40,42 .Consistent with the results of the aforementioned studies, our population exhibited a four-fold increase in the risk of positive JAK2 V617F in individuals with the GG genotype of rs10974944 (OR 4.1; 95% CI 8-13.9).These findings support the hypothesis of hypermutability, which establishes haplotype 46/1 as a dysregulating agent of the JAK2 gene, which increases the risk of DNA replication errors and conditions a mutagenic scenario for the acquisition of variants with selective advantages, such as JAK2 V617F [43][44][45] .
The association of rs10974944 (G) and the JAK2 V617F VAF suggests a possible involvement of haplotype 46/1 in clonal expansion.We identified a six-fold higher risk of individuals carrying the G allele of rs10974944 and JAK2 V617F VAF of ≥ 50%.Our data indicate that the marker of haplotype 46/1 may play a role not only in the acquisition of JAK2 V617F but is also attributed to clonal expansion, maintenance, and survival.Tefferi 46 suggests that JAK2 V617F is not the initial clonogenic event in MPNs but rather one of several subclones derived from an ancestral clone.This is in accordance with the notes of Pardanani et al. 47 , which support the hypothesis that this haplotype is located in a favorable cis regulatory environment, which facilitates the acquisition of JAK2 V617F, and which, in turn, is responsible for clonal expansion and the development MPNs.
Furthermore, the possible role of acquired uniparental disomy, a genetic event that leads to mitotic recombination associated with neutral loss of heterozygosity of chromosome 9p in MPN patients, reducing both the haplotype and JAK2 V617F to a homozygous state 14,48,49 , cannot be ruled out.In this context, cells with both variants theoretically have a selective advantage, which conditions greater myeloproliferative potential and favors the establishment of variant cells over healthy cells, thus explaining the increased VAF in individuals with the combination rs10974944 (G) + rs77375493 (T) (JAK2 V617F) in homozygosity.
Association between changes in hematological indices, clinical characteristics and the presence of 46/1 is observed in the literature 16,33,50 ; however, this is not a consensus among the scientific community 8 .Our data show significant differences in MCV, MCH values in the PV group, and RBC, Hb, and Ht in TE carriers of the G allele of rs10974944, which has been observed in previous studies 7,42,51 .The significant demonstration of rs10974944 with thrombotic events strengthens the use of this variant as a tool for monitoring patients and investigating clinical findings of polycythemia vera.For a more reliable correlation of this correlation, new studies are needed, with more robust populations, to observe the behavior of the variant in relation to clinical and hematological characteristics in PV patients.
The present research is the first to analyze the 46/1 haplotype using the rs10974944 variant, present in intron 12 of JAK2, in a population from the Brazilian Amazon.The results of this study show that the rs10974944 (G) variant has a strong correlation with the JAK2 V617F + variant, demonstrated especially in PV_JAK2 V617F + patients.A correlation of the variant with a high allelic variant burden of JAK2 V617F, thrombotic events and hematological changes was also observed.The variant is a promising possibility for clinical use for investigating and monitoring laboratory changes and/or increased VAF in identified hematological malignancies.

Population
One hundred individuals clinically diagnosed with BCR::ABL1-negative myeloproliferative neoplasms were included in the study.The study was conducted from February 2021 to January 2023.Laboratory analysis was performed at the Genomics Laboratory of the Foundation Hospital for Hematology and Hemotherapy of the State of Amazonas.

Figure 2 .
Figure 2. Distribution of genetic data for (a) JAK2 V617F, (b) Variant allele frequency of JAK2 V617F + , and (c) Genotypic frequency and (d) Allelic frequency of rs10974944 in patients with polycythemia vera or essential thrombocythemia.

Table 1 .
Laboratory characteristics of G carriers (rs10974944) and individuals without the variant who were diagnosed with polycythemia vera or essential thrombocythemia.RBC Red blood cell count, Ht Hematocrit,

Table 2 .
Clinical characteristics of G carriers (rs10974944) and individuals without the variant who were diagnosed with polycythemia vera or essential thrombocythemia.Significant values are in bold.