Introduction

The clonal selection theory, which states that each lymphocyte expresses only one specific antigen receptor, serves as the foundation for T and B-cell development, tolerance selection, and specific immune responses. The theory primarily relies on the mechanism of allelic exclusion during TCR or BCR V(D)J gene rearrangement where only one allele assembles a functional gene. Consequently, mature T (or B) cells express a functional alpha (light) and beta (heavy) chain. T (or B) cells that fail to adhere to the rearrangement rules and self-tolerance selection undergo apoptosis [1]. The mechanism of allelic exclusion is regulated by various factors such as including feedback inhibition [2], pTα chain [3], protein kinase [4], cis-acting elements [5], transcription factor [6], and the quality of recombination signal sequence [7, 8] etc. Several hypothetical mechanisms, such as asynchronous recombination models, stochastic models, and feedback inhibition models [9], have been widely acknowledged. However, the precise mechanism of allelic exclusion remains incompletely understood [10], and the existence of single T (or B) cells expressing dual TCR (or BCR) challenges this theory.

Incomplete V(D)J allelic exclusion (or allelic inclusion, or allelic exclusion escape) was first found on B cells. In 1961, MAKELA et al. found that 4 of 455 cells can produce anti H and O antibodies at the same time [11]. Subsequently, B cells equipped with two functional VHDJH or VKJK/VλJλ were found in both human and mouse with different proportions (1–20%) [12,13,14,15]. Single-cell sequencing of human peripheral blood also observed two or more VHDJH (5.90–8.71%) or VLJL (9.71–13.07%) recombination patterns of IgH chain or IgL chain, and each Ig class showed unique VHDJH recombination pattern in a single B-cell expressing multiple Ig classes [16]. In addition to physiological conditions, dual BCR B cells have been intensively studied in plasma cell tumors and other B-cell-related tumors [17,18,19]. Recently, the relationship of single lymphocytes expressing dual BCR chains or simultaneous expression of “TCR and BCR” in autogenetic immune diseases was emphasized: a quarter of SLE subjects showed higher frequency B cells with dual BCR [20], 2.2% lymphocytes possess dual TCR and BCR in type 1 diabetes [21].

Extensive experiments have provided support for the existence of single T cells expressing two functional α (V–J) and β chain (V-D-J). In 1988, two productive Vα and Vβ gene rearrangements in monoclonal T cells were detected at mRNA level [22, 23], which raised questions concerning the level at which allelic exclusion operates in T cells. Then, several laboratories have confirmed the incomplete V(D)J allelic exclusion of TCR chain and T-cell expresses more than one TCR both in human and mouse central and peripheral immune organs [22,23,24,25,26,27,28]. However, the proportion of T cells with dual TCR varies significantly (ranging from 0.01 to 30%) due to differences in research subjects and methods.

Single-cell sequencing presents valuable opportunities for investigating the rearrangement of single T and B cells that possess more than two receptor chains. The characteristics and proportion of rearranged V(D)J can be analyzed from the transcriptome of an individual lymphocyte. In this study, we systematically analyzed the expression of TCR chains in 34 single-cell sequencing samples, and compared them according to the central and peripheral, human and mouse, and different T-cell categories (Fig. 1). Remarkably, a high proportion of single T cells expressing multiple TCR (>2) were identified. Furthermore, we performed traceability analysis on T cells with more than three TCR chains, which provides new insights for evaluating the proportion, molecular characteristics, mechanism of dual TCR T cells, and potential significance of allelic exclusion escape of V(D)J rearrangement.

Fig. 1
figure 1

Illustration of experimental procedures.

Results

Expression of TCR chains in single T cells of human thymus

The expression of TCR α and β chains of total T, CD45+, and CD45− cells category in the human thymus were analyzed. The number and proportion of different TCR α and β chains detected in a single T-cell were shown in Table S1, and we selected three representative samples from total T cells (Fig. 2A), CD45+ (Fig. S1A) and CD45− (Fig. S1B) cells category for visualization. Single cells of all samples mainly displayed 23 types of TCR α and (or) β chain, including functional (F) and non-functional (N) mRNA. TCR β (F) was the most common type of cell with only one chain, and TCR α (F)+ TCR β (F) was the main type of cells with two chains. The proportion of T cells with three different TCR chains was the lowest in all three groups (Fig. S1C). The proportion of cells with two chains detected in total T cells was the largest, accounting for 46.400%, significantly higher than CD45+ cells and CD45− cells. The proportion of CD45+ cells expressing one type of chain and two types of chains was similar (42.170 and 41.290%, respectively). CD45− cells expressing one type chain predominated, accounting for 74.228%, the proportion significantly higher than that of the other two groups (Fig. 2B).

Fig. 2: Single-cell immune profiling revealed multiple TCR chain patterns in human thymic single T cells.
figure 2

A The pattern and proportion of different TCR α and β chains detected in a single T-cell of thymus total T-cell (one sample, T01). B The difference of T cells with one, two, and three (or more) types of chains among total T cells, CD45+cells, and CD45 cells in humans thymus. C The difference of T cells with single or multiple TCR among total T cells, CD45+cells, and CD45− cells in humans thymus. * is interpreted as P < 0.05. F functional, N non-functional.

The assembly of a TCR heterodimer consists of an α chain and a β chain. We assume that functional TCR contains at least one TCR α sequence (Functional) and one TCR β sequence (Functional). Although many TCR α and TCR β rearrangements were obtained from the human thymus, less than half of the cells that could assemble functional TCR (Table 1 and Fig. S1E). In total T cells, CD45+ and CD45 cells samples, 49.407, 39.780, and 10.896% of the cells assembled to TCR, respectively. Notably, a certain proportion of cells with dual or more TCR were found in the three cell types. We screened functional T cells (with functional TCR) for further analysis, and the T cells with single TCR dominated in all three groups (Fig.S1D). There was no statistical difference in the proportion of cells with dual (or more) TCR in the total T cells, CD45+, and CD45 cell subsets, which were 19.930, 15.905, and 10.040%, respectively (Fig. 2C).

Table 1 The number and proportion of cells resolved from single or multiple TCR.

Expression of TCR chains in single T cells of human peripheral blood

In order to further understand the expression of TCR α and β chains in human peripheral T cells, we analyzed the peripheral blood single-cell sequencing data of frail, young, and old populations. Similarly, single or multiple TCR α and (or) β chains were detected in a single T-cell (Table S2 and Fig. 3A). TCR α (N) and TCR β (N) were not found in all samples, which was consistent with the characteristics of T cells after self-tolerance selection. T-cell expressing one type of chain was ~18% in three groups, and TCR β (F) was the domination. The number of T cells expressing two types of chains was most, and the combination of TCR α (F) and TCR β (F) had the highest frequency in all samples. The number of T cells expressing two types of TCR chains in all three populations was the highest (Fig. S2C) in all three populations. The number of T cells with three type chains was significantly different among frail (35.379%), young (41.181%), and old (29.575%) populations (Fig. 3B). Three representative samples from frail (Fig. S2A), young (Fig. 3A) and old (Fig. S2B) populations were selected for visualization.

Fig. 3: Single-cell immune profiling revealed multiple TCR chain patterns in human peripheral blood single T cells.
figure 3

A The pattern and proportion of different TCR α and β chains detected in a single T-cell of young peripheral blood (one sample, C01). B The difference of T cells with one, two, and three (or more) types of chains among young, frail, and old populations. C The difference of T cells with single or multiple TCR among young, frail, and old populations. D The difference in the proportion of single and multiple TCR T cells between human thymus and human peripheral blood. * is interpreted as P < 0.05. F functional, N non-functional.

The proportion of human peripheral blood T cells containing single TCR, multiple TCR, and abnormal TCR were also analyzed (Table S2). The proportion of cells with potential functional TCR detected in frail, adult young, and old individuals were 68.760, 59.847, and 63.186%, respectively (Table 1). The number of T cells with single TCR was significantly higher than that with abnormal TCR and multiple TCR in all samples (Fig. S2E). Among the three groups, the highest proportion of single TCR was found in frail individuals, the multiple TCR proportion was the highest in old individuals, and the proportion of abnormal TCR is relatively higher in young individuals. We screened functional T cells (excluding abnormal TCR T cells) for further analysis, and the T cells with single TCR dominated in all three groups (Fig. S2D). T cells with multiple TCR appeared more frequently (P < 0.05) in the old population compared to frail and young groups (Fig. 3C).

The comparative analysis was performed among the human thymus and human peripheral blood. In functional T cells, the proportion of single TCR T cells in peripheral blood was higher than that of thymus, while the proportion of dual TCR T cells was opposite (Fig. 3D). In all T cells, the proportion of abnormal TCR T cells in thymus was significantly higher than that in peripheral blood (Fig. S2F), and the proportion of single TCR T cells and dual TCR T cells were similar to that of functional T cells.

Expression of TCR chain in single T-cell of mouse lymphoid tissue

As an important model animal, the single-cell sequencing data of lymph nodes, spleen, and peripheral blood of mice were also included in this study. Single T cells were found to express single or multiple TCR α and (or) β chains in all three mice tissues (Table S3). TCR β (F) was the domination mRNA in T-cells with one chain, the combination of TCR α (F) and TCR β (F) had the highest frequency in all samples, and T cells possess three TCR α(N) or three TCR β(N) were not found. Selected one representative sample from each group of lymph nodes (Fig. S3A), spleen (Fig. 4A), and peripheral blood (Fig. S3B) of mice for visualization.

Fig. 4: Single-cell immune profiling revealed multiple TCR chain patterns in mouse single T cells.
figure 4

A The pattern and proportion of different TCR α and β chains detected in a single T-cell of mouse spleen (one sample, S1). B The difference of T cells with one, two, and three (or more) types of chains among lymph nodes, spleen, and peripheral blood. C The difference of T cells with single or multiple TCR among lymph nodes, spleen, and peripheral blood.* is interpreted as P < 0.05. F functional, N non-functional.

T cells from lymph nodes, spleen, and peripheral blood mainly expressed two types of mRNA chains, and the proportion of one or three types of mRNA chains was both ~20% (Fig. S3C). T cells detected with one, two, or three or more chains was no significant difference among lymph nodes, spleen, and peripheral blood (Fig. 4B). The number of T cells with single TCR was significantly higher than that with abnormal TCR and multiple TCR in all three groups, and the proportion and trend of cells with single TCR, abnormal TCR and multiple TCR in the three groups were consistent (Fig. S3E). We screened functional T cells (excluding abnormal TCR T cells) for further analysis, and the T cells with single TCR dominated in all three groups (Fig. S3D). The proportion of dual (or more) TCR cells were 20.328, 19.233, and 17.938% in lymph nodes, spleen, and peripheral blood, respectively, and the proportion in lymph nodes was significantly higher than that in peripheral blood (Fig. 4C).

Traceability of dual (or more) TCR

Most TCR α and TCR β "V (D) J" mRNA sequences conform to the classical rules of 12RSS/23RSS rearrangement and allelic exclusion of TCR gene. However, single-cell expressed multiple TCR α and/or TCR β mRNA sequences were consistently observed in 34 single-cell sequencing samples, encompassing both central and peripheral immune organs, indicating that a single T-cell has three (or more) rearrangements and can undergo transcription and expression. Taking T01 sample of thymus as an example, we listed the V (D) JC family names and CDR3 AA sequences of eight representative TCR α and TCR β, including single TCR α or TCR β (Fig. 5A), dual TCR α or TCR β (Fig. 5 B, C), single TCR α and single TCR β (Fig. 5D), three TCR β (Fig. 5E), single TCR with multiple (≥3) TCR mRNA sequences (Fig. 5 F, G), dual TCR (Fig. 5H), three TCR (Fig. 5I), four TCR (Fig. 5J) and five TCR (Fig. 5K).

Fig. 5: V(D)JC recombination patterns and CDR3 AA sequences in single T cells.
figure 5

A T-cell expressing single TCR α or TCR β; B T-cell expressing dual TCR β; C T-cell expressing dual TCR α; D T-cell expressing one TCR α and one TCR β; E T-cell expressing three TCR β; F, G T-cell expressing multiple chains and with one potential TCR; H T-cell with two potential TCR. I T-cell with three potential TCR. J T-cell with four potential TCR. K T-cell with five potential TCR. F functional, N non-functional.

Based on the composition, location, RSS category, and direction of V (D) J gene at human and mouse TCR α and TCR β loci, we innovatively found that one of the three (or more) TCR β (or TCR α) in single T cells should be derived from the transcription of TREC circular DNA formed by secondary successful rearrangement. The first possible source of two TCR β transcripts on a single chromosome was shown in Fig. 6A, and the example of corresponding single T cells sequenced in the samples was shown in Fig. 6D. The possible source of two TCR α transcripts on a single chromosome was shown in Fig. 6B, and the example of corresponding single T cells sequenced was shown in Fig. 6E. Moreover, there is a reverse V gene (human V30 and mouse V31) on human and mouse TCR β loci. Therefore, the second possible source of two TCR β transcripts from a single chromosome was the "reverse" recombinant transcription (Fig. 6C) involving human V30 (or mouse V31), and the example of the corresponding single T-cell sequenced is shown in Fig. 6F.

Fig. 6: More than once TCR rearrangement on one chromosome.
figure 6

A The mechanism of two TCR β transcripts form one single chromosome by TREC circular DNA, and sequencing validation. B The mechanism of two TCR α transcripts form one single chromosome by TREC circular DNA. C The mechanism of two TCR β transcripts form one single chromosome by "reverse" recombinant. D The validation of mechanism A based on single T-cell sequencing data. E The validation of mechanism B based on single T-cell sequencing data. F The validation of mechanism C based on single T-cell sequencing data. F functional, N non-functional.

Dual TCR chains expression at protein level

To further understand the expression of multiple TCRα (or TCRβ) chains in single T cells, we conducted flow cytometry analyses on human peripheral blood and mouse lymphoid tissue. Among six volunteers, we detected the expression of dual TCRα chains in two individuals (TCR Vα7.2 and TCR Vα24), but no dual TCRβ chains were found (specifically, TCR Vβ8 and TCR Vβ13.1) (Table S4 and Fig. S4). In eleven mice from different age groups, we did not detect any T cells expressing dual TCRα chains, such as TCR Vα3.2, TCR Vα2, or TCR Vα11.1/11.2, or dual TCRβ chains, such as TCR Vβ5.1/5.2 or TCR Vβ13 (Table S4 and Fig. S5).

Discussion

The classical theory that "one lymphocyte only expresses one specific antigen receptor" is supported by extensive experimental evidence, and its mechanism mainly depends on allelic exclusion [1]. However, the detailed mechanism of allelic exclusion has not been clarified [9, 10] and the lack of allelic exclusion has also been discovered by several teams, which a single T-cell expressed two TCR beta or alpha chains in humans and mice [22, 24, 26]. Single-cell sequencing can simultaneously sequence the mRNA of TCR α and TCR β chain from substantial single T cells. This innovative technology has provided new avenues for studying allelic exclusion escape and the proportion of dual TCR expression [29]. In this study, we leveraged single-cell sequencing data of human and mouse (central and peripheral) T cells shared by multiple laboratories to analyze the functional and non-functional TCR α and TCR β sequences of single T cells, including the proportion of one, two, and three or more types of chains and whether they can be paired as functional TCR. In addition, the proportion of single and multiple TCR in central and peripheral immune organs and their possible source mechanisms was also carried out [30, 31].

T cells with a"TCR α (F) + TCR β (F)" pattern are considered to have a functional TCR (Table 1). In all 34 samples, many T cells possess more than one "TCR α (F) + TCR β (F)" pattern, which could be assembled into dual (or more) TCR in single T cells (Fig. 5). In 12 thymus samples, the proportion of T cells with dual (or more) TCR accounts for more than 15% of the total number of T cells that can be paired to assemble TCR. The low proportion of T cells with dual (or more) TCR in CD45− and CD45+ subpopulations may be attributed to the early development of T cells, at which the TCR rearrangement of T cells has not been completed. The high proportion of dual (or more) TCR in total T cells (19.930%) indicates that TCR β (or TCR α) chains have experienced two or more functional rearrangements during the development of T cells, while more than two functional rearrangements of TCR α or TCR β chain in a single T-cell reveal that TCR rearrangements exist allelic exclusion escape.

Disease and aging will lead to a change in the normal functioning of the immune system [32]. It has been proved that aging-related thymic degeneration, repeated antigen contact, and inherent cell aging process will cause changes in T cells [33], and dual V beta-expressing cells expanded dramatically in the periphery with age, and such expanded cells had an activated phenotype [34]. In this study, the proportion of T cells with multiple TCR in an old group (13.668%) was significantly higher than that in young volunteers (9.963%) and frail individuals (8.703%). The increased T cells with dual (or more) TCR may be related to the disorder of rearrangement or abnormal tolerance selection after thymus recession.

The proportion of T cells with dual (or more) TCR in mouse lymph nodes (20.328%) was slightly higher than that in the spleen (19.233%) and peripheral blood (17.938%), which was consistent with Heath’s report that dual TCR was mainly found in mouse lymph nodes [35]. Moreover, the proportion of dual TCR cells found in this study is quite different from the existing reports (range from 1 to 30%) [34, 36, 37], indicating the complexity of dual TCR T cells and the necessity of further research. Interestingly, the proportion of T cells with multiple TCR in mouse peripheral lymphoid tissue is significantly higher than that in human peripheral blood, suggesting that the mechanism of allelic inclusion can be studied with the help of mice.

Theoretically, T cells express only one TCR after self-tolerance selection according to clonal selection theory. However, single-cell sequencing found a high proportion of multiple TCR α and TCR β chain types both in the central and peripheral immune tissues, and most of them could not assemble functional TCR. Therefore, we suggest that researchers should first classify and evaluate the cells with single TCR, multiple TCR, and abnormal TCR when performing the single T-cell sequencing data, and then unveil the relationship between their physiology and pathology to avoid deviation or even error in the results.

Single-cell sequencing showed a high proportion of TCR α or TCR β mRNA sequences which cannot be assembled into TCR. The possible reason is that single-cell sequencing technology failed to detect TCR α and TCR β mRNA in all single T cells at the same time, or failed to detect V (D) J sequences completely. Up to now, a similar phenomenon has been found in the research of single-cell sequencing applied to T or B-cell repertoire [16, 29,30,31]. Therefore, unpaired TCR chains should be separated in single-cell sequencing data and that should not be included in physiological or pathological correlation analysis, otherwise, TCR repertoire information will be expanded or distorted.

The characteristics of T-cell rearrangement in the thymus are as follows (TCR β example): (1) 1/3 of the functional TCR β that undergoing one rearrangement; (2) \(\left( {1 - \frac{1}{3}} \right) \times \frac{1}{3}\) of the functional TCR β undergoing two rearrangements. The ratio of TCR β (F) and TCR β (N) of total T cells in the thymus in this study is basically consistent with the rearrangement rule. If both TCR β rearrangements fail, will "the third rearrangement" be started to save the apoptotic T cells? The research on this point is blank at present. There are TCR β (N) + TCR β (N) + TCR β (F) mRNA in a single cell, but the order of non-functional rearrangement and functional rearrangement cannot be determined.

T cells with three (or more) TCR β rearrangement events were identified in this study (Table S1-S3), which is inconsistent with the universally acknowledged TCR rearrangement rules: a single T-cell can only form two types of V (D) J combinations to participate in transcription. By analyzing RSS category, RSS direction, and the V (D) J gene position at TR locus of single T-cell which contains three (or more) TCR β or TCR α mRNA sequences, we creatively found that one of three TCR β (or TCR α) should be derived from the transcription of TREC circular DNA formed after the twice successful rearrangement of the same chromosome (Fig. 6). For TCR β chain, there is a reverse V gene (human V30 and mouse V31), so another possible mechanism is that human V30 (or mouse V31) participates in a "reverse" recombination and transcription. the forward and reverse V genes of TCR β locus participate in the rearrangement in single T-cells has been confirmed in V31 transgenic mice [38, 39]. This study is the first to propose a new mechanism of secondary (or more) TCR rearrangement on one chromosome, which provides a new idea for research on allelic exclusion escape. However, the underlying reasons for T cells undergoing three or more functional rearrangements of TCR α and β chain under physiological conditions remain unclear. Further research is needed to explore the potential significance of this phenomenon.

The proportion of mRNA level and protein level of dual TCR in a single T cell is reported differently in different laboratories [29, 40, 41]. Most of them are detected at mRNA level, but less at protein level. In this study, no "dual TCR β chain" T cells were detected at the protein level, but dual TCR α chain T cells could be detected in human peripheral blood samples. The high proportion of multiple TCR α or TCR β chain detected by single-cell sequencing requires more TCR alpha and beta chain antibodies for protein-level verification. Moreover, allelic exclusion may not only regulate on V(D)J gene rearrangement and transcription but also on the level of protein expression to ensure that most T cells ultimately express only one specific receptor.

In recent years, T cells with dual TCR have been found in autoimmune diseases [37, 42], aging [34], infection [43], malignant T-cell tumors [25], and GVHD [44]. With the help of single-cell sequencing technology, further analyze the mechanism and significance of allelic exclusion escape, which can provide new insights into the relationship between T cells with dual TCR and these diseases.

Conclusion

This study found a high proportion of multiple TCR T cells by analyzing 34 single-cell sequencing data of humans and mice, including central and peripheral T cells, and proposed for the first time the new mechanism of secondary (or more) TCR rearrangement on one chromosome. Moreover, the single-cell sequencing data of T cells contained substantial sequences that could not be successfully assembled into functional TCR, suggesting that the data of single-cell sequencing should be classified according to single, multiple, and abnormal TCR when analyze the T-cell repertoire to avoid the deviation or even error of the repertoire on the physiological and pathological significance.

Materials and methods

Single-cell sequencing datasets acquisition

Three scRNA-seq datasets of T cells were downloaded from ArrayExpress and Gene Expression Omnibus (GEO) data repository, including three different T-cell categories (total, CD45+ and CD45− cells) of human thymus (EMTAB-8581), peripheral blood T cells of volunteers (GSE157007) under three different physiological conditions (frail, young and old) and T cells from three C57BL/6 mice lymph nodes, spleen and peripheral blood (GSE168944). The basic information of the above datasets was shown in Table 1. The total T cells (5 samples), CD45+ cells (5 samples), and CD45− cells (2 samples) of the human thymus are derived from thymic tissue during embryonic (9–39 weeks of pregnancy), neonatal (0–28 days), and adult (18–40 years of age) periods. The average age of the young (3 samples) and the olderly (5 samples) were 30.7 ± 10.0 years and 85.8 ± 11.1 years, respectively. According to the frailty index test, which includes chronic disease, disability, cognitive impairment, and stress index, elderly individuals aged 65 years or older with a score greater than 0.2 are defined as frailty (5 samples).

Datasets preprocessing

The raw single-cell sequencing files of human thymus (T01-T05, P01-P05, N01-01), human peripheral blood (R01-R05, C01-C03, L01-L05) samples, and mouse peripheral tissue (LN1-LN3, S1-S3, B1-B2) samples were downloaded from Array Express database and NCBI database. Data quality control was performed in Microsoft Office Excel by screening and deleting sequences with the following four criteria: sequences in "is_cell" with "FALSE"; sequences in "high_confidence" with "FALSE"; sequences in "chain" that are not "TRA, TRB"; and sequences in "productive" with "None". Cells with high confidence, expressing TCR chains and defined as functional (F) or non-functional (N) can be used for subsequent analysis. Cells were classified according to the number of TCR chains expressed by a single cell. Then, combination analysis was performed according to the functionality of TCR chain include separate TCR α or TCR β chains and TCR α + TCR β. For example, cells with two TCR β chains were assembled into the following pattern: TCR β(F) + TCR β(F), TCR β(F) + TCR β(N), and TCR β(N) + TCR β(N), cells with one TCR α and one TCR β chain were assembled into the following pattern: TCR α(F) + TCR β(F), TCR α(F) + TCR β(N), TCR α(N) + TCR β(F) and TCR α(N) + TCR β(N). Finally, the cells were divided into 23 types according to the number of different TCR and the potential paired of TCR chain, which were (1) TCR β(F) (2) TCR β(N) (3) TCR α(F) (4) TCR α(N) (5) TCR β(F) + TCR β(F) (6) TCR β(F) + TCR β(N) (7) TCR β(N) + TCR β(N) (8) TCR α(F) + TCR α(F) (9) TCR α(F) + TCR α(N) (10) TCR α(N) + TCR α(N) (11) TCR β(F) + TCR α(F) (12) TCR β(F) + TCR α(N) (13) TCR β(N) + TCR α(F)(14) TCR β(N) + TCR α(N) (15) TCR β(F) + TCR β(F) + TCR β(F) (16) TCR β(F) + TCR β(F) + TCR β(N) (17) TCR β(F) + TCR β(N) + TCR β(N) (18) TCR β(N) + TCR β(N) + TCR β(N) (19) TCR α(F) + TCR α(F) + TCR α(F) (20) TCR α(F) + TCR α(F) + TCR α(N) (21) TCR α(F) + TCR α(N) + TCR α(N) (22) TCR α(N) + TCR α(N) + TCR α(N) and (23) others. Based on the number of different types of chains expressed in single T cells, they were divided into three groups: "one type of chain", "two types of chains", and "three or more types of chains". Based on the pairing of TRA and TRB chains, single T cells were statistically compared and analyzed in terms of "single TCR", "multiple TCR", and "abnormal TCR".

The functional V(D)J gene (no pseudogene family) recombined to form a fully transcribed mRNA sequence, which is called functional TCR α sequence or functional TCR β sequence. Functional T cells refer to a single T-cell that contains at least one TCR α (F) sequence and one TCR β (F) sequence. Non-functional T cells refer to (1) the absence of TCR α (F) and TCR β (F) sequences in a single T-cell; (2) Only TCR α (F) sequences in a single T-cell; (3) Only TCR β (F) sequences in a single T-cell.

“One type of chain" refers to the detection of only one TRA or TRB chain in a single T-cell, such as: TRA(F), TRA(N), and so on. That is, T cells that cannot be assembled into functional TCR at the transcriptome level. "Two types of chains" refer to the detection of two different TCR chains in a single T-cell, and the V(D)JC composition of these two sequences were different, such as: TRB(F) + TRA(F), TRB(F) + TRA(N), TRB(N) + TRA(F), TRB(N) + TRA(N) and so on. Among them, TRB(F) + TRA(F) is a cell type that can be assembled into a single functional TCR, and other combinations cannot be assembled into a functional TCR in a single cell. "Three or more types of chains" refers to the detection of three or more different chains in a single T-cell, and the V(D)JC composition of these three or more sequences were different, such as: TRB(F) + TRB(F) + TRB(F), TRB(F) + TRB(F) + TRB(N), TRB(F) + TRB(N) + TRB(N) and so on.

Statistics of TCR expressed in single cells

Cells with a pair of TCR β (F) + TCR α (F) were defined as single TCR T cells, eg: TCR β (F) + TCR α (F)+nTCR α (N)+nTCR β (N). Three or more functional TCR chains expressed by a single cell can be combined into different patterns of TCR β (F) + TCR α (F), and these cells were defined as multiple TCR cells, eg: TCR β(F) + TCR α(F)+nTCR β(F/N)+nTCR α(F/N). The T cells without any combination of TCR β (F) + TCR α (F) were defined as abnormal TCR cells. Then, the analysis of the proportion and difference of cells with single TCR, multiple TCR, and abnormal TCR was performed among all samples.

Traceability of multiple TCR

T cells expressing three or more TCR chains were identified in all samples, which is inconsistent with the generally recognized principle of allosteric exclusion. Therefore, we screened the cells expressing three TCR chains and their sequences for traceability analysis according to TCR rearrangement rule, including RSS category, RSS direction, and the V(D)J gene position at TR locus.

Flow cytometry

Peripheral blood samples were collected from six healthy volunteers. Additionally, spleen, lymph nodes, and thymus samples were collected from eleven BALB/c mice (Table S5). This experiment was approved by the Ethics Committee of Zunyi Medical University and with the informed consent of healthy volunteers (ZMU [2021]1-022). Single-cell suspensions of lymphocytes from human peripheral blood were incubated with red blood cell lysis buffer (0.7 M NaCl and 17 mM Tris HCl). Cells were washed with FACS staining buffer (PBS containing 0.5% BSA) and stained with the following Biolgend antibodies: APC-TCR Vα7.2 (351707), PC5.5-TCR Vα24 (360003), FITC-CD3 (317305), PE-TCR Vβ13.1 (362409), APC-TCR Vβ8 (348105), APC-TCRβ5.1/5.2 (139505), PE-TCR Vβ13 (140703), APC-TCR Vα2 (127809), FITC-TCR Vα3.2 (135403), PE-TCR Vα11.1/11.2 (139903) and PC5.5-CD3(100217). Cells were stained in FACS staining buffer. All flow cytometry analyses were performed on a Partec PAS. Data were analyzed with the help of FlowJo software.

Statistical analysis

Statistical analysis was performed with IBM SPSS Statistics 26 software. Two-tailed t tests were suitable for comparison analysis between two groups of data that conform to the normal distribution, one-way ANOVA was applied for three or more independent groups with normally distributed and homogenous variance data. While data that were not normally distributed were analyzed with the non-parametric Mann–Whitney test and Kruskal–Wallis test. Data were represented as means ± SEM. Significance levels were labeled as *P < 0.05, **P < 0.01. GraphPad Prism software and Adobe Illustrator software were used for data visualization.