Main

Prestige plays a central role in structuring the US professoriate. Analyses of faculty hiring networks, which map who hires whose graduates as faculty, show unambiguously in multiple fields that prestigious departments supply an outsized proportion of faculty, regardless of whether prestige is measured by an extrinsic ranking or reputation scheme11,12,13 or derived from the structure of the faculty hiring network itself14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29. Prestigious departments also exhibit ‘social closure’15 by excluding those who lack prestige, facilitated by relatively stable hierarchies over time, both empirically17 and in mathematical models of self-reinforcing network dynamics30,31.

These observations are important because of the broad impacts of prestige itself. Prestigious affiliations improve paper acceptance rates in single- versus double-anonymous review32; faculty at prestigious universities have more resources and write more papers33,34, receive more citations and attention35,36,37 and win more awards38,39; and graduates of more prestigious universities experience greater growth in wages in the years immediately after graduating40. Furthermore, the vast majority of faculty are employed by departments less prestigious than those at which they were trained27, making prestigious departments central in the spread of ideas3 and academic norms and culture more broadly.

Less well studied are the processes of attrition that, together with hiring, shape the data underpinning the analyses reviewed above. Evidence suggests that women in science and engineering (but not mathematics) and foreign-born faculty leave the academy in mid-career at higher rates than do men41 and US-born42 faculty, respectively, making clear the fact that the US professoriate is structured by more than just prestige. These processes are particularly important in light of clear evidence that the topics studied by faculty depend not only on their field of study, but also on their (intersecting) identities43.

However, the difficulty of assembling comprehensive data on US faculty across fields, across universities and over time has limited analyses and comparisons, leaving it unclear how much of the observed patterns and differences are universal, vary by field or are driven by current or past hiring or attrition. Less visible but just as important are the inherent limitations of focusing only on the placement of faculty within the US system, to the exclusion of US faculty trained abroad. A broad cross-disciplinary understanding of academic hierarchies and their relationship to persistent social and epistemic inequalities would inform empirically anchored policies aimed at accelerating scientific discovery or diversifying the professoriate.

Data and approach

Our analysis examines tenured or tenure-track faculty employed in the years 2011–2020 at 368 PhD-granting universities in the United States, each of whom is annotated by their doctoral university, year of doctorate, faculty rank and gender. To be included in our analysis, a professor must be a member of the tenured or tenure-track faculty at a department that appears in the majority of sampled years, which yields n = 295,089 faculty in 10,612 departments.

This dataset resulted from cleaning and preprocessing a larger US faculty census obtained under a data use agreement with the Academic Analytics Research Center (AARC). To facilitate comparisons of faculty across areas of study, we organized departments into 107 fields (for example, Physics, Ecology) and eight domains (for example, Natural Sciences) (Extended Data Table 1). Field labels, provided in the AARC data, and subsequently hand-checked, are not mutually exclusive, such that 23% of faculty were assigned to multiple fields (for example, members of a Department of Physics and Astronomy were assigned to both Physics and Astronomy). For faculty associated with multiple departments, we restricted our analyses to their primary appointments only. All doctoral universities were manually annotated by country. Self-reported faculty genders were used when available, and otherwise algorithmically annotated (man or woman) on the basis of historical name–gender associations, recognizing that there are expansive identities beyond this limiting binary. These procedures resulted in gender annotations for 85% of records; faculty without name–gender annotations were not included in analyses of gender but were included in all other analyses. Comparing data collected in adjacent years, we also annotated all instances of new hiring, retention and attrition. Data preparation and annotation details can be found in Methods.

To analyse patterns of faculty hiring and exchange among US universities, we created a faculty hiring network for each of the 107 fields, eight domains and for academia as a whole (Methods). In such a network, each node u represents a university, and a directed edge u → v represents an individual with a doctorate from u who becomes a professor at v. Faculty employed at their doctoral universities, so-called self-hires, are represented as self-loops u → u. When aggregating field-level hiring into networks for the eight domains or for academia in toto, we take the union of the constituent fields’ edges, which avoids double-counting of faculty rostered in multiple fields. Anonymized data supporting our analyses are freely available (Data availability).

Pre-eminence of US doctorates

In general, although our data show that US academia largely requires doctoral training, the ecosystem of broad domains and specialized fields exhibits diversity in its credential requirements. Fully 92.7% of all faculty hold doctoral degrees yet only 1% lack a doctorate in Social Sciences compared with 19% in the Humanities (Fig. 1a). Even within the Humanities there is wide variation, with only 7% of remaining faculty lacking a doctorate if one separates out the fields of Theatre (67% non-doctorates), Art History (44%), Music (30%) and English (11%) (Extended Data Fig. 1).

Fig. 1: Composition and dynamics of the US professoriate by doctoral training.
figure 1

a, Degrees of n = 295,089 US faculty by domain, and for academia overall, separated by non-doctoral degrees (solid bars), US doctorates (open bars) and non-US doctorates (hatched bars). b, Continent of doctorate for n = 31,845 faculty with non-US doctorates by domain. Within the Europe and North America bars, darkened regions correspond to faculty from the United Kingdom and Canada, respectively. c,d, Ratios of average annual attrition risks among faculty with doctorates from Canada and the United Kingdom (c) (n = 11,156), and from all countries other than Canada, the United Kingdom and the United States (d) (n = 20,689), versus all US-trained faculty, for each field (colours), domain (grey) and academia (blue), on logarithmic axes. Circles, significantly different from 1.0, χ2 test, Benjamini–Hochberg-corrected P < 0.05; crosses, not significant.

Source data

This variation in credentials is paralleled by US faculty trained internationally. Overall, our analysis finds that 11% of US faculty have non-US doctorates yet only 2% of Education faculty received their doctorates internationally compared with 19% of Natural Sciences faculty (Fig. 1a). However, internationally trained faculty primarily receive their training from a limited range of geographical areas, with 35.5% trained in the United Kingdom or Canada compared with just 5.4% from all countries in Africa and the Americas, excluding Canada (Fig. 1b).

Our data suggest that differences in country of doctoral training are not without consequence for the dynamics of the professoriate. Using the 10 years of observations in our data, we identified instances of attrition and estimated the annual per-capita risk of attrition for faculty trained in three groups of countries: Canada and the United Kingdom, the United States, and all others. Those with doctorates from Canada and the United Kingdom (n = 11,156) left their faculty positions at statistically indistinguishable rates compared with US-trained faculty (n = 238,676) in all 107 fields and eight domains, and at slightly lower rates overall (significance level α = 0.05, Benjamini–Hochberg-corrected χ2 test; Fig. 1c). In stark contrast, those with doctorates from all other countries (n = 20,689) left the US tenure track at markedly higher rates overall, in all eight domains and in 39 individual fields (36%), and in no field did such faculty leave at significantly lower rates (α = 0.05, Benjamini–Hochberg-corrected χ2 test; Fig. 1d). We note that our data allow us to consider hypotheses related only to country of doctoral training, not to country of citizenship or birth, leaving open questions about foreign-born yet US-trained faculty42.

Universal production inequality

For faculty with US doctorates, we find that academia is characterized by universally extreme inequality in faculty production. Overall, 80% of all domestically trained faculty in our data were trained at just 20.4% of universities. Moreover, the five most common doctoral training universities—UC Berkeley, Harvard, University of Michigan, University of Wisconsin-Madison and Stanford—account for just over one in eight domestically trained faculty (13.8%; Fig. 2a and Extended Data Table 3). Even when disaggregated into domains of study, 80% of faculty were trained at only 19–28% of universities (Fig. 2b).

Fig. 2: Universal inequality in the production of US-trained faculty.
figure 2

a, Proportions of US faculty produced by US universities, sorted by production rank, with the university producing the most faculty having a rank of 1 (n = 238,676 faculty; n = 387 universities). Quintiles of production are highlighted with alternating colours and annotated with the number of universities falling within each. By production, the first quintile contains only eight universities and the bottom contains 308. b, Lorenz curves for faculty production at the field level (coloured lines) and at the domain level (grey lines). A point is placed at the site along the domain-level Lorenz curve where 80% of faculty have been produced.

Source data

Our analysis shows that universities that employ more faculty generally also place more of their graduates as faculty elsewhere (Pearson ρ = 0.76, two-sided z-test P< 10−5). Nevertheless, at the level of domains and fields, faculty size alone cannot explain faculty production and placement: in academia as a whole, in all eight domains and in 91 of 107 fields (85%), faculty size and production are from significantly different distributions (Kolmogorov–Smirnov (K-S) test, Benjamini–Hochberg-corrected P< 10−5 for academia and domains, P< 0.01 for fields), reproducing the findings of a previous analysis of faculty hiring networks in Business, Computer Science and History27. For the remaining 16 fields (15%), the hiring of one’s own graduates plays a key role: when self-hires are excluded, the distributions of hiring and production of only 12 fields (11%) remain statistically indistinguishable. In other words, inequalities in university or department size do not explain inequalities in faculty production.

The Gini coefficient is a standard way to quantify inequality in a distribution, with G = 0 representing perfect equality and G = 1 maximal inequality. We find that inequality in faculty production across academia as a whole is both marked (G = 0.75) and greater than the inequalities in seven of eight domains. Of those domains, inequality is lowest in Education (G = 0.67) and Medicine and Health (G = 0.67) and highest in the Humanities (G = 0.77). Similarly, inequality in faculty production at the domain level is nearly always greater than production inequality among a domain’s constituent fields. For instance, whereas G = 0.73 for Engineering as a whole, Gini coefficients for the ten fields within Engineering range from 0.58 to 0.68 and, overall, Gdomain > Gfield for 104 of 107 fields (97%; Fig. 3a). Generally, field-level faculty production distributions are heavy tailed and the universities comprising those tails are similar across fields within a given domain and, more broadly, across domains. That is, measurements of inequality in domestic faculty production increase as aggregation or scale expands, because of university-level correlation in faculty production across related fields and domains.

Fig. 3: Differential attrition exacerbates inequalities in domestic faculty production.
figure 3

a, Line segments contrast the faculty production Gini coefficients calculated for newly hired faculty (filled circles; n = 54,100) and for existing faculty (open circles; n = 184,576) for each of the 107 fields (colours), eight domains (grey) and academia as a whole (blue). Line segments are grouped and coloured by domain. b, Annual Gini coefficients for academia and for each domain showing strong interyear consistency. c, Attrition risk as a function of university production rank by domain and for academia overall, via logistic regression, showing that university production rank is a significant predictor of annual attrition risk (two-sided t-test, Benjamini–Hochberg-corrected P < 0.05) such that faculty trained at high-producing universities leave academia at substantially lower rates than those trained at less productive universities. The empirical average annual attrition rates vary around the fitted curves.

Source data

Faculty production inequalities are rooted in hiring but are exacerbated by attrition. Computing the domestic production Gini coefficients separately for newly hired faculty and their sitting colleagues across our longitudinal data frame, we find uniformly larger inequalities for existing faculty in every field, every domain and in academia overall (Fig. 3a). However, cross-sectional Gini coefficients, computed separately for each year of observation, are stable over time, a pattern that rules out a simple cohort effect that would over time draw the Gini coefficients downward towards those of the newly hired faculty (Fig. 3b). Combined, these observations suggest that distributions of faculty production change after hiring in a manner that increases observed inequalities. We tested this hypothesis directly by modelling annual attrition risk as a function of faculty production rank. For academia as a whole, all eight domains and 49 of 107 fields (46%), we find substantially higher rates of attrition among faculty trained at those universities that already produce fewer faculty in the first place (logistic regressions, two-sided t-test, Benjamini–Hochberg-corrected P< 0.05). Put differently, most US-trained faculty come from a small number of universities and those who do not are nearly twice as likely to leave the professoriate on an annual basis (Fig. 3c).

Women on the tenure track

In addition to inequalities in production, our analysis expands on well-documented gender inequalities8. Whereas the majority of tenure-track US faculty in our data are men (64%), we find substantial heterogeneity by area of study with moderate change over time. For instance, between 2011 and 2020, women’s representation rose from 12.5 to 17.1% among faculty in Engineering and from 55.4 to 58.5% among faculty in Education (Fig. 4a). In fact, women’s representation significantly increased in academia overall, in all eight domains and in 80 (75%) of 107 fields (one-sided z-test, Benjamini–Hochberg-corrected P< 0.05; Fig. 4a). Nursing, a majority-women field, is the single instance in which the representation of women significantly decreased. The representation of women among faculty is thus generally increasing, even as women remain broadly under-represented.

Fig. 4: Changes in gender demographics of US faculty.
figure 4

a, Representation of women over time, coloured by domain and academia (n = 162,408 men, n = 89,429 women). b, Line segments contrast percentages of women among newly hired faculty (filled circles; n = 59,007) and women among all-cause attritions (open circles; n = 90,978) for each of the 107 fields (colours), eight domains (grey) and academia as a whole (blue). Line segments are grouped and coloured by domain. c, Representation of women by career age, quantified by years since doctorate, coloured by domain and for academia as a whole. Lines indicate empirical proportions, bands indicate 95% confidence intervals.

Source data

Changes in the overall representation of women over time could be driven by many factors, including demographic changes in new hires between 2011 and 2020 or simply demographic turnover—differences between those entering and those retiring or leaving the professoriate before retirement. Investigating these potential explanations we first found that, between 2011 and 2020, the proportion of women among newly hired faculty did not change significantly in 100 of 107 fields (93%) and significantly decreased in the remaining seven fields (7%). However, by comparing the inflows of new hires with the outflows of departing faculty over our decade of observation we found that, in academia, all eight domains, and 103 of 107 fields (96%),  newly hired faculty were substantially more likely to be women than their departing counterparts (Fig. 4b). This pattern in all-cause attrition is driven by dramatic demographic turnover, with retirement-age faculty skewing heavily towards men (Fig. 4c), implying that the overall increases in women’s representation over this period of time (Fig. 4a) are primarily due to changes in faculty hiring that predate our decade of observation. Importantly, the fact that women’s representation among new hires has remained flat over the past decade, combined with the observation that newly hired faculty are still more likely to be men (in academia, six of eight domains (75%) and 75 of 107 fields (70%); Fig. 4b), suggests strongly that future gender parity in academia—and especially in Science, Technology, Engineering and Mathematics (STEM) fields—is unlikely without further changes in women’s representation among new faculty.

Self-hiring

Professors who are employed by their doctoral university, called self-hires, account for roughly one in 11 (9.1%) of all US professors in our data (11% of US-trained professors). Whereas these rates remain generally low compared with other countries (for example, 36% in Russia44, 67% in South Africa29 and 73% in Portugal45), they are nevertheless consistently greater than would be expected under a network-based null model that randomizes hiring patterns while keeping faculty production (outflow) and faculty hiring (inflow) fixed46. Self-hiring rates were similarly higher than expected across individual fields, ranging from 1.1-fold higher in Theatre to 29.3-fold in Nursing. Self-hiring rates also vary considerably by domain, being lowest in the Humanities (4.5%) and Social Sciences (6.0%) and highest in Medicine and Health (16.7%; Fig. 5a).

Fig. 5: Self-hiring.
figure 5

a, Self-hiring rates overall (open bars; n = 295,089), for women (solid; n = 89,429) and for men (hatched; n = 162,408), by domain and across academia. Dots overlaid on open bars indicate the expected rate of self-hiring under the network-based null model. b, Ratios of average annual attrition risks among self-hires (n = 26,720) versus all other faculty (n = 268,369) for each field (colours), domain (grey) and academia (blue), on logarithmic axes. Circles, significantly different from 1.0, χ2 test, Benjamini–Hochberg-corrected P< 0.05; crosses, not significant.

Source data

Previous work found that women were self-hired at higher rates than men in Computer Science47. We find overall that 11.2% of women are self-hires compared with 8.2% of men (two-sided z-test for proportions, Benjamini–Hochberg-corrected P < 10−5; Fig. 5a). However, this effect is driven by a minority of fields: only 26 (24%) showed differences in self-hiring rates by gender (two-sided z-test for proportions, Benjamini–Hochberg-corrected P < 0.01), 25 of which featured more frequent self-hiring among women than men. These differences are particularly common in Medicine and Health, where in 12 of 18 fields women are self-hired at significantly higher rates than men.

We also find that self-hires are at greater risk of attrition than non-self-hires. In academia, self-hires in our data leave at 1.2-fold the rate of other faculty and rates are similarly elevated in all eight domains, as well as in 36 of 107 fields (34%; two-sided z-test for proportions, Benjamini–Hochberg-corrected P < 10−5 for academia, P < 0.05 for fields and domains; Fig. 5b). Relative rates of self-hire attrition are highest in Criminal Justice and Criminology and Industrial Engineering, at 1.9- and 1.8-fold the rate of other faculty, respectively. Only in Nursing was the relative rate of self-hire attrition significantly below 1.0 (0.9-fold). It is unclear what drives these differences but, given the ubiquity of self-hired faculty and differential rates of attrition, determining and addressing the causes of this phenomenon would have a wide impact.

Ubiquitous hierarchies of prestige

If a faculty hiring market were to follow a strict social hierarchy, no university would hire a graduate from a university less prestigious than its own—100% of faculty would hold positions of equal or lower prestige than their doctoral training. The extent to which empirical faculty hiring networks follow perfect hierarchies has direct implications for academic careers, the mobility of the professoriate and the flow of scientific ideas3,37. Treating the flows of faculty between US universities as a network leads to a natural, recursive definition of prestige: a department is prestigious if its graduates are hired by other prestigious departments. We apply the SpringRank algorithm48 to each faculty hiring network to find, in approximation, an ordering of the nodes (universities) in that network that best aligns with a perfect hierarchy; this ordering represents the inferred hierarchy of prestige.

Faculty hiring networks in the United States exhibit a steep hierarchy in academia and across all domains and fields, with only 5–23% of faculty employed at universities more prestigious than their doctoral university (Fig. 6a,b and Extended Data Table 4). Measured by the extent to which they restrict such upward mobility, these prestige hierarchies are most steep in the Humanities (12% upward mobility) and Mathematics and Computing (13%) and least steep in Medicine and Health (21%; Fig. 6b). We tested whether these steep hierarchies could be a natural consequence of inequalities in faculty production and department size across universities, using a null model in which we randomly rewired the observed hiring networks while preserving out-degree (placements) and in-degree (hires) and ignoring self-loops (self-hires)46. For each rewired network we re-ranked nodes using SpringRank and measured induced upward mobility as a test statistic (fraction of up-hierarchy edges; Methods). For academia as a whole, all domains and 94 of 107 fields (88%), empirical networks showed significantly steeper prestige hierarchies than their randomized counterparts (one-sided Benjamini–Hochberg-corrected P< 0.05; Fig. 6c and Extended Data Table 5). No field was significantly less steep, although networks in the fields of Pharmacy (P = 0.88), Immunology (P = 0.77) and Pathology (P = 0.73) were less steep than null model randomizations most frequently. In short, the prestige hierarchies that broadly define faculty hiring are universally steep, and often substantially steeper than can be explained by the ubiquitous and large production inequalities.

Fig. 6: Hierarchies of prestige.
figure 6

a, Prestige change from doctorate to faculty job in the US faculty hiring network (n = 238,281; Methods), with ranks normalized to the unit interval and 1.0 being the most prestigious. The proportions of faculty at universities less prestigious than their doctorate are annotated as 'move down' (open bars), at universities more prestigious than their doctorates as 'move up' (hatched) and at the same university as self-hires (solid). b, Rank change among faculty in the US faculty hiring network, by domain, using the same shading scheme as in Fig. 1a. c, Comparison between empirical hierarchies and those from 1,000 draws from a null model of randomly rewired hiring networks (Methods), quantified through upward mobility. Fields above the diagonal reference line exhibit steeper hierarchies than can be explained by department size and faculty production inequalities alone. Circles, Benjamini–Hochberg-corrected P < 0.05, network null model (Methods); crosses, not significant; no field was significantly less steep than expected. d, Heatmap of pairwise Pearson correlations between prestige hierarchies of fields.

Source data

Inferred prestige ranks of universities are also highly correlated across fields, suggesting that many factors that drive field-level prestige operate at the university level. Among pairwise correlations of university prestige rankings across fields, the overwhelming majority are positive (all but 116 of 12,024) and nearly half (48%) have a correlation >0.7 (Pearson’s ρ). Fields in Engineering, Mathematics and Computing, and Humanities are particularly mutually correlated whereas the field of Pathology is, on average, the least correlated with others (mean correlation 0.2).

Patterns across field-level 'top-10' most prestigious departments illustrate other aspects of the stark inequalities that define US faculty hiring networks. Among the 1,070 departments that are ranked top-10 in any field, 248 (23.2%) top-10 slots are occupied by departments at just five universities—UC Berkeley, Harvard, Stanford, University of Wisconsin-Madison and Columbia; fully 252 universities (64%) have zero top-10 departments. These findings show that, both within individual fields and across entire domains, faculty placement power is highly concentrated among a small set of universities, complementing the already enormous concentration of faculty production among the same set of universities (Fig. 2). Together, these patterns create network structures characterized by a closely connected core of high-prestige universities that exchange faculty with each other and export faculty to—but rarely import them from—universities in the network periphery (Extended Data Fig. 2).

As a result of both systematic inequality in production and steep social hierarchies, the typical professor is employed at a university that is 18% further down the prestige hierarchy than their doctoral training (Fig. 6a, Extended Data Table 6). Combined with sharply unequal faculty production (Fig. 2), this movement downward in prestige implies that the typical US-trained professor can expect to supervise 2.4-fold fewer future faculty than did their doctoral advisor. At the field level, the typical professor who moves downward descends by between 28% (Electrical Engineering) and 46% (Classics) of the prestige hierarchy whereas the typical professor who moves upward, of whom there are very few, ascends by between 6% (Economics) and 26% (Agronomy) of the hierarchy. There was no significant difference in mobility between men and women in 82 of 107 fields, but of the 25 fields in which mobility did differ by gender (two-sided z-test for proportions, Benjamini–Hochberg-corrected P< 0.05), women were less likely to move down the prestige hierarchy and more likely to be self-hires (Extended Data Table 6); 11 of those 25 fields were within the domain of Medicine and Health. However, we found no significant differences in the magnitudes of upward or downward movements between men and women for all fields (K-S test, Benjamini–Hochberg-corrected α = 0.05).

Prestige helps explain more than just the flows of faculty between US universities. For instance, across all domains, our analysis shows that sitting faculty are markedly more likely to be self-hires as prestige increases, yet this relationship is progressively weaker among younger faculty cohorts (Extended Data Fig. 3) and is either attenuated or not significant for new hires (two-sided t-test, Benjamini–Hochberg-corrected α = 0.05; Extended Data Fig. 4a). By contrast, new hires in all domains are substantially more likely to be trained outside the United States as prestige increases, yet this relationship is either attenuated, not significant or even reversed for sitting faculty (two-sided t-test, Benjamini–Hochberg-corrected α = 0.05; Extended Data Fig. 4b). Although we observe no common relationship across domains between prestige and gender, both new and existing faculty are more likely to be men as prestige increases for academia as a whole (two-sided t-test, Benjamini–Hochberg-corrected P< 0.05; Extended Data Fig. 4c). Together, these observations suggest complicated interactions between prestige and the processes of hiring or retaining women, one’s own graduates and graduates from abroad, patterns that complement previously observed effects of prestige on peer review outcomes49,50 and productivity34.

Discussion

As a whole, by domain and by field, US tenure-track faculty hiring is dominated by a small minority of US universities that train a large majority of all faculty and sit atop steep hierarchies of prestige. Just five US universities train more US faculty than all non-US universities combined. As we expand our view from fields to entire domains, inequalities in faculty production further increase, reflecting elite universities’ positions at or near the top of multiple correlated prestige hierarchies across fields. In principle, universities are on equal footing as both producers and consumers in the faculty hiring market. However, the observed patterns of faculty hiring indicate that the system is better described as having a universal core–periphery structure, with modest faculty exchange among core universities, substantial faculty export from core to periphery and little importation in the reverse direction or from outside the United States.

Although significant efforts have been made over many years to make faculty hiring practices more inclusive, our analysis suggests that many inequalities at the faculty hiring stage are later magnified by differential rates of attrition. For instance, our analysis showed higher rates of attrition among US faculty who were (1) trained outside the United States, Canada or the United Kingdom, (2) trained at universities that have produced relatively fewer faculty overall and (3) employed at their doctoral alma mater. Combined with our observations of unchanging proportions of these groups over time, these differential attrition rates suggest a dynamic equilibrium of countervailing patterns of hiring and attrition. Identifying the causes of these elevated attrition rates is likely to provide insights and opportunities to improve retention strategies for faculty of all kinds.

Our analyses of the hiring and retention of women faculty point to stalled progress towards equal representation. Whereas women’s overall representation has increased steadily across all eight broad domains of study, women nevertheless remain under-represented among new hires in many fields, particularly in STEM, and women’s representation among newly hired faculty over the past decade has generally been flat. As a result, the continued increase in women’s overall representation can instead be attributed to the disproportionate number of men among retiring faculty, across all domains. Continued increases in women’s representation among faculty are therefore unlikely if the past decade’s pattern remains stable.

Around one in 11 US professors are employed by their doctoral university. Such high rates of self-hiring across fields and universities are surprising, because academic norms treat self-hiring negatively—for example, it is sometimes called 'academic inbreeding'51. Elevated self-hiring rates may indicate an unhealthy academic system52 because self-hiring restricts the spread of ideas and expertise3, and many decades of study suggest that it can correlate with lower quantity and quality of scholarship53,54. In this light, the sharply elevated rates of self-hiring at elite universities present a puzzle51, with uncertain epistemological consequences, yet these trends seem to be driven less by recent new hires and more by attrition or hiring patterns preceding our decade of observation. Overall, high rates of self-hiring persist in spite of (not because of) differential rates of attrition, with self-hires leaving US academia at higher rates in most fields, all domains and academia overall.

Our analyses describe system-wide patterns and trends, and hence say little about individual faculty experiences or the causal factors that predict the outcomes of individual faculty placements in the US academic system55. At best, our results provide statistical estimates for the direction and distance of faculty placements up or down a field’s prestige hierarchy, and they should not be used to inform or shape expectations of real hiring decisions. In other words, even though there are clear and strong patterns at the system level, the considerable variance in outcomes at the individual level shows that pedigree is not destiny.

One limitation of the present work is that, although doctoral universities were known, doctoral departments were not. Hence, our estimates of self-hiring rates reflect faculty employed by any department at their doctoral university, but not necessarily by their doctoral department. Our analyses therefore estimate only upper bounds on department-level self-hiring. Similarly, our estimates of production and prestige inequalities in individual fields reflect the volume and power of universities placing faculty into those fields, but not necessarily the volume of graduates produced by those fields or the related fields into which they may be hired26.

Our data also lack self-identified demographic characteristics and national origin, which limits the conclusions we may draw about the interaction between faculty hiring and representation by race, gender, socioeconomic background and nationality, and any intersectional analyses thereof. For instance, whereas we observe that faculty trained outside the United States constitute 2–19% of US faculty across domains, the fraction of US faculty born outside the United States is considerably higher42. Given our identification of markedly higher attrition rates for faculty trained outside the United States, Canada and the United Kingdom, an investigation of attrition by national origin could help identify its causes and address its differential impacts. Our approach also relies on cultural associations between name and binary man–woman genders, leaving the study of self-identified and more expansive identities, as well as intersectional representation more broadly, as open lines of enquiry.

Although our analysis shows that the clear cross-sectional patterns in faculty demographics and hiring networks are shaped by complex and evolving patterns of hiring and attrition alike, our analysis does not causally identify the mechanisms responsible. Our observations of clustered patterns among fields within the same domain suggest a role for domain-level macrocultures56. Strong correlations between a university’s ranks across different fields may indicate status signalling57, the impacts of elite universities’ resources on individuals’ productivity and prominence34, or other factors entirely. And, clear cohort effects—particularly in the representation of women—show non-stationarity in the patterns we observe and in the latent factors that drive them. Critically, future progress in understanding the causal factors shaping the US professoriate must investigate the factors that drive differential attrition, including those related to social identity, doctoral training (both abroad and domestically) and university of employment. Understanding the underlying causes of these differential attrition rates would surely inform efforts and policies aimed at mitigating social inequalities by improving equity and representation, which is likely to shape what discoveries are made and who makes them.

Methods

Data preparation overview

The data used in our analyses are based on a census of the US academic market obtained under a data use agreement with AARC. That unprocessed dataset consisted of the employment records of all tenured or tenure-track faculty at all 392 doctoral-degree-granting universities in the United States for each year between 2011 and 2020, as well as records of those faculty members’ most advanced degree. We cleaned, annotated and preprocessed that unprocessed dataset to ensure consistency and robustness of our measurements, resulting in the data used in our analyses.

Cleaning the original dataset involved nine steps, which were performed sequentially. After cleaning, we augmented the processed dataset with two pieces of extra information to enable further analyses of faculty and universities, by annotating the country of each university and the gender of each professor. The nine preparation steps and two annotation steps are described below.

Data preparation steps

The first step in preparing the dataset was to de-duplicate degree-granting universities. These universities are in our data either because they were 'employing' universities covered by the AARC sample frame (all tenure-track faculty of US PhD-granting universities) or because they were 'producing' universities at which one or more faculty members in the AARC sample frame obtained their terminal degree (university, degree, year). Producing universities include those based outside the United States and those that do not grant PhDs. Thus, due to the AARC sample frame, all employing universities are US-based and PhD granting, and this set of 392 universities did not require preprocessing. On the other hand, producing universities—those where one or more employed faculty earned a degree—may or may not be PhD granting and may or may not be located in the United States.

Producing universities were cleaned by hand: instances in which single universities were represented in multiple ways ('University of Oxford' and 'Keble College', for example) were de-duplicated and, in the rare instances in which a degree referenced an unidentifiable university ('Medical University, England', for example), the degrees associated with that 'university' were removed but the individuals holding those degrees were not removed.

The second step in preparing the dataset was to clean faculty members' degrees. Terminal degrees are recorded for 98.2% of faculty in the unprocessed data: 5.7% of these degrees are not doctorates (5.3% are Master’s degrees and 0.4% are Bachelor’s degrees). We treated all doctoral degrees as equivalent—for example, we drew no distinction between a PhD and a D.Phil. We note that faculty without doctorates are distributed unevenly throughout academia, with members in the Humanities and Applied Sciences being least likely to have a doctoral degree (Extended Data Fig. 1).

Faculty without doctorates were included in analyses of gender. They were also included in the denominators of self-hiring rate calculations but, possessing no doctorates, they were never considered as potentially self-hires, themselves. Faculty without a doctorate were not included in analyses of production and prestige, which were restricted to faculty with doctorates.

The third step in preparing the dataset was to identify and de-duplicate departments. We ensured that no department was represented multiple different ways, by collapsing records due to (1) multiple representations of the same name (for example, 'Computer Science Department' versus 'Department of Computer Science') and (2) departmental renaming (for example, 'USC School of Engineering' versus 'USC Viterbi School of Engineering'). Although rare instances of the dissolution or creation of departments were observed, we restricted analyses that did not consider time to those departments for which data were available for a majority of years between 2011 and 2020, and restricted longitudinal analyses to only those departments for which data were available for all years.

The fourth step in preparing the dataset was to annotate each department according to a two-level taxonomy based on the field (fine scale) and domain (coarse scale) of its focus. This taxonomy allowed us to analyse faculty hiring at both levels, and to compare patterns between levels. Extended Data Table 1 contains a complete list of fields and domains.

Most departments received just one annotation, but some received multiple annotations due to their interdisciplinarity. This choice was intentional, because the composition of faculty in a 'Department of Physics and Astronomy' is relevant to questions focused on the composition of both ('Physics, Natural Sciences') and ('Astronomy, Natural Sciences'). On the basis of this premise, we include both (or all) appropriate annotations for departments. For instance, the above hypothetical department and its faculty would be included in both Physics and Astronomy analyses. The basic unit of data in our analyses is therefore the individual–discipline pair. A focus on the individual would be preferable, but would require taxonomy annotations of individuals rather than departments—information we do not have. Furthermore, many individuals are likely to consider themselves to be members of multiple disciplines.

Whenever a university had multiple departments within the same field, those departments were considered as one unit. To illustrate how this was done, consider the seven departments of Carnegie Mellon’s School of Computer Science. All seven departments were annotated as Computer Science and treated together in analyses of Computer Science.

Some fields have the potential to conceptually belong to multiple domains. For example, Computer Engineering could be reasonably included in the domain of either Formal Sciences (which includes Computer Science) or Engineering (which includes Electrical Engineering). Similarly, Educational Psychology could be reasonably included in the domain of Education or of Social Sciences. In these instances, we associated each such field with the domain that maximized the fraction of faculty whose doctoral university had a department in that domain. In other words, we matched fields with domains using the heuristic that fields are best associated with the domains in which their faculty are most likely to have been trained.

The fifth step in preparing the dataset was to remove inconsistent employment records. Rarely, faculty in the dataset seem to be employed at multiple universities in the same year. These cases represent situations in which a professor made a mid-career move and the university from which they moved failed to remove that professor from their public-facing records. We removed such spurious and residual records for only the conflicting years, and left the records of employment preceding such mid-career moves unaltered. This removed only 0.23% of employment records.

The sixth step in preparing the dataset was to impute missing employment records. Rarely, faculty disappear from the dataset only to later reappear in the department they left. We considered these to be spurious 'departures', and imputed employment records for the missing years using the rank held by the faculty before becoming absent from the data. Employment records were not imputed if they were associated with a department that did not have any employment records in the given year. Imputations affected 1.3% of employment records and 4.7% of faculty.

The seventh step in preparing the dataset was to exclude non-primary appointments such as professors’ associations or courtesy/emeritus appointments with multiple departments. We identified primary appointments by making the following two assumptions. First, if a professor was observed to have just one appointment in a particular year, then that was their primary appointment for that year—as well as for any other year in which they held that appointment (including years with multiple observed appointments). This corresponds to a heuristic that faculty should appear on the roster of their primary unit before appearing on non-primary rosters. Second, if a professor was observed to have appointments in multiple units, and a promotion (for example, from Assistant Professor to Associate Professor) was observed in one unit’s roster but not in another’s, it was assumed that the non-updating unit is not a primary appointment. This corresponds to a heuristic that, if units vary in when they report promotions, it is more likely that the primary unit is updated first and thus units that update more slowly are non-primary.

Primary appointments could not be identified for 1.2% of faculty, and 5.5% of appointments were classified as non-primary. Field- and domain-level analyses were restricted to primary appointments, but analyses of academia included faculty regardless of whether their primary appointment(s) could be identified, under the assumption that employment in a tenure-track position implies having some primary appointment, identifiable or not.

The eighth step in preparing the dataset was to carefully handle employment records with mid-career moves so that each faculty member was associated with only a single employing university. Mid-career moves do not alter a professor’s doctoral university or gender, and so cannot affect measurements such as a discipline’s faculty production Gini coefficient, its gender composition or the fraction of faculty within the discipline that holds a degree from outside the United States. However, mid-career moves have the potential to alter a discipline’s self-hire rate and the steepness of its prestige hierarchy. This raises important questions for how one should treat mid-career moves when performing calculations that average over our decade of observations—should one analyse the appointment before or the appointment after the move(s)?

First we chose to use, whenever possible, the most recent employing university of each professor. In other words, if a professor was employed at multiple universities between 2011 and 2020, only that university where they were most recently employed was considered. Second, we checked that this choice did not meaningfully affect our analyses of self-hiring or prestige, because 6.9% of faculty made a mid-career move within our sample frame. To evaluate the impact of this choice on self-hiring analyses, we first calculated self-hiring rates on the basis of faculty members’ first employing university (that is, their pre-mid-career-move university if they had a mid-career move). We then calculated self-hiring rates on the basis of faculty members’ last employing university (that is, their post-mid-career-move university if they had a mid-career move). Comparing these two estimates we found that, across all 107 fields, eight domains and academia, mid-career moves had no significant effect on our measurements of self-hiring rates (two-sided z-test for proportions, α = 0.05, n = 295,089 faculty in both samples). To evaluate the impact of this choice on prestige hierarchies, we first calculated the upward mobility in rank-sorted faculty hiring networks on the basis of faculty members’ first employing university (that is, their pre-mid-career move university if they had a mid-career move). We then followed the same procedure but on the basis of faculty members’ last employing university (that is, their post-mid-career move university if they had a mid-career move). Comparing these two approaches, we found that mid-career moves did not significantly alter upward mobility in any field or domain (two-sample, two-sided z-test for proportions, Benjamini–Hochberg-corrected α = 0.05; see Extended Data Table 1 for n). At the academia level, taking the most recent university rather than the first university among mid-career moves resulted in 0.7% more upwardly mobile doctorate-to-faculty transitions (two-sample, two-sided z-test for proportions, Benjamini–Hochberg-corrected P < 0.05, n = 238,281 in both samples).

The ninth and final step in preparing the dataset was to exclude departments that were inconsistently sampled. Not all departments in the unprocessed dataset were recorded by the AARC in all years, for reasons outside the control of the research team. To ensure robustness of results, we restricted our analyses that did not consider time to those departments that appeared in a majority of years between 2011 and 2020. This resulted in the removal of 1.8% of employment records, 3.4% of faculty and 9.1% of departments. Additionally, 24 employing universities (6.1%) were excluded by this criterion, most of which were seminaries.

Annotations

The country of each producing university was determined by hand. First, Amazon Mechanical Turk was used to gather initial annotations. Each university was annotated by two different annotators. Inter-annotator agreement was >99% and disagreements were readily resolved by hand. To ensure no errors, a second pass was completed by the researchers and resulted in no alterations.

Self-identified gender annotations were provided for 6% of faculty in the unprocessed dataset. To annotate the remaining faculty with gender estimates, we used a two-step process based on first and last names. First, complete names were passed to two offline dictionaries: a hand-annotated list of faculty employed at Business, Computer Science and History departments (corresponding to the data used in ref. 27) and the open-source python package gender-guesser58. Both dictionaries responded with one of the following classifications: female, male or unable to classify. Second, for cases in which the dictionaries either disagreed or agreed but were unable to assign a gender to the name, we queried Ethnea59 and used the gender to which they assigned the name (if any). Using this approach we were able to annotate 85% of faculty with man or woman labels. Faculty whose names could not be associated with a gender were excluded from analyses of gender but included in other analyses. This methodology associates names with binary (man/woman) labels because of technical limitations inherent in name-based gendering methodologies, but we recognize that gender is non-binary. The use of these binary gender labels is not intended to reinforce a gender binary.

Per-analysis inclusion criteria

The prepared and annotated dataset contained 295,089 individuals employed at 368 universities, and was used as the basis of all of our analyses. In some analyses, further inclusion criteria were applied but with the guiding principle that analyses should be as inclusive as possible and reasonable. For example, analyses of the professoriate by gender considered only faculty with a gender annotation but did not require members to hold a doctorate. Analyses of prestige, on the other hand, considered only those faculty with doctorates from US universities but did not require that faculty have a gender annotation. The aim of these inclusion criteria was to ensure the robustness of results while simultaneously being maximally inclusive. When an analysis fell into more than one of the above categories, inclusion criteria for all categories were applied. For example, when analysing changes in US faculty production over time, inclusion criteria for analyses of both US faculty production and over time were applied.

Some fields and domains were excluded from field- or domain-level analyses, either because they were too small or because they were insufficiently self-contained. Faculty in excluded fields were nevertheless included in domain- and academia-level analyses, and those in excluded domains were nevertheless included in academia-level analyses (Extended Data Table 2).

Two domains were excluded from domain-level analysis: (1) Public Administration and Policy and (2) Journalism, Media and Communications. These domains were excluded because they employed far fewer faculty than other domains, and because their inclusion made domain-level comparisons difficult.

Fields were included in field-level analyses only if (1) at least 25% of universities had a department in that field or (2) the number of faculty with a primary appointment in that field, and who also earned their doctorate from a university that had a department in that field, was ≥500. These requirements were intended to ensure the coherence of fields for analyses of production and prestige. For information on the number of faculty excluded from field- and domain-level analyses, see Extended Data Table 2.

Analyses of production and prestige included only faculty who hold a US doctorate. Faculty without a doctorate are a small minority of the population in most fields, and were excluded because their degrees are not directly comparable to doctorates. Faculty with non-US doctorates were excluded because the universities that produced them are outside the sample frame.

For all longitudinal analyses, we required departments to be sampled in all years between 2011 and 2020 to ensure consistency in the sample frame. This resulted in the removal of 5.9% of employment records, 7.2% of faculty and 12.6% of departments for those analyses. Additionally, 15 employing universities (4.1%) were excluded by this criterion.

Identification of new hires

Some analyses required us to divide faculty into two complementary sets: new hires and existing faculty. For analyses that aggregated faculty over our decade of observation, we labelled faculty as new hires if they earned their degree within 4 years of their first recorded employment as faculty. Thus defined there are 59,007 new faculty, making up 20.0% of the faculty in the dataset. Our longitudinal analyses were more strict, such that faculty were labelled as new only in their first observed year of employment, but were considered as existing faculty for each observed year thereafter.

Identification of attrition and calculation of attrition risk

A professor who leaves academia for any reason constitutes an attrition, including retirement, termination of employment for any reason, acceptance of a position outside our sample frame (for example, in industry, government or a university outside the United States) or death. Our unprocessed data do not allow us to identify reasons for attrition. A professor’s last year of employment is considered the year of their attrition when counting attritions over time. Faculty who change disciplines are not considered to be attritions from disciplines they leave. Because attritions in a given year are identified through comparison with employment records in the next, attrition analyses do not include the final year of the sample frame (2020). Faculty were counted as an attrition at most once; a professor who appeared to leave multiple times was considered an attrition only on exiting for the last time.

Attrition risk is defined, for a given set of faculty in a given year, as the probability that each professor in that set failed to appear in the set in the next year—that is, the proportion of observed leaving events among possible leaving events on an annual basis. Thus, all attrition risks as stated in this study are annual per-capita risks of attrition. Average annual attrition risks were formed by counting all attrition events and dividing by the total person-years at risk.

Faculty hiring networks

Faculty hiring networks represent the directed flows of faculty from their doctoral universities to their employing universities. As such, each node in such a network represents a university and each weighted, directed edge represents the number of professors trained at one university and who are employed at the other. For the purposes of the faculty hiring networks analysed here, we restrict the set of nodes to, at most, those employing universities within the AARC sample frame. This means that nodes representing non-US universities are not included, and therefore the edges that would link them to in-sample universities are also not included. Without loss of generality, we now describe in more precise detail the creation of a particular field’s faculty hiring network, but this process applies equivalently for both domains and academia as a whole.

First, universities were included in a field only if they had a unit (for example, a department, or departments) associated with that field. As a result, a university appears in the rankings for a field only if it has a representative unit; without a Department of Botany, a university cannot be ranked in Botany. Second, ranks are identifiable from patterns in faculty hiring only if every unit employs at least one individual in that field who was trained at a unit that also employs faculty in that field. Phrased from the perspective of the faculty hiring network, this requirement amounts to ensuring that the in-degree of every node is at least one. Because the removal of one unit (based on the above requirements) might cause another to fail to meet the requirements, we applied this rule repeatedly until it was satisfied by all units.

The outcome of this network construction process is a weighted, directed multi-graph A(k) such that: (1) the set of nodes i = 1,2,... represent universities with a department or unit in field k. (2) The set of edges represent hiring relationships, such that \({A}_{ij}^{(k)}\) is an integer count of the number of faculty in field k who graduated from i and are employed at j. Thus A(k) is a positive, integer-weighted, non-symmetric, network adjacency matrix for field k. (3) The out-degree \({d}_{i}^{(k)}={\sum }_{j}{A}_{ij}^{(k)}\) is greater than or equal to one for every node i, meaning that every university has placed at least one graduate in field k. (4) The in-degree \({d}_{j}^{(k)}={\sum }_{i}{A}_{ij}^{(k)}\) is greater than or equal to one for every node j, meaning that every university has hired at least one graduate from field k.

To infer ranks in faculty hiring networks meeting the criteria above, we used the SpringRank algorithm48 without regularization, producing a scalar embedding of each network’s nodes. Node that embeddings were converted to ordinal rank percentiles. (In principle, embeddings may produce ties requiring a rule for tie-breaking when converting to ordinal ranks. However, no ties in SpringRanks were observed in practice).

To determine whether properties of an empirically observed hierarchy in a faculty hiring network could be ascribed to its in-degree sequence (unit sizes) and out-degree sequence (faculty production counts) alone, we generated an ensemble of n = 1,000 networks with identical in- and out-degrees that were otherwise entirely random, using a degree-preserving null model called the configuration model46,60. We excluded self-hires (that is, self-loops) from randomization in the configuration model for a subtle but methodologically important reason. We observed that self-hires occur at much higher rates in empirical networks than expected under a configuration model. As a result, were we to treat self-hires as links to be randomized, the process of randomization would, itself, increase the number of inter-university hires from which ranks were inferred. Because of the fact that SpringRank (or an alternative algorithm) infers ranks from inter-university hires, but not self-hires, the act of 'randomizing away' self-hires would thus distort ranks, as well as the number of potential edges aligned with (or aligned against) any inferred hierarchy. In short, randomization of self-hires would, in and of itself, distort the null distribution against which we hope to compare, dashing any hope of valid inferences to be drawn from the exercise. We note, with care, that when computing the fraction of hires violating the direction of the hierarchy, either empirically or in the null model, we nevertheless included self-hires in the total number of hires—that is, the denominator of said fraction. These methodological choices follow the considerations of the configuration model 'graph spaces' introduced by Fosdick et al.46.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.