Effects of homophily and academic reputation in the nomination and selection of Nobel laureates

Gallotti, Riccardo; De Domenico, Manlio

doi:10.1038/s41598-019-53657-6

Download PDF

Article
Open access
Published: 21 November 2019

Effects of homophily and academic reputation in the nomination and selection of Nobel laureates

Scientific Reports volume 9, Article number: 17304 (2019) Cite this article

4423 Accesses
15 Citations
47 Altmetric
Metrics details

Subjects

Abstract

In collective decision-making, a group of independent experts propose individual choices to reach a common decision. This is the case of competitive events such as Olympics, international Prizes or grant evaluation, where groups of experts evaluate individual performances to assign resources, e.g. scores, recognitions, or funding. However, there are systems where evaluating individual’s performance is difficult: in those cases, other factors play a relevant role, leading to unexpected emergent phenomena from micro-scale interactions. The Nobel assignment procedure, rooted on recommendations, is one of these systems. Here we unveil its network, reconstructed from official data and metadata about nominators, nominees and awardees between 1901 and 1965, consisting of almost 12,000 individuals and 17,000 nominations. We quantify the role of homophily, academic reputation of nominators and their prestige neighborhood, showing that nominees endorsed by central actors – who are part of the system’s core because of their prestigious reputation – are more likely to become laureate within a finite time scale than nominees endorsed by nominators in the periphery of the network. We propose a mechanistic model which reproduces all the salient observations and allows to design possible countermeasures to mitigate observed effects.

Quantifying hierarchy and prestige in US ballet academies as social predictors of career success

Article Open access 30 October 2023

Scientific success from the perspective of the strength of weak ties

Article Open access 24 March 2022

The Nobel Prize time gap

Article Open access 11 November 2022

Introduction

The will of the Swedish inventor Alfred Nobel established that an annual prize in his name must be awarded “to those who, during the preceding year, shall have conferred the greatest benefit to mankind” in Chemistry, Literature, Peace, Physics, and Physiology or Medicine¹. The overall process consists of three phases (Fig. 1a): (i) nominators are invited by each category’s committee, with the exception of the Peace Prize that is open to a selected list of qualified institutional members, and include all former Nobel laureates² (see also Supplementary Fig. 1); (ii) nominees for the award are proposed by competent nominators; (iii) the laureates are selected among the nominees, by Scandinavian-based committees under Nobel’s mandate that “in awarding the prizes no consideration whatever shall be given to the nationality of the candidates”. Since when the first Nobel Prizes were awarded in 1901, they quickly become globally acknowledged as the most important accolade in their fields of knowledge. In fact, winning a Nobel Prize is synonym of scientific success^3,4,5 and of a globally recognized reputation⁶, that is used by academics and their institutions to quantify their prestige⁷, enhance their rank (e.g., the Shanghai Ranking⁸ scores the presence of “alumni and staff who have won Nobel Prizes and Fields Medals”) and, consequently, boosting their economic growth in a global knowledge economy⁹. At the same time, Nobel laureates also represent role models for the future generations and thus an opportunity for facilitating the vocation of sexual, gender, or color minorities in science¹⁰. Rapidly, the time between the publication of the awarded work and the conferment of the prize increased for all disciplines way over the single year suggested by Nobel’s will, as the awards increasingly recognised achievements that had withstood the test of time^1,11,12. Nonetheless, the Prize has also been criticized for its winner-takes-all philosophy which has been also seen as main source of most of controversies associated with this institution¹³.

Unfortunately, quantifying academic performance is a long-standing problem, resulting in decades of research devoted to develop a wide spectrum of descriptors^14,15,16. The lack of consensus on performance metrics makes it difficult to identify and rank academics, requiring alternative procedures such as recommendation networks. In fact, the academic endeavour is strongly based on social relationships of different nature¹⁷ and several social dynamics concur during all evaluation and assignment procedures. On the one hand, homophily is expected to be a fundamental mechanism: academics tend to better know other academics from the same institution, from the same country, or who work on the same topics. Indirectly, the numerosity of specific sub-groups (by nationality, gender, etc) coupled with homophily, naturally generates the hegemony of those sub-groups. On the other hand, competition for prestige and reputation is a natural mechanism¹⁸: academics seldom recommend direct competitors in specific fields.

Moreover, there is evidence that strong collaborations have a significant positive impact on productivity and citations¹⁹ – the apostle effect – and that author’s reputation significantly drives a paper’s citation count early in its citation life cycle⁶. Experience plays a key role in the academic community. In fact, it has been recently shown that the chaperone effect characterizes publishing in high-impact venues²⁰, and that genealogical and coauthorship networks are good predictors of who wins multiple prizes, driving a system where the boundaries of science are pushed by small group of scientific elites²¹.

Given the scientific, economical and political impact of winning a Nobel Prize, it is natural to ask to which extent such mechanisms – namely homophily and reputation – are influenced by, and influence, the Nobel assignment process. Other factors, such as the sociological effects of winning a Nobel Prize²² and the patterns of productivity, collaboration, discovery, and authorship of nobel laureates have been the subject of intense research activity across half a century^{5,12,23,24,25} leading the emerging field of “science of science²⁶”, while little attention has been dedicated to the Nobel nomination and selection mechanisms^27,28. Here, we examine the impact of these mechanisms using the tools of network science and advanced statistics to provide compelling evidence for the emergence of four types of hegemony – political, gender, nationalistic, and prestige – influencing the three different phases of the assignment process (Fig. 1a). Similarly to other recent works on science of science²⁶, our intent here is to build upon theoretical concepts and social processes drawn from the sociology of science^29,30, isolating from large-scale data sources the power relationships present in the scientific community. This perspective is particularly relevant considering that the social structure and culture present in the scientific community influences the output of scientific knowledge produced, and that these social mechanisms can be controlled by closed circles or external forces³¹ with the undesirable effects of influencing the validity of the scientific results and overall retarding the quest for scientific knowledge by limiting the development and diffusion of new methodological or epistemological models.

To this aim, we have gathered data and metadata from the official Web page² about nominators and nominees involved in the Nobel assignment procedure between 1901 and 1965, as well as about the Nobel laureates between 1901 and 2016. Both datasets have been cross-checked for inconsistencies and manually corrected where needed according to other manually curated sources, such as Wikipedia. In the data gathered, gender is indicated as a binary field – Female (F) or Male (M) – while nationality might change across time. For sake of simplicity, every person or organisation has been associated with only a single country with a majority rule.

To model the intricate web of nomination relationships, we build two networks^32,33. One network consists of individuals, nominators and nominees, who are linked together by a nomination. For instance, Erwin Schrödinger (the nominator) nominated Erich Regener, Wolfgang Pauli and Enrico Fermi (the nominees) in 1938: in our model, three outgoing links are assigned to Schrödinger, each one pointing towards a different nominee. The second network consists of countries: a directed link is assigned to the countries to which nominator and nominees belong, with connections being weighted by the volume of nominations. For instance, a weight of one is assigned to the link from Germany (represented by Schrödinger) to Italy (represented by Fermi), whereas weight 2 is assigned to the link from United States of America to Italy, because of the nominations from Arthur H. Compton and Clinton J. Davisson to Fermi in the same year. Figure 1b,c illustrates additional examples.

To understand how homophily and reputation might affect the assignment process, we devise a model which describes (i) the progressive growth of the nomination-nominee network and (ii) the periodic assignment of an award. Our model successfully reproduces some empirical findings, such as the highly modular structures observed in the data or the central role played by prestigious scholars. Analyzing different scenarios, we illustrate how the nomination-selection process is potentially very efficient in selecting high quality laureates, but at the same time tends to perpetuate the privileges of hegemonic groups.

Results

Political homophily

The rationale behind the invitation process is to broaden the representation of different countries and universities, while keeping the nominators pool restricted to qualified persons only². The nominators’ selection is however influenced by international political relationships and prestige, as highlighted by the limited number of Russian nominators (114 in total, less than 10% of Americans, Germans, or French ones – see also Supplementary Fig. 2). To label this type of effect we use in this paper the term political homophily, which has to be intended here in the strict sense of a homophilic effect between countries sharing similar views about world politics, or economical and societal issues. Another example of political homophily is identified in correspondence of the political tensions surrounding World War II. The war indeed appear to have shocked the equilibrium of the international scientific community: if before the war the international prestige was mostly accumulated by german scholars, after the war the scientific world rewired itself into a more american-centric network (see Fig. 2 and Supplementary Fig. 3). This shock can be observed in the period between 1936^a^{Footnote 1} and 1948 for Germany, but also during the war for the German-controlled France. In these periods, nominators of these two countries have been largely excluded from the process (Figs. 2 and 3). This created a change in the nomination network before and after World War II, as the larger nominators pool taken by Germany for almost 40 years got quickly obscured by USA, which increased their weight during the war and then dominated the successive period. This naturally reflected on the nominees and laureates (Fig. 2b,c). In Fig. 3 and Supplementary Fig. 6 we disaggregate this view on the different Nobel categories, observing how in time, USA emerges with an increasing growth in both nominators and nominees pools, with the only exception of the Nobel Prize assigned to Literature. Remarkably, the overall number of American Nobel laureates grows differently from the one of all other countries (Fig. 2c). The different scaling behaviour (Supplementary Fig. 7) suggests a type of Matthew effect – also observed in other scientific contexts^34,35,36,37 – which favors cumulative advantage of candidates with high prestige while reducing the visibility and the opportunities of less known nominees.

Gender homophily

The Nobel Prizes assigned to women are few and far between¹⁰. Even after accounting for the underlying under-representation of women in the scientific disciplines, the assignment of Nobel Prizes is significantly favouring men³⁸. In the period we consider (1901–1965) we restricted the analysis to the 15668 nominations where both the gender of the nominators and of the nominees was correctly identified. In this sample, women constitute 5.0% of nominees and 3.7% of laureates but only 1.8% of nominators, highlighting natural limitations for women to enter in the nominators’ pool. To investigate the role played by gender homophily in the nomination process, we count the fraction of links between nodes of the same (F → F and M → M) or different genders (F → M and M → F) and compare them with a null model where the gender is randomly shuffled among the nodes. The results, displayed in Table 1, show without any doubt^b^{Footnote 2} significant effects of gender homophily the nomination process (Z-Score ≈ 13). The effect is symmetrical, as both genders equally favour intra-gender links with respect to the null-model.

Table 1 Intra- and inter- gender links in the nominator-nominee network.

Full size table

Nationalistic homophily

Through their nominations, recognized experts propose and support the candidate who, in their opinion, deserves the most the Nobel Prize. However, candidatures are more likely towards fellow academics from the same country. To quantify this effect during the nomination process, we use the network of nominations at country level, aggregated across time. Our analysis reveals a large fraction of nominations among individuals from the same country: the level of clustering into communities is quantified by network modularity³⁹ – calculated with respect to a country-based partition – for which a value of 0.38 is measured. Considering each Nobel category separately, the highest modularity (0.44) is observed for Physiology and Medicine and for Literature, while lowest values characterize Chemistry (0.34), Peace (0.32), and Physics (0.28). These high values indicate that the fraction of nominations within the same country exceeds what would be expected by chance, highlighting the existence of a nationalistic homophily, which appears to depend on the historical moment (see Fig. 4a, and Supplementary Figs. 3 and 4 for a comparison before and after World War II). This type of homophily reflects in the country distribution of the nominees, which is therefore strongly related to the committee choices of nominators. To verify this claim, we measure the evolution of the nominator and nominees pool countries with the Kullback-Liebler divergence (see Methods) between the distribution of countries in two consecutive years. The results shown in Fig. 4b confirm our expectation as the yearly evolution of the nominators and nominees pools is significantly correlated (Spearman r = 0.47). Consequently, as the nominators pool gets progressively concentrated in a few countries, the nationalistic homophily propagates this concentration to the nominees. Indeed, measuring the statistical dispersion of both distributions across time by means of the Gini coefficient (see Methods), a widely adopted index of diversity⁴⁰, we observe how the candidatures increasingly concentrate in fewer countries (Fig. 4c) regardless of the Nobel category (Supplementary Fig. 8). The trend is the same for both nominators and nominees, and the two dynamics exhibit a highly significant correlation (Spearman r = 0.68). These results show that the pool of nominees strongly depends on the nominator pool, a fact that contributes to dramatically alter the probability that a nominee will become a Nobel laureate.

Academic reputation

The committees, supported by specially appointed experts, choose the laureates among the nominees, in a process influenced by the committee members expertise and preferences²⁷. The number of nominations, on average five for awardees and two for non-awardees, is likely to play a role in the process. However, we have isolated an important effect due to prestige: the Nobel committee attributes greater accuracy to the opinion of former Nobel laureates, and it is particularly important if the initial candidature is endorsed by former Nobel laureates.

Such candidatures are indeed dramatically advantaged with respect to those ones not initially endorsed by Nobel laureates (Fig. 5a,b and Supplementary Figs. 9 and 10). To further test the hypothesis that the observed effect is genuine, we studied the nomination network (Fig. 5c) at individual level, to gain insight from the microscopic analysis of the Nobel assignment system. We find that Nobel laureates in Physics, Chemistry and Medicine are part of a scientific elite (Fig. 5d), constituting the system’s core and counting 363 individuals in the largest connected component of the nomination network. To quantify the chance of this observation, we have reshuffled the Nobel Prize assignments 50,000 times and counted, each time, the number of Nobel laureates in the largest cluster^32,33. The random expectation, compatible with the null hypothesis that the endorsement of former Nobel laureates is not a discriminating factor, is 314.5 ± 6.8: remarkably, the empirical value is more than 7 standard deviations from the mean (p–value ≈ 10⁻¹²), confirming the significant presence of a core (Fig. 5e). The authority of Nobel laureates thus induces a sort of social influence that is reflected in the importance given by the Nobel committee to their nominations, thus affecting the collective judgement⁴¹.

Similarly, another indicator of academic prestige we identified is having being repeatedly selected as a nominator. In Supplementary Fig. 10 we show how nominators who casted more candidatures are more likely to nominate winning candidates.

A mechanistic model of the Nobel ecosystem

To better understand our empirical findings, we develop a model describing the Nobel ecosystem, including the growth of its nomination network. In this network, nodes represent experts that can be at the same time both nominators and nominees. Links between nodes are directed and indicate a nomination. Each node i is characterized by (i) a random score s_i, distributed uniformly between 0 and 1, which embodies the individual expertise and merit; (ii) the node age a_i measured in time-steps; and iii) a vector of features ${\overrightarrow{F}}_{i}$ ⁴² encoding information such as nationality, gender, etc. The network is first initialized with N₀ nodes and no links: these initial nodes represent a starting core of nominators, that are never considered as potential nominees. Then, at every time step, a new node is injected, and a set of L potential nominators is selected on the base of their score and, eventually, their age with probability ${p}_{i}^{out}=\frac{{a}_{i}^{\alpha }{s}_{i}}{{\sum }_{i}\,{a}_{i}^{\alpha }{s}_{i}}$. Here, the score is multiplied by an aging factor ${a}_{i}^{\alpha }$. If α > 0, it is representing the social advantage cumulated along the career⁴³, while if α < 0 young nodes are favoured in the selection. If α = 0, age has no weight in the choice. These L potential nominators might then connect or not with the new node, the choice corresponding to deciding on whether to support or not the candidature of the new node. The evaluation of a node is based, simultaneously, upon reputation (score) and homophilic tendencies (defined in the feature space). Therefore, it is crucial to define the similarity $S({\overrightarrow{F}}_{1},{\overrightarrow{F}}_{2})$ between two nodes’ feature vectors. Here, we define similarity as $S({\overrightarrow{F}}_{1},{\overrightarrow{F}}_{2})=(1+\,\cos \,sim\,({\overrightarrow{F}}_{1},{\overrightarrow{F}}_{2}))/2$ where

$$cossim\,({\overrightarrow{F}}_{1},{\overrightarrow{F}}_{2})=\frac{{\overrightarrow{F}}_{1}\cdot {\overrightarrow{F}}_{2}}{\Vert {\overrightarrow{F}}_{1}\Vert \Vert {\overrightarrow{F}}_{2}\Vert }$$

(1)

is the cosine similarity. S = 1 if the features are parallel, S = 0 if they are orthogonal. This choice differs for example to the more commonly used Axelrod’s model⁴⁴, where the similarity is given by the fraction of shared features, and has the advantage that the cosine similarity induces an ordering among the feature vectors. Similarly to bounded confidence models⁴⁵, interactions (here, the nominations) are possible only if the nodes are not too different. However, here differences can be ignored if the reputation is high enough. The possible new link between i₁ and i₂ is indeed accepted, deterministically, only if

$$B{s}_{{i}_{2}}+(1-B)S({\overrightarrow{F}}_{{i}_{1}},{\overrightarrow{F}}_{{i}_{2}})\ge {H}_{T},$$

(2)

where B is a parameter describing the meritocracy of the choice (if B = 1, the choice is purely based on score), H_T a threshold parameter, This framework allows one to design block-like adjacency matrices describing the complex community structures observed in our data, including the case groups’in-between’ other two (Fig. 6a,b). For instance, in our particular problem – the study of the Nobel Prize nomination process – the added value of this perspective is manifest as it allow us to map the fluid relationships between scientific disciplines²⁵ (as illustrated for example in Fig. 6b). Similarly to what observed for the Axelrod’s model⁴⁶, one possible output here is the creation of segregated non-overlapping communities, consisting of nodes of identical features (See Fig. 6d, Supplementary Fig. 11 and the Supplementary Information).

The fact that nodes’ scores are considered for both the nominators selection and the nomination naturally segregates a high score core from a low score periphery. This property reflects what we observed in the data with the high concentration of Nobel laureates in the core. Another remarkable consequence of the homophilic link creation is that, in certain regimes, the presence of different categories of nodes makes the average score of selected nodes higher than what observed ceteris paribus when all nodes injected belong to the same category (see Fig. 6e). In a nutshell (see Methods for more detail), the process allows for selecting the best nodes among distant categories, even when the meritocracy is relatively small and closer categories are accepted without any regard of node score.

However, together with this relatively positive consequence, the presence of nodes of different categories might naturally yield also some negative effect. One example worth highlighting here is the persistent influence that a hegemonic group may play if the selection process is driven by a strong memory effect, as is the case in the Nobel nomination network. To illustrate this effect, we introduce in the model two further features inspired by the Nobel selection process. First, every T time-steps, a prize is awarded with a probability ${p}_{i}^{award}=\frac{{k}_{i}}{{\sum }_{j\in J}\,{k}_{j}}$ proportional to the node in-degree k_i. This selection is restricted to the set J of nodes that are not yet laureate (multiple awards are not permitted). Second, besides the L nominators selected accordingly to skill and, eventually, age, also the last M laureates are included in the nominators pool, and are similarly allowed to decide whether or not to nominate a new node with the outgoing directed link representing. This ‘design choice’ – of making Nobel laureates systematically become nominators – strengthens the central position of skilled nominators at the core of the network. In the following, we show that it has the drawback of perpetuating for longer time the influence of a hegemonic initial pool of nominators, established as the initial set of N₀ nodes.

To illustrate this, we study the simple scenario of a mono-dimensional feature space with only two possible types of nodes: (+1) and (−1). The two-features scenario allows only for identical S = 1 or orthogonal S = 0 pairs. To simplify the interpretation of the results, let us assume that this scenario describe gender homophilic decisions in the Nobel Prize. First, we can analyze the model without the new feature imposing the last former laureates as nominees, and with non-hegemonic initial conditions. We show in Fig. 7a an example of a network in this first non-hegemonic scenario, with modularity ≈0.4 built with B = 0.2 and H_T = 0.18.

For this example, in Fig. 7b we measure the gender unbalance in four different scenarios as the cumulative fraction of the number of awards assigned among the hegemonic gender until a given time. The baseline scenario is represented by the orange green circles, where there are no aging effects and the mechanics of injecting the laureates as nominators is not activated. This has to be compared with three variated scenarios. The positive aging scenario (blue triangles) introduces an extra age effect to reputation, describing a system where further social advantage is cumulated along the career. In this case the system has a stronger memory and the hegemony is sustained for longer times. The realistic scenario (orange stars) is without aging, but here former laureates automatically become nominators, as is the case for the Nobel prize. Similarly to the positive aging scenario, the system has a stronger memory and the hegemonic initial unbalance is maintained for longer times. The last (red square) is the negative aging scenario. It is similar to the realistic one, but here a negative aging parameter favours youngest nodes as nominators. In this last case, the memory is reduced and the effect due to hegemonic initial condition and high homophily is limited. This example suggests that the current design, where Nobel laureates are automatically included as nominators, creates a memory effect that might perpetuate existing hegemonies. The same result is found for a very broad range of the parameters B and H_T (see Supplementary Fig. 12).

Discussion

Interestingly, starting 2019 the Nobel committees explicitly request the nominators of considering diversity in geography, gender, and topic^10,47. Further measures have also been requested to improve gender balance, including changes in the nomination committee and nomination rules⁴⁷. Here, we have shown that these requests are definitively justified. Nominations are also surely gender biased, with both females and males preferring candidates of the same gender. The winning odds would be however fair if the choice of nominators would be gender balanced in the first place, but this was definitively not the case in the period 1901–1965, where official data and metadata are publicly available.

There is also evidence that nominations are mostly affected by nationalistic homophily. The nomination network is highly modular with respect to the country of origin, de facto making more difficult to award candidates from less represented countries, further increasing inequality. This effect can be originated by different mechanisms – e.g. nominators’ limited social/scientific neighborhood or real nationalistic preferences – whose determination is, however, beyond the scope of this work. Nevertheless, the existence of this type of homophily – similar to the one discovered in other highly competitive events, such as Olympics⁴⁸, and in social dynamics⁴⁹ – represents a huge obstacle to the fairness of the overall assignment process. The sum of all these effects renders the ultimate decision of who, among the candidate available, will win the Nobel prize highly predictable from the aggregated history of nominations up to that year. An additional evidence to support this argument is given by a machine learning algorithm able to learn these patterns, as we show in Supplementary Fig. 13.

This process is further influenced by political homophily and perceived academic prestige in the committees. Our results indicate that the Nobel assignment procedure (see Fig. 1a) is intrinsically reinforcing the propagation of these homophilic effects sustaining the presence of academic hegemonies over time. In particular, these effects are aggravated by the current prize assignment mechanism allowing new Nobel laureates to become nominators in the subsequent years, an undesired effect that can be reduced by selecting new young experts as nominators, as suggested by our model.

More in general, having pointed out a number of social mechanisms that influence the Nobel selection process, a natural question that arises is about the relative strength of such mechanisms and where eventually one may intervene to reduce the biases emerging from these. In this sense, we are inclined to conjecture that the single most efficient intervention would be to have the Nobel committees unbiased in terms of the homophilic tendencies highlighted in this paper. Homophily would be less an issue if the committee would not display hegemonic prevalences in terms of nationality and gender.

Materials and Methods

Kullback-Leibler divergence

The Kullback-Leibler divergence is a measure of “surprise”, quantifying how much a distribution P(x) can be well described by another distribution Q(x), where x is some observable of interest. Formally, it is defined by

$${D}_{{\rm{KL}}}(P||Q)={\int }_{-\infty }^{\infty }P(x)\,{\log }_{2}\frac{P(x)}{Q(x)}dx$$

quantifying information loss in describing P(x) by means of Q(x). A divergence close to zero indicates that the two distribution are very similar, if not identical. Conversely, larger the difference between the two distributions, larger the expected value of their divergence. In this work, we consider the distribution of the countries of nominators and nominees, separately, and we calculate their Kullback-Leibler divergence between successive years to quantify the underlying similarity across time.

Gini coefficient

The Gini coefficient is a measure of statistical dispersion, originally introduced to quantify income and wealth inequality. Formally, it is derived from the Lorenz curve L_P(y) of the probability distribution P(x), which describes the relative weight of the bottom y% items of the sample from P(x), as

$$G(P)=2{\int }_{0}^{100 \% }(y-L(y))dy$$

and thus represents the relative dimension of the inequality gap between the line of perfect equality and the Lorenz curve observed for the distribution at hand. The coefficient ranges from 0 to 1. A Gini coefficient of 0 represents perfect equality, while maximal inequality among the recorded values corresponds to a value of 1. Intermediate values, such as 0.5, characterize, for instance, a relatively high income inequality for a country. In this work, we measure the Gini coefficient of the distribution of the countries of nominators and nominees for a given year.

Computing the winning probabilities

To isolate the academic reputation, we have studied in Fig. 5 and Supplementary Fig. 9 the sequences of candidature years for different nominees. In this analysis, years are not necessarily consecutive. Moreover, in Fig. 5a the sequences representing the years of candidatures of non-laureated nominees that were shorter than 12 items have been extended to that length, as being excluded from the nomination process implies the impossibility of being awarded.

Selection with multiple groups

We observed in Fig. 6e that the model proposed is better at selecting high score nodes if the system is equally constituted by multiple categories. This apparently counterintuitive effect can be easily understood by noticing that, by definition, the similarity within the same category is S₁ = 1, while an eventual second closest category has a similarity S₂ = 1 − ΔS. For sake of simplicity, let us consider the case B < ΔS/(1 + ΔS) where nodes of the second category can be selected only for values of H_T ≤ (1 − B). This last condition corresponds to requiring that all nodes of the first category are automatically accepted, and consequently that the average score for the nodes of the same category of 〈s₁〉 = 0.5. Since s is distributed uniformly between 0 and 1, the values of H_T act as a cursor selecting a fraction f₂ ∈ [0, 1] of nodes that pass a threshold H_T = (1 − f₂)B + (1 − B)(1 − ΔS). These nodes are those with the highest scores among the second category will average 〈s₂〉 = 1 − f₂/2 > 〈s₁〉. In total, the average score for any accepted node with H_T activating links in the first and second categories is given by the weighted average 〈s〉 = (n₁〈s₁〉 + n₂f₂〈s₂〉)/(n₁ + n₂f₂) > 〈s₁〉, where n₁ is the fraction of nodes with similarity S = 1 with a randomly chosen node, and n₂ the fraction of nodes with similarity S₂ with a random node (see Fig. 6d, where the dashed line indicates the analytical solution found with the principles described here above). This last inequality states that the average score 〈s〉 exceeds the averages score 〈s₁〉 one will have if all nodes belong to the same category.

Data availability

Data are available from the authors upon request.

Notes

Year following the promulgation of the antisemitic and racist laws in Nazi Germany.
We verified that the distribution produced by the null model is homogeneous (see Supplementary Fig. 5), and statistically compatible with a p–value of 0. For sake of completeness, the most conservative statistical test – given by the Chebyshev inequality – suggests that this observation is likely with probability 5.7 · 10⁻³.

References

Statutes of the nobel foundation, https://www.nobelprize.org/nobel_organizations/nobelfoundation/statutes.html Accessed: October 2017 (2017).
Nomination and selection of nobel laureates, https://www.nobelprize.org/nomination/ Accessed: October 2016 (2016).
Wang, D., Song, C. & Barabási, A.-L. Quantifying long-term scientific impact. Science 342, 127–132 (2013).
Article ADS CAS Google Scholar
Moreira, J. A., Zeng, X. H. T. & Amaral, L. A. N. The distribution of the asymptotic number of citations to sets of publications by a researcher or from an academic department are consistent with a discrete lognormal model. PloS one 10, e0143108 (2015).
Article Google Scholar
Sinatra, R., Wang, D., Deville, P., Song, C. & Barabási, A.-L. Quantifying the evolution of individual scientific impact. Science 354, aaf5239 (2016).
Article Google Scholar
Petersen, A. M. et al. Reputation and impact in academic careers. PNAS 111, 15316–15321 (2014).
Article ADS CAS Google Scholar
Clynes, T. Where nobel winners get their start. Nature 538, 152 (2016).
Article ADS CAS Google Scholar
Liu, N. C. & Cheng, Y. The academic ranking of world universities. High. education Eur. 30, 127–136 (2005).
Article Google Scholar
Marginson, S. & Van der Wende, M. To rank or to be ranked: The impact of global rankings in higher education. J. studies international education 11, 306–329 (2007).
Article Google Scholar
Editorial. Noble effort. Nature 562, 164 (2018).
Article Google Scholar
Jones, B. F. & Weinberg, B. A. Age dynamics in scientific creativity. PNAS 108, 18910–18914 (2011).
Article ADS CAS Google Scholar
Fortunato, S. et al. Growing time lag threatens nobels. Nature 508, 186–186 (2014).
Article ADS CAS Google Scholar
Casadevall, A. & Fang, F. C. Is the Nobel Prize good for science? The FASEB J. 27, 4682–4690 (2013).
Article CAS Google Scholar
Hirsch, J. E. An index to quantify an individual’s scientific research output. PNAS 102, 16569–16572 (2005).
Article ADS CAS Google Scholar
Radicchi, F., Fortunato, S. & Castellano, C. Universality of citation distributions: Toward an objective measure of scientific impact. PNAS 105, 17268–17272 (2008).
Article ADS CAS Google Scholar
Dorogovtsev, S. N. & Mendes, J. F. Ranking scientists. Nat. Phys. 11, 882 (2015).
Article CAS Google Scholar
Latour, B. & Woolgar, S. Laboratory life: The construction of scientific facts (Princeton University Press, 2013).
Balietti, S., Goldstone, R. L. & Helbing, D. Peer review and competition in the art exhibition game. PNAS 201603723 (2016).
Petersen, A. M. Quantifying the impact of weak, strong, and super ties in scientific careers. PNAS 112, E4671–E4680 (2015).
Article ADS CAS Google Scholar
Sekara, V. et al. The chaperone effect in scientific publishing. PNAS 115, 12603–12607, https://doi.org/10.1073/pnas.1800471115 (2018).
Article ADS CAS PubMed Google Scholar
Ma, Y. & Uzzi, B. Scientific prize network predicts who pushes the boundaries of science. PNAS 115, 12608–12615, https://doi.org/10.1073/pnas.1800485115 (2018).
Article CAS PubMed Google Scholar
Zuckerman, H. The sociology of the nobel prizes. Sci. Am. 217, 25–33 (1967).
Article ADS CAS Google Scholar
Zuckerman, H. Nobel laureates in science: Patterns of productivity, collaboration, and authorship. Am. Sociol. Rev. 391–403 (1967).
Article CAS Google Scholar
Clauset, A., Larremore, D. B. & Sinatra, R. Data-driven predictions in the science of science. Science 355, 477–480 (2017).
Article ADS CAS Google Scholar
Szell, M., Ma, Y. & Sinatra, R. A Nobel opportunity for interdisciplinarity. Nat. Phys. 14, 1075–1078 (2018).
Article CAS Google Scholar
Fortunato, S. et al. Science of science. Science 359, eaao0185 (2018).
Article Google Scholar
Zuckerman, H. The sociology of science. (Sage Publications, Inc, 1988).
Bourdieu, P. Science of science and reflexivity (Polity, 2004).
Friedman, R. M. Nobel physics prize in perspective. Nature 292, 793–798 (1981).
Article ADS CAS Google Scholar
Friedman, R. M. The politics of excellence: Behind the Nobel Prize in science (Times Books, 2001).
Merton, R. K. The sociology of science: Theoretical and empirical investigations (University of Chicago press, 1973).
Newman, M. E. The structure and function of complex networks. SIAM review 45, 167–256 (2003).
Article ADS MathSciNet Google Scholar
Boccaletti, S., Latora, V., Moreno, Y., Chavez, M. & Hwang, D.-U. Complex networks: Structure and dynamics. Phys. reports 424, 175–308 (2006).
Article ADS MathSciNet Google Scholar
Merton, R. K. et al. The matthew effect in science. Science 159, 56–63 (1968).
Article ADS Google Scholar
Petersen, A. M., Jung, W.-S., Yang, J.-S. & Stanley, H. E. Quantitative and empirical demonstration of the matthew effect in a study of career longevity. PNAS 108, 18–23 (2011).
Article ADS CAS Google Scholar
Perc, M. The matthew effect in empirical data. J. The Royal Soc. Interface 11, 20140378 (2014).
Article Google Scholar
De Domenico, M. & Arenas, A. Researcher incentives: Eu cash goes to the sticky and attractive. Nature 531, 580–580 (2016).
Article Google Scholar
Lunnemann, P., Jensen, M. H. & Jauffred, L. Gender Bias in Nobel Prizes. arxiv.org 1810.07280 (2018).
Newman, M. E. Modularity and community structure in networks. PNAS 103, 8577–8582 (2006).
Article ADS CAS Google Scholar
Wittebolle, L. et al. Initial community evenness favours functionality under selective stress. Nature 458, 623–626 (2009).
Article ADS CAS Google Scholar
Lorenz, J., Rauhut, H., Schweitzer, F. & Helbing, D. How social influence can undermine the wisdom of crowd effect. PNAS 108, 9020–9025 (2011).
Article ADS CAS Google Scholar
Fortunato, S., Latora, V., Pluchino, A. & Rapisarda, A. Vector opinion dynamics in a bounded confidence consensus model. Int. J. Mod. Phys. C 16, 1535–1551 (2005).
Article ADS Google Scholar
Zuckerman, H. Scientific elite: Nobel laureates in the United States (Transaction Publishers, 1977).
Axelrod, R. The dissemination of culture: A model with local convergence and global polarization. J. conflict resolution 41, 203–226 (1997).
Article Google Scholar
Castellano, C., Fortunato, S. & Loreto, V. Statistical physics of social dynamics. Rev. Mod. Phys. 81, 591–646 (2009).
Article ADS Google Scholar
Murase, Y., Jo, H.-H., Török, J., Kertész, J. & Kaski, K. Structural transition in social networks: The role of homophily. arxiv.org 1808.05035 (2018).
Gibney, E. Nobel committees to tackle gender skew. Nature 562, 19–19 (2018).
Article ADS CAS Google Scholar
Zitzewitz, E. Nationalism in winter sports judging and its lessons for organizational decision making. J. Econ. & Manag. Strateg. 15, 67–99 (2006).
Article Google Scholar
Hewstone, M., Rubin, M. & Willis, H. Intergroup bias. Annu. review psychology 53, 575–604 (2002).
Article Google Scholar
Abel, G. J. & Sander, N. Quantifying global international migration flows. Science 343, 1520–1522 (2014).
Article ADS CAS Google Scholar

Download references

Acknowledgements

The authors thank Alex Arenas, Oriol Artime, Pierluigi Sacco, and Roberta Zambelli for fruitful discussions.

Author information

Authors and Affiliations

CoMuNe Lab, Fondazione Bruno Kessler, Via Sommarive 18, 38123, Povo, TN, Italy
Riccardo Gallotti & Manlio De Domenico

Authors

Riccardo Gallotti
View author publications
You can also search for this author in PubMed Google Scholar
Manlio De Domenico
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.G. and M.D.D. both designed and performed research, prepared figures and wrote the manuscript.

Corresponding author

Correspondence to Manlio De Domenico.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gallotti, R., De Domenico, M. Effects of homophily and academic reputation in the nomination and selection of Nobel laureates. Sci Rep 9, 17304 (2019). https://doi.org/10.1038/s41598-019-53657-6

Download citation

Received: 13 May 2019
Accepted: 12 October 2019
Published: 21 November 2019
DOI: https://doi.org/10.1038/s41598-019-53657-6

This article is cited by

Universalism and particularism in the recommendations of the nobel prize for science
- Byoung-Kwon Ko
- Yeongkyun Jang
- Jae-Suk Yang
Scientometrics (2024)
A two-fold evaluation in science: the case of Nobel Prize
- Lingzhi Chen
- Yutao Sun
- Cong Cao
Scientometrics (2023)
Citation inequity and gendered citation practices in contemporary physics
- Erin G. Teich
- Jason Z. Kim
- Dani S. Bassett
Nature Physics (2022)
Scientific elites versus other scientists: who are better at taking advantage of the research collaboration network?
- Yun Liu
- Mengya Zhang
- Xiongxiong You
Scientometrics (2022)
The coauthorship networks of the most productive European researchers
- Marian-Gabriel Hâncean
- Matjaž Perc
- Jürgen Lerner
Scientometrics (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.