On the inadequacy of nominal assortativity for assessing homophily in networks

Karimi, Fariba; Oliveira, Marcos

doi:10.1038/s41598-023-48113-5

Download PDF

Article
Open access
Published: 29 November 2023

On the inadequacy of nominal assortativity for assessing homophily in networks

Fariba Karimi^1,2 &
Marcos Oliveira³

Scientific Reports volume 13, Article number: 21053 (2023) Cite this article

1427 Accesses
13 Altmetric
Metrics details

Subjects

Abstract

Nominal assortativity (or discrete assortativity) is widely used to characterize group mixing patterns and homophily in networks, enabling researchers to analyze how groups interact with one another. Here we demonstrate that the measure presents severe shortcomings when applied to networks with unequal group sizes and asymmetric mixing. We characterize these shortcomings analytically and use synthetic and empirical networks to show that nominal assortativity fails to account for group imbalance and asymmetric group interactions, thereby producing an inaccurate characterization of mixing patterns. We propose the adjusted nominal assortativity and show that this adjustment recovers the expected assortativity in networks with various level of mixing. Furthermore, we propose an analytical method to assess asymmetric mixing by estimating the tendency of inter- and intra-group connectivities. Finally, we discuss how this approach enables uncovering hidden mixing patterns in real-world networks.

A novel method for assessing and measuring homophily in networks through second-order statistics

Article Open access 13 June 2022

The paradox of second-order homophily in networks

Article Open access 25 June 2021

Interplay between tie strength and neighbourhood topology in complex networks

Article Open access 03 April 2024

Introduction

Understanding how groups interact in networks is fundamental for uncovering mechanisms underlying diverse phenomena, from protein interactions to social communication^1,2,3. Such group-level interactions often generate mixing patterns in networks, commonly assessed with single-valued measures such as nominal assortativity^4,5. Though these measures help analyze group mixing concisely, they may be grounded on unrealistic assumptions about the network structure, which might produce imprecise estimates of group mixing tendencies, limiting our understanding of groups in networks.

Recent advances in relational data collection have enabled studies on mixing patterns to investigate fundamental processes that drive such tendencies⁶. In particular, much research has characterized homophily—nodes’ tendency to connect with alike—in a variety of social settings due to processes such as selective mixing and in-group favoritism^2,7. For instance, sexual partnership networks are assortative by race in the United States, meaning that individuals form ties with partners of the same race more often than one would expect by chance⁴. Similarly, college students are more likely to have friendships with peers of the same gender, major, residence, and year⁸. These homophilic tendencies have been documented in various other social phenomena such as research collaboration⁹, artist partnerships¹⁰, lawmaking¹¹, and book readership¹². Beyond social networks, previous works have also demonstrated homophily in biological domains such as networks of protein similarity¹³ and dolphin companionship¹⁴.

Researchers in network science generally use the so-called nominal assortativity to characterize mixing patterns regarding categorical attributes (e.g., race, gender, protein type)⁴, such as those mentioned above. Nominal assortativity or attribute assortativity describes how intra- and inter-group connectivity diverges from what we expect solely due to degree connectivity of the nodes and groups. The advantage of this measure compared to other existing measures of homophily is that it takes into consideration connectivity patterns of groups to assess the statistical significance. Its straightforward definition produces an intuitive quantity that ranges from $-1$ (i.e., complete disassortative mixing) to 0 (i.e., neutral) to $+1$ (i.e., complete assortative mixing), which enables researchers to analyze group mixing in networks concisely. Recently, Cinelli and colleagues showed that the assortativity coefficient, r, is bounded, analogous to the constraints that exist on Pearson correlation coefficients¹⁵. They demonstrated that the assortativity coefficient can range between $r_{\text {min}}$ and $r_{\text {max}}$, which are dependent on the edge counts in the networks. This specifically implies that $r=1$ is only achievable when the sum of the inter-group edge counts is equal to the total number of edges in the network. Conversely, $r=-1$ is attainable solely in cases where this sum is zero. Crucially, this work exposes the influence of certain network attributes such as metadata assignment and degree sequence on the bounds of r.

Here we demonstrate that nominal assortativity presents two fundamental inadequacies. First, it overlooks the group-size imbalance, implicitly assuming that groups are relatively equal. This assumption neglects that smaller groups have fewer possibilities to connect with themselves, misrepresenting mixing patterns in scenarios of group-size imbalance (see Fig. 1a). Second, the measure consists of a uni-dimensional value, only characterizing symmetric group mixing (or an average mixing). This restriction ignores potential asymmetries in networks, thereby missing relevant mixing patterns (Fig. 1b). Both inadequacies are problematic, particularly when analyzing real-world data and in the presence of minorities.

In real-world networks, groups tend to have unequal sizes, and some groups (i.e., minority groups) might be much smaller than the largest group. For instance, women are underrepresented in STEM fields, such as Computer Science and Physics, making them a minority group in professional networks^16,17,18. When analyzing such networks and other imbalanced data sets, we must consider group sizes to estimate the likelihood of in-group mixing biases. Besides unequal group sizes, networks might display asymmetries in how groups interact. For example, in male-dominated scientific fields, established researchers could be primarily men due to historical first-mover advantages¹⁸. Thus, senior men have the resources to drive their collaboration network, implying that the tendency for male-male collaboration may not be the same as female-female collaboration^19,20. In such settings, homophily is asymmetric, having different strengths for the minority and majority groups. These asymmetries, however, are lost when using a single-valued measure to characterize group mixing, such as nominal assortativity.

In this work, we demonstrate how nominal assortativity misses relevant mixing patterns in networks with unequal group sizes or asymmetric mixing and show how to tackle these shortcomings. First, we use generative network models with adjustable mixing parameters to show that nominal assortativity fails to recover the expected assortativity in synthetic networks. We characterize this limitation analytically and numerically by examining the relationship between assortativity, group size, and asymmetric mixing. Second, we propose the adjusted nominal assortativity and show that this adjustment recovers the expected assortativity from synthetic networks. Third, we propose to assess asymmetric mixing in networks by estimating group-mixing tendencies using our analytical formulations. Finally, we discuss how our approach enables characterizing hidden mixing patterns in real-world networks.

Results

Nominal assortativity characterizes mixing patterns by assessing the significance of the intra-group. To that end, this definition employs the $B \times B$ mixing matrix ${\textbf {e}}$ to account for groups connectivity, where B is the number of groups, and each matrix element $e_{ij}$ corresponds to the fraction of edges connecting nodes from group i to nodes from group j. The nominal assortativity measure is then defined as follows:

$$\begin{aligned} r = \dfrac{\sum _i e_{ii} - \sum _{i} a_i b_i}{1-\sum _i a_i b_i}, \end{aligned}$$

(1)

where $a_{i}$ and $b_{i}$ are the fraction of edges that, respectively, begin and end at nodes from group i, defined as $a_{i} = \sum _{j} e_{ij}$ and $b_{i} = \sum _{j} e_{ji}$⁴. This definition produces an intuitive quantity that equals zero when groups lack intra- and inter-group tendencies (i.e., $e_{ii} = a_i b_i$). The quantity reaches to its maximum $r=1$ when intra-group ties dominate the network (i.e., $\sum _i e_{ii} = 1$) and becomes negative when inter-group ties are predominant.

Nominal assortativity on networks with groups of unequal sizes

To examine how nominal assortativity represents mixing patterns, we use generative network models in which we have a prior knowledge on what to expect from the value of mixing. We aim to evaluate assortativity’s ability to recover the expected mixing value. More precisely, we generate random networks using a model with a tunable group mixing parameter h that corresponds to the probability of same-group nodes being connected, whereas its complement, $1-h$, is the probability of inter-group ties (see “Methods”). Here, we focus on the case of two groups, $B = 2$, in which nodes possess a binary attribute (e.g., red/blue, male/female). The case of beyond two groups is discussed in the Supplementary Note 4. We examine networks with a fixed h and varying group sizes, finding that nominal assortativity goes to zero as the minority group decreases in size (Fig. 1a). For example, when $h=0.8$ (i.e., homophily), assortativity can vary from 0.6 to 0, depending on the proportional size of the minority group, despite fixed group mixing.

To investigate why nominal assortativity varies with the minority fraction, we turn to the analytical formulation of the assortativity. Let us use a more general notion of group mixing in which $h_{ii}$ denotes the intrinsic tendency of a node from group i connecting to a node of the same group; its complement $h_{ij} = 1 - h_{ii}$ is the tendency of a node in group i to connect to a node in group j. Therefore, in a random network, the probability of finding an edge between group i and group j express as $p_{ij} = f_if_jh_{ij}$, where f corresponds to the proportional size of groups, implying that each mixing matrix element can be defined as $e_{ij} = p_{ij}/\sum _{ij} p_{ij}$, where the denominator is a normalizing factor.

Thus, $\sum _i e_{ii}$ and $\sum _i a_i b_i$ can be expressed as follows:

$$\begin{aligned} \sum _i e_{ii} = \dfrac{{f_0}^2 h_{00}+ {f_1}^2 h_{11}}{\sum _{ij} p_{ij}}, \end{aligned}$$

(2)

and

$$\begin{aligned} \sum _i a_i b_i = \dfrac{(f_0^2 h_{00} + f_0 f_1 h_{01} )^2 + (f_1^2 h_{11} + f_0 f_1 h_{10} )^2}{(\sum _{ij} p_{ij})^2}, \end{aligned}$$

(3)

where 0 and 1 are the labels for the minority and majority group, respectively. Finally, inserting Eqs. (2) and (3) into Eq. (1), the nominal assortativity can be written as:

$$\begin{aligned} r = \dfrac{\dfrac{{f_0}^2 h_{00}+ {f_1}^2 h_{11}}{\sum _{ij} p_{ij}} - \dfrac{(f_0^2 h_{00} + f_0 f_1 h_{01} )^2 + (f_1^2 h_{11} + f_0 f_1 h_{10} )^2}{(\sum _{ij} p_{ij})^2}}{1-\dfrac{( f_0^2 h_{00} + f_0 f_1 h_{01} )^2 + ( f_1^2 h_{11} + f_0 f_1 h_{10})^2}{(\sum _{ij} p_{ij})^2}}. \end{aligned}$$

(4)

This equation reveals that nominal assortativity is a function of group sizes $f_0$ and $f_1$. We verify this group-size dependency by comparing our analytical formulation with the assortativity measured on synthetic networks, finding a perfect agreement between Eq. (4) and simulations (Fig. 2a, b). Our results confirm the group-size dependence and reveal that this dependence increases with smaller minority groups (Fig. 2c). In contrast, when groups have similar sizes, we observe, as expected, a linear relationship between group mixing h and nominal assortativity. More precisely, when groups are equal in size, $f_0 = f_1 = 0.5$, Eq. (4) becomes $r = h_{00} + h_{11} - 1$. The group-size dependency occurs in other types of networks such as scale-free networks. For instance, we simulate the Barabási–Albert homophily (BA-Homophily) model, which incorporates group mixing preferences with the preferential attachment²¹, and demonstrate that nominal assortativity is a function of group sizes on such networks and in scenarios involving more than two groups (see Supplementary Notes 2 and 4). Overall, these findings imply that nominal assortativity is unadjusted for group sizes and introduces an artifactual bias into mixing analyses in imbalanced scenarios.

The adjusted nominal assortativity

Here we propose to adjust the nominal assortativity for group sizes by normalizing the elements of the mixing matrix. This approach accurately retrieves the expected assortativity in networks, enabling us to assess mixing patterns in imbalanced networks. To that end, we define the adjusted mixing matrix $\textbf{e}^{\star }$, which accounts for the network’s pool of opportunities, namely, the fact that larger groups have more opportunities to connect. We define each element of the adjusted mixing matrix $\textbf{e}^{\star }$ to be

$$\begin{aligned} e^\star _{ij} = \dfrac{e_{ij}}{f_if_j}, \end{aligned}$$

where $f_k$ corresponds to the proportional size of group k. This adjustment ensures that the elements of the mixing matrix only represents the mixing tendencies (h) that are relevant for measuring intrinsic homophily and assortativity and not other factors. For instance, in the case of two groups, where original mixing elements are $e_{00} \simeq f_0^2 h_{00}$ and $e_{11} \simeq f_1^2 h_{11}$, the adjusted elements of the matrix are expressed as $e_{00}^\star \simeq h_{00}$ and $e_{11}^\star \simeq h_{11}$.

Moreover, we define the adjusted nominal assortativity, $r_{adj}$, as follows:

$$\begin{aligned} r_{adj} = \dfrac{\sum _i e^\star _{ii} - \sum _{i} a^\star _i b^\star _i}{1-\sum _i a^\star _i b^\star _i}, \end{aligned}$$

where $a^\star _{i} = \sum _{j} e^\star _{ij}$ and $b_{i} = \sum _{j} e^\star _{ji}$. This adjustment considers the effects of group-size imbalance on the mixing matrix, leading to a consistent assessment of mixing patterns in imbalanced scenarios.

We verify the proposed measure by generating synthetic data with different group-imbalance and mixing scenarios. We examine networks generated with a fixed h and varying group sizes, revealing that adjusted nominal assortativity accurately recovers the expected mixing independent of group sizes (Fig. 2d-e). Thus results show that the adjusted nominal assortativity has a linear relationship with group mixing h, regardless of $f_0$ and $f_1$ as expected (see Fig. 2f). We find similar results for scale-free networks and three-groups scenarios (see Supplementary Note 2 and 4). In sum, the adjusted nominal assortativity accounts for group sizes and pool of opportunities, enabling us to assess group mixing preferences accurately.

Assessing group mixing in empirical networks

Next, we explore nominal assorativity in different real-world networks with unequal group sizes, showing that nominal assortativity underestimates the mixing patterns compared to the adjusted nominal assortativity (see Table 1). We analyze social networks of academic collaboration and face-to-face interactions with annotated binary gender information (see Supplementary Note 6 for detailed data descriptions)^22,23,24. In most cases, assortativity r is lower than the adjusted assortativity $r_{adj}$, especially in the cases of small minority groups. For example, in the collaborative coding platform GitHub, where women are only 6% of the network, nominal assortativity is $r=0.04$, implying the absence of assortative collaboration; in contrast, the adjusted assortativity is $r_{adj}=0.16$, suggesting a potential gender assortativity. Similarly, nominal assortativity might mislead us to mistake changes in group mixing for changes in group sizes. For instance, in the collaboration network among computer scientists (DBLP), assortativity r increases from $r=0.04$ in 1980 to $r=0.10$ in 2010, which could imply a possible change in group mixing over time. However, this change might be merely due to the growth of the minority group. The minority size almost doubled, from 11% to 21%, and the adjusted assortativity indicates a stable mixing around $r_{adj} = 0.15$. Overall, these findings underscore the importance of accounting for group sizes when analyzing mixing patterns and the risks of ignoring group imbalance in networks.

Table 1 Nominal assortativity and adjusted assortativity in empirical networks.

Full size table

Mixing patterns in networks with asymmetric mixing

A single measure of assortativity reduces information about the $B\times B$ mixing matrix into a single value, leading to a concise measure but potentially missing relevant asymmetries in mixing. The idea that a single summary statistic may not be representative of a dataset is, of course, not new and has been shown in prior works²⁵. More recently, Peel and colleagues showed heterogeneity in local assortativity⁵, and Piraveenan et al.²⁶ showed the extent to which each node contributes to the measure of assortativity. Here, we pay a special attention to the asymmetric nature of group mixing while assuming no heterogeneity at the node level.

To characterize r and $r_{adj}$ in asymmetric scenarios, we relax the assumption of $h_{00}=h_{11}=h$ and use our analytical formulation (Eq. 4) to evaluate the nominal assortativity over the whole parameter space of $h_{00}$ and $h_{11}$ (see Fig. 3). We find that the adjusted nominal assortativity is consistent and independent of group size in asymmetric cases, whereas the unadjusted version is size-dependent. Both measures, however, might characterize contrasting mixing patterns with the same value. In particular, these measures might indicate an absence of inter- or intra- group mixing tendency despite significant group mixing. For instance, when $h_{00} = 0.8$ and $h_{11} = 0.2$, the minority group has a strong homophilic tendency, whereas the majority has a strong heterophilic tendency; yet, nominal assortativity equals zero, incorrectly suggesting a lack of assortative or disassortative patterns (Fig. 3).

To better understand the underlying reason for this misrepresentation, note that when $r=0$, the numerator in Eq. (4) is zero, leading to the following equation:

$$\begin{aligned} \dfrac{{f_0}^2 h_{00}+ {f_1}^2 h_{11}}{\sum _{ij} p_{ij}} - \dfrac{(f_0^2 h_{00} + f_0 f_1 h_{01} )^2 + (f_1^2 h_{11} + f_0 f_1 h_{10} )^2}{(\sum _{ij} p_{ij})^2} = 0. \end{aligned}$$

Simplifying this equation by using the expression of Eq. (6), we find that $h_{00} = 1 - h_{11}$ satisfies this condition. In other words, in many cases when nominal assortavitiy reports a zero value (i.e., lack of any dis/assortative preferences), the group mixing tendencies could be widely different. These findings show that compressing the mixing matrix into a single value, such as assortativity, can hide relevant asymmetric mixing patterns that are present in networks. It is worth noting that paying attention to asymmetries in mixing patterns between groups is important in other applications, such as the emergence of core-periphery structures²⁷.

Assessing asymmetric mixing patterns in networks

In order to assess asymmetric mixing among groups, we propose to turn to the mixing probabilities in a network given an assumption of its generative process. For example, in a random homophilic networks described earlier (ER-Homophily), the diagonal of the mixing matrix can be expressed as:

$$\begin{aligned} e_{00} = \frac{{f_0}^2 h_{00}}{\sum _{ij} p_{ij}} \quad \text {and}\quad e_{11} = \frac{{f_1}^2 h_{11}}{\sum _{ij} p_{ij}}, \end{aligned}$$

which can be re-written as follows:

$$\begin{aligned} h_{00} = \frac{E_{00}}{E}\frac{\sum _{ij} p_{ij}}{{f_0}^2} \quad \text {and}\quad h_{11} = \frac{E_{11}}{E}\frac{\sum _{ij} p_{ij}}{{f_1}^2}, \end{aligned}$$

(5)

where

$$\begin{aligned} \sum _{ij} p_{ij} = {f_0}^2 h_{00} + {f_1}^2 h_{11} + {f_0}{f_1}h_{01} + {f_0}{f_1}h_{10}, \end{aligned}$$

(6)

and E is the total number of edges, and $E_{00}$ and $E_{11}$ are the number of intra-group edges of the minority and majority groups, and $e_{00}$ and $e_{11}$ are fraction of intra-group edges normalized by E. By combining the equations above, $\sum _{ij} p_{ij}$ can be expressed as:

$$\begin{aligned} \sum _{ij} p_{ij} = \frac{2 f_0 f_1}{1-e_{00}(1-f_1/f_0) - e_{11}(1-f_0/f_1)}. \end{aligned}$$

(7)

By using Eqs. (5) and (7), we can retrieve $h_{00}$ and $h_{11}$ from data, given we know basic information about the network (i.e., E, $E_{00}$, $E_{11}$, and $f_0$).

We verify this method by generating networks with varying mixing parameters and compare the estimated parameters with the ground truth in Supplementary Note 5. Similar methodology can be applied to scale-free networks, finding equivalent results (see Supplementary Note 6). Though this approach requires prior knowledge about the underlying generative processes in networks, it is plausible to argue that many small-scale and large-scale social networks often fall into these two categories of topologically random or scale-free structure^28,29,30. Once the plausible topology is identified by examining the degree distribution, the appropriate formulation can be used to extract the group mixing asymmetries. In Supplementary Note 3, we discuss the relationship between asymmetrical homophily and adjusted nominal assortativity in undirected networks.

Discussion

Despite its popularity and relative accuracy in capturing homophily and assortative mixing in a variety of networks, nominal assortativity can produce distorted assessments of mixing patterns in networks with unequal groups and asymmetric mixing. In this work, we demonstrated this inadequacy and proposed ways to tackle these limitations. By using generative network models with adjustable mixing, we show that nominal assortativity fails to assess homophily accurately in certain scenarios. Our results demonstrate (1) the need for accounting for group sizes in such analyses and (2) the inability of single-valued measures to capture asymmetries in networks.

To tackle these limitations, we develop two approaches to assess group mixing in networks. First, we propose adjusted nominal assortativity to solve the group-size limitation, which accurately recovers the expected symmetric assortativity from networks. Our analysis of real-world networks reveals that nominal assortativity underestimates the strength of mixing patterns compared to the adjusted assortativity. Second, we propose to assess asymmetric mixing in networks by estimating the intra-group mixing probabilities accounting for group size differences and other group-level topological features. It is worth mentioning that there are a variety of other segregation and assortativity measurements in the social network literature beyond the nominal assortativity. Future works should focus on comparing the sensitivity, equivalency, and compatibility of those measurements against each other and baseline scenarios similar to this paper and the previous efforts³¹.

Accurately measuring biases in group mixing in social networks is crucial because mixing biases affect perception of minorities³², access to social capital³³, and algorithmic visibility³⁴, to name a few. Our work lays a novel foundation by proposing an accurate measure of assortativity that can be applied to a wide range of social networks. Better assessment of group-level tendencies and asymmetries in networks provides the means to understand how diverse groups interact—a fundamental step for uncovering mechanisms governing our social lives.

Methods

Random networks with group mixing

To analyze assortativity in networks, we develop a simple model that incorporates group mixing and random tie formation in networks. In this model, an edge between two nodes depends on their group memberships via a stochastic process by tuning the homophily parameter ranging from 0 to 1, $h \in {0,1}$. That means the probability of a node from group i to establish a tie with a node from group j is denoted as $h_{ij}$. The probability of connecting with nodes of the same group is thus the complementary function so that $h_{ii} = 1 - h_{ij}$, likewise $h_{ji} = 1 - h_{jj}$. At each simulation step, one random node is selected, and it connects to a random target based on this probability. Note that this model has equivalencies with a simple version of stochastic block models in cases where group memberships are known and are the drivers of block formations³⁵. Analytical derivations of the mixing probabilities are described in Supplementary Note 1, and the code is available in the GitHub repository.

Data availability

The sources of all empirical data used in our analyses are described in Supplementary Note 6.

Code availability

All relevant code used in this study will be available at https://github.com/macoj/assortativity.

References

McPherson, M., Smith-Lovin, L. & Cook, J. M. Birds of a feather: Homophily in social networks. Annu. Rev. Sociol. 27, 415–444 (2001).
Article Google Scholar
Goodreau, S. M., Kitts, J. A. & Morris, M. Birds of a feather, or friend of a friend? Using exponential random graph models to investigate adolescent social networks. Demography 46, 103–125 (2009).
Article PubMed PubMed Central Google Scholar
Oliveira, M. et al. Group mixing drives inequality in face-to-face gatherings. Commun. Phys. 5, 1–9 (2022).
Article Google Scholar
Newman, M. E. J. Mixing patterns in networks. Phys. Rev. E 67, 026126 (2003).
Article ADS MathSciNet CAS Google Scholar
Peel, L., Delvenne, J.-C. & Lambiotte, R. Multiscale mixing patterns in networks. Proc. Natl. Acad. Sci. 115, 4057–4062 (2018).
Article ADS MathSciNet CAS PubMed PubMed Central Google Scholar
Schaible, J., Oliveira, M., Zens, M. & Génois, M. Sensing close-range proximity for studying face-to-face interaction. In Handbook of Computational Social Science. Vol. 1. 219–239 (Routledge, 2021).
Efferson, C., Lalive, R. & Fehr, E. The coevolution of cultural groups and ingroup favoritism. Science 321, 1844–1849 (2008).
Article ADS CAS PubMed Google Scholar
Traud, A. L., Mucha, P. J. & Porter, M. A. Social structure of Facebook networks. Phys. A Stat. Mech. Appl. 391, 4165–4180 (2012).
Article Google Scholar
Pepe, A. & Rodriguez, M. A. Collaboration in sensor network research: An in-depth longitudinal analysis of assortative mixing patterns. Scientometrics 84, 687–701 (2010).
Article CAS PubMed Google Scholar
Jacobson, K. & Sandler, M. Musically meaningful or just noise? An analysis of on-line artist networks. In Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5493. 107–118 (LNCS, 2009).
Oliveira, M., Bastos-Filho, C. & Menezes, R. Political social networks reveal strong party loyalty in Brazil and weak regionalism. In The Sixth ASE International Conference on Social Computing. 1–8 (Stanford, 2014). https://doi.org/10.13140/RG.2.2.35595.08489.
Bucur, D. Gender homophily in online book networks. Inf. Sci. 481, 229–243 (2019).
Article Google Scholar
Cheng, S., Price, D. C., Karkar, S. & Bhattacharya, D. Exploring biotic interactions within protist cell populations using network methods. J. Eukaryotic Microbiol. 61, 399–403 (2014).
Article CAS Google Scholar
Lusseau, D. & Newman, M. E. J. Identifying the role that animals play in their social networks. Proc. R. Soc. Lond. Ser. B Biol. Sci. 271, 477–481 (2004).
Article Google Scholar
Cinelli, M., Peel, L., Iovanella, A. & Delvenne, J.-C. Network constraints on the mixing patterns of binary node metadata. Phys. Rev. E 102, 062310 (2020).
Article ADS MathSciNet CAS PubMed Google Scholar
National Science Foundation, National Center for Science and Engineering Statistics. Women, Minorities, and Persons with Disabilities in Science and Engineering: 2019. Special Report NSF 19-304. (ERIC Clearinghouse, 2019).
National Science Board, National Science Foundation. In Science and Engineering Indicators 2020: The State of U.S. Science and Engineering. NSB-2020-1. (ERIC Clearinghouse, 2020).
Kong, H., Martin-Gutierrez, S. & Karimi, F. Influence of the first-mover advantage on the gender disparities in physics citations. Commun. Phys. 5, 1–11 (2022).
Article Google Scholar
Karimi, F., Oliveira, M. & Strohmaier, M. Minorities in networks and algorithms. arXiv preprint arXiv:2206.07113 (2022).
Jadidi, M., Karimi, F., Lietz, H. & Wagner, C. Gender disparities in science? Dropout, productivity, collaborations and success of male and female computer scientists. Adv. Complex Syst. 21, 1750011 (2018).
Article MathSciNet Google Scholar
Karimi, F., Génois, M., Wagner, C., Singer, P. & Strohmaier, M. Homophily influences ranking of minorities in social networks. Sci. Rep. 8, 11077 (2018).
Article ADS PubMed PubMed Central Google Scholar
Fournet, J. & Barrat, A. Contact patterns among high school students. PLoS ONE 9, e107878 (2014).
Article ADS PubMed PubMed Central Google Scholar
Mastrandrea, R., Fournet, J. & Barrat, A. Contact patterns in a high school: A comparison between data collected using wearable sensors, contact diaries and friendship surveys. PLoS ONE 10, e0136497 (2015).
Article PubMed PubMed Central Google Scholar
Génois, M. et al. Combining sensors and surveys to study social contexts: Case of scientific conferences. arXiv preprint arXiv:2206.05201 (2022).
Anscombe, F. J. Graphs in statistical analysis. Am. Stat. 27, 17–21 (1973).
Google Scholar
Piraveenan, M., Prokopenko, M. & Zomaya, A. Y. On congruity of nodes and assortative information content in complex networks. Netw. Heterog. Media 7, 441–461 (2012).
Article MathSciNet MATH Google Scholar
Ureña-Carrion, J., Karimi, F., Iñiguez, G. & Kivelä, M. Assortative and preferential attachment lead to core-periphery networks. arXiv preprint arXiv:2305.15061 (2023).
Isella, L. et al. Close encounters in a pediatric ward: Measuring face-to-face proximity and mixing patterns with wearable sensors. PLoS ONE 6, e17144 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Broido, A. D. & Clauset, A. Scale-free networks are rare. Nat. Commun. 10, 1017 (2019).
Article ADS PubMed PubMed Central Google Scholar
Holme, P. Rare and everywhere: Perspectives on scale-free networks. Nat. Commun. 10, 1016 (2019).
Article ADS PubMed PubMed Central Google Scholar
Bojanowski, M. & Corten, R. Measuring segregation in social networks. Soc. Netw. 39, 14–32 (2014).
Article Google Scholar
Lee, E. et al. Homophily and minority-group size explain perception biases in social networks. Nat. Hum. Behav. 3, 1078–1087 (2019).
Article PubMed PubMed Central Google Scholar
Jackson, M. O. Inequality’s economic and social roots: The role of social networks and homophily. Available at SSRN 3795626 (2021).
Espín-Noboa, L., Wagner, C., Strohmaier, M. & Karimi, F. Inequality and inequity in network-based ranking and recommendation algorithms. Sci. Rep. 12, 2012 (2022).
Article ADS PubMed PubMed Central Google Scholar
Karrer, B. & Newman, M. E. Stochastic blockmodels and community structure in networks. Phys. Rev. E 83, 016107 (2011).
Article ADS MathSciNet Google Scholar

Download references

Acknowledgements

We thank Leto Peel, Matteo Cinelli, Samuel Martin-Gutierrez, and the Network Inequality group for helpful feedback on the first draft of our manuscript. FK was partly supported by the EU Horizon Europe project MAMMOth (Grant Agreement 101070285).

Author information

Authors and Affiliations

Complexity Science Hub Vienna, 1080, Vienna, Austria
Fariba Karimi
Graz University of Technology, Graz, Austria
Fariba Karimi
Computer Science, University of Exeter, Exeter, UK
Marcos Oliveira

Authors

Fariba Karimi
View author publications
You can also search for this author in PubMed Google Scholar
Marcos Oliveira
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

F.K. and M.O. proposed the project, wrote relevant code, carried out analytical analyses, wrote the first draft, and reviewed the final manuscript.

Corresponding authors

Correspondence to Fariba Karimi or Marcos Oliveira.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Karimi, F., Oliveira, M. On the inadequacy of nominal assortativity for assessing homophily in networks. Sci Rep 13, 21053 (2023). https://doi.org/10.1038/s41598-023-48113-5

Download citation

Received: 10 August 2023
Accepted: 22 November 2023
Published: 29 November 2023
DOI: https://doi.org/10.1038/s41598-023-48113-5

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.