Assortativity

Homophily

tendency of individuals to associate disproportionately with others who are similar (e.g. ethnicity, age, profession, religion, etc) to themselves.

Lazarsfeld, P.F. “Friendship as a social process: A substantive and methodological analysis.” Freedom and Control in Modern Society (1954)

The tendency of individuals to associate with others who are similar to themselves is known as homophily or assortative mixing. On the other hand, disassortative mixing, the tendency of individuals to associate with others who are unlike them, is also widely observed in information, technological, and biological networks [@newman2018Networks].

Modularity

To talk about assortativity, we need to specify the characteristic that defines the similarity between nodes. When such characteristics are unknown, the Unsupervised Learning task of grouping nodes into clusters is called community detection.

The idea of a community is that nodes in the same community have a higher probability of being connected than those in different communities. To capture this idea, modularity of a division is defined as

Q = \frac{∣ E _{comm} ∣ - E _{unif} ∣ E _{comm} ∣}{m},

where $E_{comm}$ is the set of edges within communities, and $E_{unif}$ places all $m$ edges uniformly at random. A large modularity means a low entropy, which further suggests a strong community structure.

More formally, suppose $g_{i}$ returns the community index of node $i$ . Then,

∣ E_{comm} ∣ = \frac{1}{2} ij \sum A_{ij} 1 {g_{i} = g_{j}} .

If we break the network into stubs and reconnect the stubs uniformly at random, the expected number of edges within communities is

E ∣ E_{comm} ∣ = \frac{1}{2} ij \sum E [1 {(i, j) \in E}] \cdot 1 {g_{i} = g_{j}} = \frac{1}{2} ij \sum \frac{d _{i} d _{j}}{2 m} \cdot 1 {g_{i} = g_{j}} .

Thus,

Q = \frac{1}{2 m} ij \sum (A_{ij} - \frac{d _{i} d _{j}}{2 m}) 1 {g_{i} = g_{j}} . (1)

Let $Δ = (1 {g_{i} = g_{j}})_{ij}$ . The matrix form of modularity is

Q = \frac{1}{2 m} 1^{T} (A - \frac{d d ^{T}}{2 m}) ⊙ Δ 1 .

Modularity Maximization

Modularity can serve as an objective function for community detection. The discrete optimization problem is NP-hard in general. Consider a partition into two communities with index $- 1$ and $1$ . Then,

1 {g_{i} = g_{j}} = \frac{1}{2} (1 + g_{i} g_{j}) .

Note that $\sum_{ij} (A_{ij} - d_{i} d_{j} / (2 m)) = 2 m - 2 m = 0$ . Thus, for this binary partition, we have

Q = \frac{1}{4 m} g^{T} (A - d d^{T} / (2 m)) g,

recovering the ^spectral-partition with the graph Laplacian replaced by the modularity matrix $A - d d^{T} / (2 m)$ . However, here we can directly let the relaxed real-valued $g$ be the leading eigenvector of the modularity matrix, without degenerating to the trivial case where all nodes are in the same community. This method is also referred to as the spectral modularity maximization.

Assortative Mixing

Now suppose nodes have a scalar characteristic. If nodes with similar values tend to be connected together more often than those with different values, then the network is considered assortatively mixed according to that characteristic.

To see if links are formed based on a scalar characteristic $x$ , we calculate the covariance between the characteristic values of the two endpoints of a randomly chosen edge:

Cov (x_{i}, x_{j}) = \frac{\sum _{ij} A _{ij} ( x _{i} - μ ) ( x _{j} - μ )}{\sum _{ij} A _{ij}},

where the marginal mean $μ$ is defined as

μ = \frac{\sum _{ij} A _{ij} x _{i}}{\sum _{ij} A _{ij}} = \frac{\sum _{i} d _{i} x _{i}}{2 m} .

Thus,

Cov (x_{i}, x_{j}) = \frac{\sum _{ij} x _{i} x _{j}}{2 m} - \frac{\sum _{ij} d _{i} x _{i} d _{j} x _{j}}{( 2 m ) ^{2}} = \frac{1}{2 m} ij \sum (A_{ij} - \frac{d _{i} d _{j}}{2 m}) x_{i} x_{j},

which recovers ^eq-mod by replacing the community indicator with the characteristic value. In matrix form, we have

Cov (x_{i}, x_{j}) = \frac{1}{2 m} x^{T} (A - \frac{d d ^{T}}{2 m}) x .

Note that

Var (x_{i}) = Cov (x_{i}, x_{i}) = \frac{1}{2 m} ij \sum (d_{i} δ_{ij} - \frac{d _{i} d _{j}}{2 m}) x_{i} x_{j} = \frac{1}{2 m} x^{T} (D - \frac{d d ^{T}}{2 m}) x .

We define the assortativity coefficient as the correlation:

r = \frac{x ^{T} ( A - d d ^{T} / ( 2 m )) x}{x ^{T} ( D - d d ^{T} / ( 2 m )) x} .

$r = 1$ means perfect assortative mixing, $r = 0$ means no assortative mixing, and $r = - 1$ means perfect disassortative mixing.

Degree Correlation

When the characteristic is the degree, associative mixing gives core-periphery structures, where high-degree nodes tend to stick together to form a core surrounded by a periphery of low-degree nodes, commonly observed in social networks. Disassortative mixing by degree tend to give hub-and-spoke structures, where high-degree nodes are hubs that connect many low-degree nodes that connect to other hubs. Replacing $x$ with the degree vector $d$ gives the assortativity coefficient by degree.

Networked Networks

Table of Contents

Backlinks

Graph View

Assortativity

Table of Contents

Assortativity

Modularity

Modularity Maximization

Assortative Mixing

Degree Correlation

Backlinks

Graph View