Transitivity
Network transitivity, also known as clustering, means the presence of a heightened number of triangles in the Network. A triangle consists of three nodes that are mutually connected. We define the local/individual clustering coefficient of a node in an undirected graph as
We use the convention that if . Then, we can define the average clustering coefficient of the graph as .
A quite different measure of clustering is the global/overall clustering coefficient, also known as the fraction of transitive triples, is defined as
In the numerator, we count the number of triangles in the graph, each counted 6 times; in the denominator, we count the number of paths of length two (paths with different start and end nodes are different paths), or equivalently, the number of “wedges” (connected triplets, specified by a path of two without distinguishing the start and end nodes), each counted 2 times. Therefore, we can rewrite the global clustering coefficient as
The overall clustering can be thought of a weighted averaging of clustering across nodes with weights proportional to , while average clustering weights all nodes equally. Thus, the average clustering coefficient tends to overweight the contribution of low-degree nodes than the global clustering coefficient. Consequently, if there is a negative correlation between clustering and degree, then the overall clustering will be lower than the average clustering, and vice versa.
For directed graphs, in both the local and global clustering coefficients, we count in the denominator directed paths: , and in the numerator directed triangles: . This is the true definition of transitivity.
Clique
We can extend transitivity on three nodes to more nodes. A clique in a Graph is a maximally complete subgraph, that is, a subset of vertices such that every two distinct vertices are adjacent, and no additional vertex can be included without breaking this property. We can measure the size and number of cliques similarly.