Stochastic Block Model

  • Motivation: Stochastic block models (SBMs) generalize ER models to accommodate community structure, also known as homophily or assortative mixing
    • Homophily

      tendency of individuals to associate disproportionately with others who are similar (e.g. ethnicity, age, profession, religion, etc) to themselves.

      • Lazarsfeld, P.F. “Friendship as a social process: A substantive and methodological analysis.” Freedom and Control in Modern Society (1954)
      Link to original
  • Model: Fixed , undirected, a partition of nodes into disjoint communities , with and
    • We write and for the absolute and relative size of community , respectively
    • We write for edge probabilities between communities
  • Basic properties
    • Expected degree:
  • Remarks:
    • Reduces to ER model when there is only one community
    • Thus also known as multi-type ER model

Multi-Type Branching

Analyzing SBM requires generalizing the Local Branching to multi-type.

Transclude of 6-7260-hw2#problem-2

Testing

Consider a two-community SBM with and for some constants . We can recover the community structure with high probability if and only if .

Degree Correlated SBM

Within each community, standard SBM recovers ER model. We can also equip Chung–Lu Model with community structure to allow for degree heterogeneity within communities. This is known as degree-correlated SBM. Specifically, for an expected degree sequence and community weight , we have . To ensure node has the expect degree of , we have linear constraints for :