How to determine if a random graph with a fixed degree sequence has - - PowerPoint PPT Presentation

how to determine if a random graph with a fixed degree
SMART_READER_LITE
LIVE PREVIEW

How to determine if a random graph with a fixed degree sequence has - - PowerPoint PPT Presentation

How to determine if a random graph with a fixed degree sequence has a giant component Felix Joos, Guillem Perarnau, Dieter Rautenbach, Bruce Reed A walkthrough by Angus Southwell Monash University 1 Random graphs with a given degree sequence: G


slide-1
SLIDE 1

How to determine if a random graph with a fixed degree sequence has a giant component

Felix Joos, Guillem Perarnau, Dieter Rautenbach, Bruce Reed

A walkthrough by Angus Southwell

Monash University 1

slide-2
SLIDE 2

Random graphs with a given degree sequence: Gn,❞

Definition Let ❞ = (d1, . . . , dn). Then let G(❞) be a uniformly chosen simple graph with labelled vertices {1, . . . , n} and degree sequence ❞. The probability space of such graphs is Gn,❞.

  • Pro: these graphs are much more like most real-world graphs

than G(n, p).

  • Con: they are much more complicated to analyse.

Example In G(n, p), P (u ∼ v) = p trivially. In Gn,❞, P (u ∼ v) is not known in general.

2

slide-3
SLIDE 3

Random graphs with a given degree sequence: Gn,❞

Definition Let ❞ = (d1, . . . , dn). Then let G(❞) be a uniformly chosen simple graph with labelled vertices {1, . . . , n} and degree sequence ❞. The probability space of such graphs is Gn,❞.

  • Pro: these graphs are much more like most real-world graphs

than G(n, p).

  • Con: they are much more complicated to analyse.

Example In G(n, p), P (u ∼ v) = p trivially. In Gn,❞, P (u ∼ v) is not known in general.

2

slide-4
SLIDE 4

Random graphs with a given degree sequence: Gn,❞

Definition Let ❞ = (d1, . . . , dn). Then let G(❞) be a uniformly chosen simple graph with labelled vertices {1, . . . , n} and degree sequence ❞. The probability space of such graphs is Gn,❞.

  • Pro: these graphs are much more like most real-world graphs

than G(n, p).

  • Con: they are much more complicated to analyse.

Example In G(n, p), P (u ∼ v) = p trivially. In Gn,❞, P (u ∼ v) is not known in general.

2

slide-5
SLIDE 5

Random graphs with a given degree sequence: Gn,❞

Definition Let ❞ = (d1, . . . , dn). Then let G(❞) be a uniformly chosen simple graph with labelled vertices {1, . . . , n} and degree sequence ❞. The probability space of such graphs is Gn,❞.

  • Pro: these graphs are much more like most real-world graphs

than G(n, p).

  • Con: they are much more complicated to analyse.

Example In G(n, p), P (u ∼ v) = p trivially. In Gn,❞, P (u ∼ v) is not known in general.

2

slide-6
SLIDE 6

Switchings

vt w ❙t−1 y x vt w ❙t−1 y x A switching is an operation that takes G(❞) to G ′(❞). They are used to find the probability of certain events occurring that we previously got via configuration model, such as:

  • probability of specific edges being present,
  • probability that a given undiscovered vertex is found at the

next step,

  • probability of a giant component in the preprocessed vertices.

3

slide-7
SLIDE 7

Switchings

vt w ❙t−1 y x vt w ❙t−1 y x A switching is an operation that takes G(❞) to G ′(❞). They are used to find the probability of certain events occurring that we previously got via configuration model, such as:

  • probability of specific edges being present,
  • probability that a given undiscovered vertex is found at the

next step,

  • probability of a giant component in the preprocessed vertices.

3

slide-8
SLIDE 8

Giant component problem

Problem Given a random graph model, what is the distribution of the size of the largest connected component? Older results “Double jump” threshold for Erd˝

  • s–R´

enyi random graphs at around 1

2n edges.

  • Below the threshold, all components are order O(log n).
  • At the threshold, largest component has order Θ(n2/3).
  • Above the threshold, largest component has order Θ(n).

4

slide-9
SLIDE 9

Giant component problem

Problem Given a random graph model, what is the distribution of the size of the largest connected component? Older results “Double jump” threshold for Erd˝

  • s–R´

enyi random graphs at around 1

2n edges.

  • Below the threshold, all components are order O(log n).
  • At the threshold, largest component has order Θ(n2/3).
  • Above the threshold, largest component has order Θ(n).

4

slide-10
SLIDE 10

Results for fixed degree sequences

Theorem (Molloy and Reed (1995)) Let ❉(n) be a “well behaved” degree sequence with max. degree at most n

1 4 −ε. Then define

Q(D) := 1 n

  • j∈[n]

d(j)(d(j) − 2).

  • If Q(D) < 0, then all components have size O(log n).
  • If Q(D) > 0, then there exists a component with at least αn

vertices and βn cycles for α, β > 0.

5

slide-11
SLIDE 11

Proof sketch

  • Breadth first search on the graph.
  • Keep track of Xt, the number of “half edges” in your

component that can be explored still. vt wt ❙t−1

  • Show Et−1 [Xt − Xt−1] stays positive (or negative) for a

sufficiently long time.

6

slide-12
SLIDE 12

Proof sketch

  • Breadth first search on the graph.
  • Keep track of Xt, the number of “half edges” in your

component that can be explored still. vt wt ❙t−1

Figure 1: “Superman using his laser vision on the four-vertex empty graph, soon to be the three-vertex empty graph” - Tim

6

slide-13
SLIDE 13

Proof sketch

  • Breadth first search on the graph.
  • Keep track of Xt, the number of “half edges” in your

component that can be explored still. vt wt ❙t−1

  • Show Et−1 [Xt − Xt−1] stays positive (or negative) for a

sufficiently long time.

6

slide-14
SLIDE 14

Limitations of MR result

  • Proven in configuration model rather than Gn,❞.
  • Handling of large degree vertices is nonexistent.
  • Criterion does not extend to general degree sequences:

Consider n = k2 for large odd k, and ❞ = (1, . . . , 1, 2k). Then Q(D) ≈ 3, so we would expect a giant component according to the MR criterion. · · ·

1 2 (n − 1 − 2k)

Figure 1: “ An evil spider that can pick between 1 and n toothpicks as its weapon” - Tim

7

slide-15
SLIDE 15

Limitations of MR result

  • Proven in configuration model rather than Gn,❞.
  • Handling of large degree vertices is nonexistent.
  • Criterion does not extend to general degree sequences:

Consider n = k2 for large odd k, and ❞ = (1, . . . , 1, 2k). Then Q(D) ≈ 3, so we would expect a giant component according to the MR criterion. · · ·

1 2 (n − 1 − 2k)

Figure 1: “ An evil spider that can pick between 1 and n toothpicks as its weapon” - Tim

7

slide-16
SLIDE 16

Limitations of MR result

  • Proven in configuration model rather than Gn,❞.
  • Handling of large degree vertices is nonexistent.
  • Criterion does not extend to general degree sequences:

Consider n = k2 for large odd k, and ❞ = (1, . . . , 1, 2k). Then Q(D) ≈ 3, so we would expect a giant component according to the MR criterion. · · ·

1 2 (n − 1 − 2k)

Figure 1: “ An evil spider that can pick between 1 and n toothpicks as its weapon” - Tim

7

slide-17
SLIDE 17

Limitations of MR result

  • Proven in configuration model rather than Gn,❞.
  • Handling of large degree vertices is nonexistent.
  • Criterion does not extend to general degree sequences:

Consider n = k2 for large odd k, and ❞ = (1, . . . , 1, 2k). Then Q(D) ≈ 3, so we would expect a giant component according to the MR criterion. · · ·

1 2 (n − 1 − 2k)

Figure 1: “ An evil spider that can pick between 1 and n toothpicks as its weapon” - Tim

7

slide-18
SLIDE 18

Limitations of MR result

  • Proven in configuration model rather than Gn,❞.
  • Handling of large degree vertices is nonexistent.
  • Criterion does not extend to general degree sequences:

Consider n = k2 for large odd k, and ❞ = (1, . . . , 1, 2k). Then Q(D) ≈ 3, so we would expect a giant component according to the MR criterion. · · ·

1 2 (n − 1 − 2k)

Figure 1: “ An evil spider that can pick between 1 and n toothpicks as its weapon” - Tim

7

slide-19
SLIDE 19

The more comprehensive result

Theorem (Joos et al. (2018)) For any function δ → 0 as n → ∞, for every γ > 0, if RD ≤ δMD, the probability that G(D) has a component of order at least γn is

  • (1).

If there exists an ε > 0 such that RD ≥ εMD, then the probability that G(D) contains a component of size γn for some γ > 0 is 1 − o(1). 1 jD n RD MD (+2n2) Small Big

8

slide-20
SLIDE 20

The more comprehensive result

Theorem (Joos et al. (2018)) For any function δ → 0 as n → ∞, for every γ > 0, if RD ≤ δMD, the probability that G(D) has a component of order at least γn is

  • (1).

If there exists an ε > 0 such that RD ≥ εMD, then the probability that G(D) contains a component of size γn for some γ > 0 is 1 − o(1). 1 jD n RD MD (+2n2) Small Big

8

slide-21
SLIDE 21

Proof sketch

  • Breadth first search again.
  • Now with added preprocessing!

1 j∗

D

jD n RD P M (+2n2)

  • Suppression of degree 2 vertices.
  • Switchings used to work in the graph model to get edge

probabilities.

9

slide-22
SLIDE 22

Proof sketch

  • Breadth first search again.
  • Now with added preprocessing!

1 j∗

D

jD n RD P M (+2n2)

  • Suppression of degree 2 vertices.
  • Switchings used to work in the graph model to get edge

probabilities.

9

slide-23
SLIDE 23

Proof sketch

  • Breadth first search again.
  • Now with added preprocessing!

1 j∗

D

jD n RD P M (+2n2)

  • Suppression of degree 2 vertices.
  • Switchings used to work in the graph model to get edge

probabilities.

9

slide-24
SLIDE 24

Proof sketch

  • Breadth first search again.
  • Now with added preprocessing!

1 j∗

D

jD n RD P M (+2n2)

  • Suppression of degree 2 vertices.
  • Switchings used to work in the graph model to get edge

probabilities.

9

slide-25
SLIDE 25

Proof sketch

  • Breadth first search again.
  • Now with added preprocessing!

1 j∗

D

jD n RD P M (+2n2)

  • Suppression of degree 2 vertices.
  • Switchings used to work in the graph model to get edge

probabilities.

9

slide-26
SLIDE 26

Subcritical case

Let H be the graph G with all degree 2 vertices contracted, let ω > 0 be small such that RD ≤ ωMD. Let S be the smallest set of vertices of H such that

  • i∈S di ≥ 5ω1/4M and no vertex outside of S is larger.

Define initial exploration set S0 to be S ∪ {v} for any vertex v. Define X ′

t by

X ′

0 =

  • u∈S0

d(u), X ′

t = X ′ 0 + t

  • i=1

(d(wi) − 2).

10

slide-27
SLIDE 27

Subcritical case

Let H be the graph G with all degree 2 vertices contracted, let ω > 0 be small such that RD ≤ ωMD. Let S be the smallest set of vertices of H such that

  • i∈S di ≥ 5ω1/4M and no vertex outside of S is larger.

Define initial exploration set S0 to be S ∪ {v} for any vertex v. Define X ′

t by

X ′

0 =

  • u∈S0

d(u), X ′

t = X ′ 0 + t

  • i=1

(d(wi) − 2).

10

slide-28
SLIDE 28

Subcritical case

Let H be the graph G with all degree 2 vertices contracted, let ω > 0 be small such that RD ≤ ωMD. Let S be the smallest set of vertices of H such that

  • i∈S di ≥ 5ω1/4M and no vertex outside of S is larger.

Define initial exploration set S0 to be S ∪ {v} for any vertex v. Define X ′

t by

X ′

0 =

  • u∈S0

d(u), X ′

t = X ′ 0 + t

  • i=1

(d(wi) − 2).

10

slide-29
SLIDE 29

Subcritical case

Let H be the graph G with all degree 2 vertices contracted, let ω > 0 be small such that RD ≤ ωMD. Let S be the smallest set of vertices of H such that

  • i∈S di ≥ 5ω1/4M and no vertex outside of S is larger.

Define initial exploration set S0 to be S ∪ {v} for any vertex v. Define X ′

t by

X ′

0 =

  • u∈S0

d(u), X ′

t = X ′ 0 + t

  • i=1

(d(wi) − 2).

10

slide-30
SLIDE 30

Subcritical case

We get the following results about the initial stages of the exploration: Lemma

w∈V \S d(w) (d(w) − 2) ≤ −4ω1/4M,

  • there is a vertex in S of degree at most ω−1/4.

So the vertices outside the preprocessing set are “small”.

  • Process is highly concentrated around its mean.
  • All degrees outside St−1 being low helps the bounds on

switchings.

11

slide-31
SLIDE 31

Subcritical case

We get the following results about the initial stages of the exploration: Lemma

w∈V \S d(w) (d(w) − 2) ≤ −4ω1/4M,

  • there is a vertex in S of degree at most ω−1/4.

So the vertices outside the preprocessing set are “small”.

  • Process is highly concentrated around its mean.
  • All degrees outside St−1 being low helps the bounds on

switchings.

11

slide-32
SLIDE 32

Subcritical case

We get the following results about the initial stages of the exploration: Lemma

w∈V \S d(w) (d(w) − 2) ≤ −4ω1/4M,

  • there is a vertex in S of degree at most ω−1/4.

So the vertices outside the preprocessing set are “small”.

  • Process is highly concentrated around its mean.
  • All degrees outside St−1 being low helps the bounds on

switchings.

11

slide-33
SLIDE 33

Subcritical case

The game is now to bound Et−1 [d(wt) − 2] , the expected increase between X ′

t−1 and X ′ t.

Et−1 [d(wt) − 2] =

  • w /

∈St−1

(d(w) − 2)Pt−1(wt = w).

12

slide-34
SLIDE 34

Subcritical case

The game is now to bound Et−1 [d(wt) − 2] , the expected increase between X ′

t−1 and X ′ t.

Et−1 [d(wt) − 2] =

  • w /

∈St−1

(d(w) − 2)Pt−1(wt = w).

12

slide-35
SLIDE 35

Lemma If t ≤ ω1/9M and X ′

t−1 ≤ ω1/5M, and Xt′ > 0 for all t′ < t, then:

  • If w ∈ V \St−1 and d(w) = 1, then

P (wt = w) ≥ (1 − 9ω1/5) 1 Mt−1 .

  • If w ∈ V \St−1, then

P (wt = w) ≤ (1 + 9ω1/5)d(w) Mt−1 .

13

slide-36
SLIDE 36

Switchings II

vt w ❙t−1 y x vt w ❙t−1 y x Want to find the number of forward and backward switchings. Number of forward switchings is at most Mt−1. How many of these are ‘bad’?

  • x or y ∈ St−1
  • vt ∼ x
  • w ∼ y
  • Vertices overlap

14

slide-37
SLIDE 37

Switchings II

vt w ❙t−1 y x vt w ❙t−1 y x Want to find bounds on the number of forward and backward switchings. Number of forward switchings is at most Mt−1. How many of these are ‘bad’?

  • x or y ∈ St−1
  • vt ∼ x
  • w ∼ y
  • Vertices overlap

14

slide-38
SLIDE 38

Switchings II

vt w ❙t−1 y x vt w ❙t−1 y x Want to find bounds on the number of forward and backward switchings. Number of forward switchings is at most Mt−1. How many of these are ‘bad’?

  • x or y ∈ St−1
  • vt ∼ x
  • w ∼ y
  • Vertices overlap

14

slide-39
SLIDE 39

Subcritical case: finishing touches

Lemma Define Yt = d(wt) − Et−1(d(wt)). The probability that there exists a t such that

t′≤t Yt′ > M2/3 is less than e−M1/4.

Lemma For t ≤ ⌊ ω1/9M

2

⌋, we have that Et−1 (d(wt) − 2) ≤ − t M + 19ω1/5. Lemma With probability greater than 1 − e−M1/4, there exists a time t ≤ ⌊ ω1/9M

3

⌋ such that Xt = 0.

15

slide-40
SLIDE 40

Subcritical case: finishing touches

Lemma Define Yt = d(wt) − Et−1(d(wt)). The probability that there exists a t such that

t′≤t Yt′ > M2/3 is less than e−M1/4.

Lemma For t ≤ ⌊ ω1/9M

2

⌋, we have that Et−1 (d(wt) − 2) ≤ − t M + 19ω1/5. Lemma With probability greater than 1 − e−M1/4, there exists a time t ≤ ⌊ ω1/9M

3

⌋ such that Xt = 0.

15

slide-41
SLIDE 41

Subcritical case: finishing touches

Lemma Define Yt = d(wt) − Et−1(d(wt)). The probability that there exists a t such that

t′≤t Yt′ > M2/3 is less than e−M1/4.

Lemma For t ≤ ⌊ ω1/9M

2

⌋, we have that Et−1 (d(wt) − 2) ≤ − t M + 19ω1/5. Lemma With probability greater than 1 − e−M1/4, there exists a time t ≤ ⌊ ω1/9M

3

⌋ such that Xt = 0.

15

slide-42
SLIDE 42

Supercritical case

Same exploration process, but different preprocessing. Preprocessing – supercritical case Expose all components in H containing a vertex of degree larger than

√ M log M . Call the set of exposed vertices U.

Analysis splits into two cases:

u∈U d(u) ≥ R 100 – then U contains a giant component,

u∈U d(u) < R 100 – same exploration as in the subcritical

case.

16

slide-43
SLIDE 43

Supercritical case

Same exploration process, but different preprocessing. Preprocessing – supercritical case Expose all components in H containing a vertex of degree larger than

√ M log M . Call the set of exposed vertices U.

Analysis splits into two cases:

u∈U d(u) ≥ R 100 – then U contains a giant component,

u∈U d(u) < R 100 – same exploration as in the subcritical

case.

16

slide-44
SLIDE 44

Supercritical case

Same exploration process, but different preprocessing. Preprocessing – supercritical case Expose all components in H containing a vertex of degree larger than

√ M log M . Call the set of exposed vertices U.

Analysis splits into two cases:

u∈U d(u) ≥ R 100 – then U contains a giant component,

u∈U d(u) < R 100 – same exploration as in the subcritical

case.

16

slide-45
SLIDE 45

Supercritical case

Lemma Let U be a set of vertices containing all vertices with degree greater than

√ M log M and let 1 4 < c < 1 be such that u∈U d(u) ≤ cR. Then

  • u∈V \U

d(u) (d(u) − 2) ≥ (1 − c) 2 R.

17

slide-46
SLIDE 46

Supercritical switching analysis

vt w ❙t−1 y x vt w ❙t−1 y x Same switching, more complicated bounds: need a lower bound on #backward. Bad backward switchings are:

  • vt ∼ w
  • x ∼ y
  • y = x

18

slide-47
SLIDE 47

Supercritical switching analysis

vt w ❙t−1 y x vt w ❙t−1 y x Same switching, more complicated bounds: need a lower bound on #backward. Bad backward switchings are:

  • vt ∼ w
  • x ∼ y
  • y = x

18

slide-48
SLIDE 48

Supercritical case

Lemma Let β = 10−6ε2 be a fixed constant. If Mt−1 ≥ 3M

4

and Xt−1 ≤ βM, then for every w ∈ St−1, (1 − 10

  • β)d(w)

Mt−1 ≤ P (w = wt) ≤ (1 + 10

  • β)d(w)

Mt−1 . Futhermore, P

  • d′

t(w) ≥ ⌊2

  • βd(w)⌋ + i
  • w = wt
  • ≤ βi/2,

where d′

t(w) is the number of edges from w to St−1\ {vt} in H

and the number of loops at w in H.

  • Conditions imply linear but still early stages of exploration
  • Probabilities no longer asymptotically equal

19

slide-49
SLIDE 49

Supercritical case

Lemma Let β = 10−6ε2 be a fixed constant. If Mt−1 ≥ 3M

4

and Xt−1 ≤ βM, then for every w ∈ St−1, (1 − 10

  • β)d(w)

Mt−1 ≤ P (w = wt) ≤ (1 + 10

  • β)d(w)

Mt−1 . Futhermore, P

  • d′

t(w) ≥ ⌊2

  • βd(w)⌋ + i
  • w = wt
  • ≤ βi/2,

where d′

t(w) is the number of edges from w to St−1\ {vt} in H

and the number of loops at w in H.

  • Conditions imply linear but still early stages of exploration
  • Probabilities no longer asymptotically equal

19

slide-50
SLIDE 50

Supercritical case

Lemma Let β = 10−6ε2 be a fixed constant. If Mt−1 ≥ 3M

4

and Xt−1 ≤ βM, then for every w ∈ St−1, (1 − 10

  • β)d(w)

Mt−1 ≤ P (w = wt) ≤ (1 + 10

  • β)d(w)

Mt−1 . Futhermore, P

  • d′

t(w) ≥ ⌊2

  • βd(w)⌋ + i
  • w = wt
  • ≤ βi/2,

where d′

t(w) is the number of edges from w to St−1\ {vt} in H

and the number of loops at w in H.

  • Conditions imply linear but still early stages of exploration
  • Probabilities no longer asymptotically equal

19

slide-51
SLIDE 51

Lemma For t ≤ τ, E [d(wt) − 2] ≥ ε

4, E [d′ t(wt)] ≤ E[d(wt)−2] 3

, and thus E [Xt − Xt−1] ≥

ε 12.

Here τ is the smallest t for which either Xt ≥ βM or Mt ≤

  • 1 −

R 4M

  • M0.

Xt ≥ Xt−1 + (d(wt) − 2) − 2d′

t(wt), so we can use this recursively

to get...

20

slide-52
SLIDE 52

Lemma For t ≤ τ, E [d(wt) − 2] ≥ ε

4, E [d′ t(wt)] ≤ E[d(wt)−2] 3

, and thus E [Xt − Xt−1] ≥

ε 12.

Here τ is the smallest t for which either Xt ≥ βM or Mt ≤

  • 1 −

R 4M

  • M0.

Xt ≥ Xt−1 + (d(wt) − 2) − 2d′

t(wt), so we can use this recursively

to get...

20

slide-53
SLIDE 53

Xτ ≥ E [Xτ] +

  • t≤τ

At +

  • t≤τ

Bt, where At = d(wt) − E [d(wt)] and Bt = d′

s(wt) − E [d′ t(wt)].

Lemma With probability 1 − o(1), there exists no t ≤ τ for which

s≤t As

  • r

s≤t Bs are greater than M log log M .

Lemma With probability 1 − o(1), Xτ ≥ βM.

21

slide-54
SLIDE 54

Xτ ≥ E [Xτ] +

  • t≤τ

At +

  • t≤τ

Bt, where At = d(wt) − E [d(wt)] and Bt = d′

s(wt) − E [d′ t(wt)].

Lemma With probability 1 − o(1), there exists no t ≤ τ for which

s≤t As

  • r

s≤t Bs are greater than M log log M .

Lemma With probability 1 − o(1), Xτ ≥ βM.

21

slide-55
SLIDE 55

Small caveats

  • We found bounds on the number of edges in each component,

not vertices!

  • What about degree 2 vertices?
  • Degree 2 vertices in components of H(D)
  • Degree 2 vertices in cyclic components
  • The case of too many degree 2 vertices

22

slide-56
SLIDE 56

Small caveats

  • We found bounds on the number of edges in each component,

not vertices!

  • What about degree 2 vertices?
  • Degree 2 vertices in components of H(D)
  • Degree 2 vertices in cyclic components
  • The case of too many degree 2 vertices

22

slide-57
SLIDE 57

Small caveats

  • We found bounds on the number of edges in each component,

not vertices!

  • What about degree 2 vertices?
  • Degree 2 vertices in components of H(D)
  • Degree 2 vertices in cyclic components
  • The case of too many degree 2 vertices

22

slide-58
SLIDE 58

Thank you!

23

slide-59
SLIDE 59

References

Felix Joos, Guillem Perarnau, Dieter Rautenbach, and Bruce Reed. How to determine if a random graph with a fixed degree sequence has a giant component. Probability Theory and Related Fields, 170(1):263–310, Feb 2018. ISSN 1432-2064. doi: 10.1007/s00440-017-0757-1. URL https://doi.org/10.1007/s00440-017-0757-1. Michael Molloy and Bruce Reed. A critical point for random graphs with a given degree sequence. Random Struct. Algorithms, 6(2-3):161–180, March 1995. ISSN 1042-9832. doi: 10.1002/rsa.3240060204. URL http://dx.doi.org/10.1002/rsa.3240060204.

24