[PPT] - How to determine if a random graph with a fixed degree sequence has PowerPoint Presentation

SLIDE 1

How to determine if a random graph with a fixed degree sequence has a giant component

Felix Joos, Guillem Perarnau, Dieter Rautenbach, Bruce Reed

A walkthrough by Angus Southwell

Monash University 1

SLIDE 2

Random graphs with a given degree sequence: Gn,❞

Definition Let ❞ = (d1, . . . , dn). Then let G(❞) be a uniformly chosen simple graph with labelled vertices {1, . . . , n} and degree sequence ❞. The probability space of such graphs is Gn,❞.

Pro: these graphs are much more like most real-world graphs

than G(n, p).

Con: they are much more complicated to analyse.

Example In G(n, p), P (u ∼ v) = p trivially. In Gn,❞, P (u ∼ v) is not known in general.

2

SLIDE 3

Random graphs with a given degree sequence: Gn,❞

Definition Let ❞ = (d1, . . . , dn). Then let G(❞) be a uniformly chosen simple graph with labelled vertices {1, . . . , n} and degree sequence ❞. The probability space of such graphs is Gn,❞.

Pro: these graphs are much more like most real-world graphs

than G(n, p).

Con: they are much more complicated to analyse.

Example In G(n, p), P (u ∼ v) = p trivially. In Gn,❞, P (u ∼ v) is not known in general.

2

SLIDE 4

Random graphs with a given degree sequence: Gn,❞

Definition Let ❞ = (d1, . . . , dn). Then let G(❞) be a uniformly chosen simple graph with labelled vertices {1, . . . , n} and degree sequence ❞. The probability space of such graphs is Gn,❞.

Pro: these graphs are much more like most real-world graphs

than G(n, p).

Con: they are much more complicated to analyse.

Example In G(n, p), P (u ∼ v) = p trivially. In Gn,❞, P (u ∼ v) is not known in general.

2

SLIDE 5

Random graphs with a given degree sequence: Gn,❞

Definition Let ❞ = (d1, . . . , dn). Then let G(❞) be a uniformly chosen simple graph with labelled vertices {1, . . . , n} and degree sequence ❞. The probability space of such graphs is Gn,❞.

Pro: these graphs are much more like most real-world graphs

than G(n, p).

Con: they are much more complicated to analyse.

Example In G(n, p), P (u ∼ v) = p trivially. In Gn,❞, P (u ∼ v) is not known in general.

2

SLIDE 6

Switchings

vt w ❙t−1 y x vt w ❙t−1 y x A switching is an operation that takes G(❞) to G ′(❞). They are used to find the probability of certain events occurring that we previously got via configuration model, such as:

probability of specific edges being present,
probability that a given undiscovered vertex is found at the

next step,

probability of a giant component in the preprocessed vertices.

3

SLIDE 7

Switchings

vt w ❙t−1 y x vt w ❙t−1 y x A switching is an operation that takes G(❞) to G ′(❞). They are used to find the probability of certain events occurring that we previously got via configuration model, such as:

probability of specific edges being present,
probability that a given undiscovered vertex is found at the

next step,

probability of a giant component in the preprocessed vertices.

3

SLIDE 8

Giant component problem

Problem Given a random graph model, what is the distribution of the size of the largest connected component? Older results “Double jump” threshold for Erd˝

s–R´

enyi random graphs at around 1

2n edges.

Below the threshold, all components are order O(log n).
At the threshold, largest component has order Θ(n2/3).
Above the threshold, largest component has order Θ(n).

4

SLIDE 9

Giant component problem

Problem Given a random graph model, what is the distribution of the size of the largest connected component? Older results “Double jump” threshold for Erd˝

s–R´

enyi random graphs at around 1

2n edges.

Below the threshold, all components are order O(log n).
At the threshold, largest component has order Θ(n2/3).
Above the threshold, largest component has order Θ(n).

4

SLIDE 10

Results for fixed degree sequences

Theorem (Molloy and Reed (1995)) Let ❉(n) be a “well behaved” degree sequence with max. degree at most n

1 4 −ε. Then define

Q(D) := 1 n

j∈[n]

d(j)(d(j) − 2).

If Q(D) < 0, then all components have size O(log n).
If Q(D) > 0, then there exists a component with at least αn

vertices and βn cycles for α, β > 0.

5

SLIDE 11

Proof sketch

Breadth first search on the graph.
Keep track of Xt, the number of “half edges” in your

component that can be explored still. vt wt ❙t−1

Show Et−1 [Xt − Xt−1] stays positive (or negative) for a

sufficiently long time.

6

SLIDE 12

Proof sketch

Breadth first search on the graph.
Keep track of Xt, the number of “half edges” in your

component that can be explored still. vt wt ❙t−1

Figure 1: “Superman using his laser vision on the four-vertex empty graph, soon to be the three-vertex empty graph” - Tim

6

SLIDE 13

Proof sketch

Breadth first search on the graph.
Keep track of Xt, the number of “half edges” in your

component that can be explored still. vt wt ❙t−1

Show Et−1 [Xt − Xt−1] stays positive (or negative) for a

sufficiently long time.

6

SLIDE 14

Limitations of MR result

Proven in configuration model rather than Gn,❞.
Handling of large degree vertices is nonexistent.
Criterion does not extend to general degree sequences:

Consider n = k2 for large odd k, and ❞ = (1, . . . , 1, 2k). Then Q(D) ≈ 3, so we would expect a giant component according to the MR criterion. · · ·

1 2 (n − 1 − 2k)

Figure 1: “ An evil spider that can pick between 1 and n toothpicks as its weapon” - Tim

7

SLIDE 15

Limitations of MR result

Proven in configuration model rather than Gn,❞.
Handling of large degree vertices is nonexistent.
Criterion does not extend to general degree sequences:

Consider n = k2 for large odd k, and ❞ = (1, . . . , 1, 2k). Then Q(D) ≈ 3, so we would expect a giant component according to the MR criterion. · · ·

1 2 (n − 1 − 2k)

Figure 1: “ An evil spider that can pick between 1 and n toothpicks as its weapon” - Tim

7

SLIDE 16

Limitations of MR result

Proven in configuration model rather than Gn,❞.
Handling of large degree vertices is nonexistent.
Criterion does not extend to general degree sequences:

Consider n = k2 for large odd k, and ❞ = (1, . . . , 1, 2k). Then Q(D) ≈ 3, so we would expect a giant component according to the MR criterion. · · ·

1 2 (n − 1 − 2k)

Figure 1: “ An evil spider that can pick between 1 and n toothpicks as its weapon” - Tim

7

SLIDE 17

Limitations of MR result

Proven in configuration model rather than Gn,❞.
Handling of large degree vertices is nonexistent.
Criterion does not extend to general degree sequences:

Consider n = k2 for large odd k, and ❞ = (1, . . . , 1, 2k). Then Q(D) ≈ 3, so we would expect a giant component according to the MR criterion. · · ·

1 2 (n − 1 − 2k)

Figure 1: “ An evil spider that can pick between 1 and n toothpicks as its weapon” - Tim

7

SLIDE 18

Limitations of MR result

Proven in configuration model rather than Gn,❞.
Handling of large degree vertices is nonexistent.
Criterion does not extend to general degree sequences:

Consider n = k2 for large odd k, and ❞ = (1, . . . , 1, 2k). Then Q(D) ≈ 3, so we would expect a giant component according to the MR criterion. · · ·

1 2 (n − 1 − 2k)

Figure 1: “ An evil spider that can pick between 1 and n toothpicks as its weapon” - Tim

7

SLIDE 19

The more comprehensive result

Theorem (Joos et al. (2018)) For any function δ → 0 as n → ∞, for every γ > 0, if RD ≤ δMD, the probability that G(D) has a component of order at least γn is

(1).

If there exists an ε > 0 such that RD ≥ εMD, then the probability that G(D) contains a component of size γn for some γ > 0 is 1 − o(1). 1 jD n RD MD (+2n2) Small Big

8

SLIDE 20

The more comprehensive result

Theorem (Joos et al. (2018)) For any function δ → 0 as n → ∞, for every γ > 0, if RD ≤ δMD, the probability that G(D) has a component of order at least γn is

(1).

If there exists an ε > 0 such that RD ≥ εMD, then the probability that G(D) contains a component of size γn for some γ > 0 is 1 − o(1). 1 jD n RD MD (+2n2) Small Big

8

SLIDE 21

Proof sketch

Breadth first search again.
Now with added preprocessing!

1 j∗

D

jD n RD P M (+2n2)

Suppression of degree 2 vertices.
Switchings used to work in the graph model to get edge

probabilities.

9

SLIDE 22

Proof sketch

Breadth first search again.
Now with added preprocessing!

1 j∗

D

jD n RD P M (+2n2)

Suppression of degree 2 vertices.
Switchings used to work in the graph model to get edge

probabilities.

9

SLIDE 23

Proof sketch

Breadth first search again.
Now with added preprocessing!

1 j∗

D

jD n RD P M (+2n2)

Suppression of degree 2 vertices.
Switchings used to work in the graph model to get edge

probabilities.

9

SLIDE 24

Proof sketch

Breadth first search again.
Now with added preprocessing!

1 j∗

D

jD n RD P M (+2n2)

Suppression of degree 2 vertices.
Switchings used to work in the graph model to get edge

probabilities.

9

SLIDE 25

Proof sketch

Breadth first search again.
Now with added preprocessing!

1 j∗

D

jD n RD P M (+2n2)

Suppression of degree 2 vertices.
Switchings used to work in the graph model to get edge

probabilities.

9

SLIDE 26

Subcritical case

Let H be the graph G with all degree 2 vertices contracted, let ω > 0 be small such that RD ≤ ωMD. Let S be the smallest set of vertices of H such that

i∈S di ≥ 5ω1/4M and no vertex outside of S is larger.

Define initial exploration set S0 to be S ∪ {v} for any vertex v. Define X ′

t by

X ′

0 =

u∈S0

d(u), X ′

t = X ′ 0 + t

i=1

(d(wi) − 2).

10

SLIDE 27

Subcritical case

Let H be the graph G with all degree 2 vertices contracted, let ω > 0 be small such that RD ≤ ωMD. Let S be the smallest set of vertices of H such that

i∈S di ≥ 5ω1/4M and no vertex outside of S is larger.

Define initial exploration set S0 to be S ∪ {v} for any vertex v. Define X ′

t by

X ′

0 =

u∈S0

d(u), X ′

t = X ′ 0 + t

i=1

(d(wi) − 2).

10

SLIDE 28

Subcritical case

Let H be the graph G with all degree 2 vertices contracted, let ω > 0 be small such that RD ≤ ωMD. Let S be the smallest set of vertices of H such that

i∈S di ≥ 5ω1/4M and no vertex outside of S is larger.

Define initial exploration set S0 to be S ∪ {v} for any vertex v. Define X ′

t by

X ′

0 =

u∈S0

d(u), X ′

t = X ′ 0 + t

i=1

(d(wi) − 2).

10

SLIDE 29

Subcritical case

Let H be the graph G with all degree 2 vertices contracted, let ω > 0 be small such that RD ≤ ωMD. Let S be the smallest set of vertices of H such that

i∈S di ≥ 5ω1/4M and no vertex outside of S is larger.

Define initial exploration set S0 to be S ∪ {v} for any vertex v. Define X ′

t by

X ′

0 =

u∈S0

d(u), X ′

t = X ′ 0 + t

i=1

(d(wi) − 2).

10

SLIDE 30

Subcritical case

We get the following results about the initial stages of the exploration: Lemma

w∈V \S d(w) (d(w) − 2) ≤ −4ω1/4M,

there is a vertex in S of degree at most ω−1/4.

So the vertices outside the preprocessing set are “small”.

Process is highly concentrated around its mean.
All degrees outside St−1 being low helps the bounds on

switchings.

11

SLIDE 31

Subcritical case

We get the following results about the initial stages of the exploration: Lemma

w∈V \S d(w) (d(w) − 2) ≤ −4ω1/4M,

there is a vertex in S of degree at most ω−1/4.

So the vertices outside the preprocessing set are “small”.

Process is highly concentrated around its mean.
All degrees outside St−1 being low helps the bounds on

switchings.

11

SLIDE 32

Subcritical case

We get the following results about the initial stages of the exploration: Lemma

w∈V \S d(w) (d(w) − 2) ≤ −4ω1/4M,

there is a vertex in S of degree at most ω−1/4.

So the vertices outside the preprocessing set are “small”.

Process is highly concentrated around its mean.
All degrees outside St−1 being low helps the bounds on

switchings.

11

SLIDE 33

Subcritical case

The game is now to bound Et−1 [d(wt) − 2] , the expected increase between X ′

t−1 and X ′ t.

Et−1 [d(wt) − 2] =

w /

∈St−1

(d(w) − 2)Pt−1(wt = w).

12

SLIDE 34

Subcritical case

The game is now to bound Et−1 [d(wt) − 2] , the expected increase between X ′

t−1 and X ′ t.

Et−1 [d(wt) − 2] =

w /

∈St−1

(d(w) − 2)Pt−1(wt = w).

12

SLIDE 35

Lemma If t ≤ ω1/9M and X ′

t−1 ≤ ω1/5M, and Xt′ > 0 for all t′ < t, then:

If w ∈ V \St−1 and d(w) = 1, then

P (wt = w) ≥ (1 − 9ω1/5) 1 Mt−1 .

If w ∈ V \St−1, then

P (wt = w) ≤ (1 + 9ω1/5)d(w) Mt−1 .

13

SLIDE 36

Switchings II

vt w ❙t−1 y x vt w ❙t−1 y x Want to find the number of forward and backward switchings. Number of forward switchings is at most Mt−1. How many of these are ‘bad’?

x or y ∈ St−1
vt ∼ x
w ∼ y
Vertices overlap

14

SLIDE 37

Switchings II

vt w ❙t−1 y x vt w ❙t−1 y x Want to find bounds on the number of forward and backward switchings. Number of forward switchings is at most Mt−1. How many of these are ‘bad’?

x or y ∈ St−1
vt ∼ x
w ∼ y
Vertices overlap

14

SLIDE 38

Switchings II

vt w ❙t−1 y x vt w ❙t−1 y x Want to find bounds on the number of forward and backward switchings. Number of forward switchings is at most Mt−1. How many of these are ‘bad’?

x or y ∈ St−1
vt ∼ x
w ∼ y
Vertices overlap

14

SLIDE 39

Subcritical case: finishing touches

Lemma Define Yt = d(wt) − Et−1(d(wt)). The probability that there exists a t such that

t′≤t Yt′ > M2/3 is less than e−M1/4.

Lemma For t ≤ ⌊ ω1/9M

2

⌋, we have that Et−1 (d(wt) − 2) ≤ − t M + 19ω1/5. Lemma With probability greater than 1 − e−M1/4, there exists a time t ≤ ⌊ ω1/9M

3

⌋ such that Xt = 0.

15

SLIDE 40

Subcritical case: finishing touches

Lemma Define Yt = d(wt) − Et−1(d(wt)). The probability that there exists a t such that

t′≤t Yt′ > M2/3 is less than e−M1/4.

Lemma For t ≤ ⌊ ω1/9M

2

⌋, we have that Et−1 (d(wt) − 2) ≤ − t M + 19ω1/5. Lemma With probability greater than 1 − e−M1/4, there exists a time t ≤ ⌊ ω1/9M

3

⌋ such that Xt = 0.

15

SLIDE 41

Subcritical case: finishing touches

Lemma Define Yt = d(wt) − Et−1(d(wt)). The probability that there exists a t such that

t′≤t Yt′ > M2/3 is less than e−M1/4.

Lemma For t ≤ ⌊ ω1/9M

2

⌋, we have that Et−1 (d(wt) − 2) ≤ − t M + 19ω1/5. Lemma With probability greater than 1 − e−M1/4, there exists a time t ≤ ⌊ ω1/9M

3

⌋ such that Xt = 0.

15

SLIDE 42

Supercritical case

Same exploration process, but different preprocessing. Preprocessing – supercritical case Expose all components in H containing a vertex of degree larger than

√ M log M . Call the set of exposed vertices U.

Analysis splits into two cases:

u∈U d(u) ≥ R 100 – then U contains a giant component,

u∈U d(u) < R 100 – same exploration as in the subcritical

case.

16

SLIDE 43

Supercritical case

Same exploration process, but different preprocessing. Preprocessing – supercritical case Expose all components in H containing a vertex of degree larger than

√ M log M . Call the set of exposed vertices U.

Analysis splits into two cases:

u∈U d(u) ≥ R 100 – then U contains a giant component,

u∈U d(u) < R 100 – same exploration as in the subcritical

case.

16

SLIDE 44

Supercritical case

Same exploration process, but different preprocessing. Preprocessing – supercritical case Expose all components in H containing a vertex of degree larger than

√ M log M . Call the set of exposed vertices U.

Analysis splits into two cases:

u∈U d(u) ≥ R 100 – then U contains a giant component,

u∈U d(u) < R 100 – same exploration as in the subcritical

case.

16

SLIDE 45

Supercritical case

Lemma Let U be a set of vertices containing all vertices with degree greater than

√ M log M and let 1 4 < c < 1 be such that u∈U d(u) ≤ cR. Then

u∈V \U

d(u) (d(u) − 2) ≥ (1 − c) 2 R.

17

SLIDE 46

Supercritical switching analysis

vt w ❙t−1 y x vt w ❙t−1 y x Same switching, more complicated bounds: need a lower bound on #backward. Bad backward switchings are:

vt ∼ w
x ∼ y
y = x

18

SLIDE 47

Supercritical switching analysis

vt w ❙t−1 y x vt w ❙t−1 y x Same switching, more complicated bounds: need a lower bound on #backward. Bad backward switchings are:

vt ∼ w
x ∼ y
y = x

18

SLIDE 48

Supercritical case

Lemma Let β = 10−6ε2 be a fixed constant. If Mt−1 ≥ 3M

4

and Xt−1 ≤ βM, then for every w ∈ St−1, (1 − 10

β)d(w)

Mt−1 ≤ P (w = wt) ≤ (1 + 10

β)d(w)

Mt−1 . Futhermore, P

d′

t(w) ≥ ⌊2

βd(w)⌋ + i
w = wt
≤ βi/2,

where d′

t(w) is the number of edges from w to St−1\ {vt} in H

and the number of loops at w in H.

Conditions imply linear but still early stages of exploration
Probabilities no longer asymptotically equal

19

SLIDE 49

Supercritical case

Lemma Let β = 10−6ε2 be a fixed constant. If Mt−1 ≥ 3M

4

and Xt−1 ≤ βM, then for every w ∈ St−1, (1 − 10

β)d(w)

Mt−1 ≤ P (w = wt) ≤ (1 + 10

β)d(w)

Mt−1 . Futhermore, P

d′

t(w) ≥ ⌊2

βd(w)⌋ + i
w = wt
≤ βi/2,

where d′

t(w) is the number of edges from w to St−1\ {vt} in H

and the number of loops at w in H.

Conditions imply linear but still early stages of exploration
Probabilities no longer asymptotically equal

19

SLIDE 50

Supercritical case

Lemma Let β = 10−6ε2 be a fixed constant. If Mt−1 ≥ 3M

4

and Xt−1 ≤ βM, then for every w ∈ St−1, (1 − 10

β)d(w)

Mt−1 ≤ P (w = wt) ≤ (1 + 10

β)d(w)

Mt−1 . Futhermore, P

d′

t(w) ≥ ⌊2

βd(w)⌋ + i
w = wt
≤ βi/2,

where d′

t(w) is the number of edges from w to St−1\ {vt} in H

and the number of loops at w in H.

Conditions imply linear but still early stages of exploration
Probabilities no longer asymptotically equal

19

SLIDE 51

Lemma For t ≤ τ, E [d(wt) − 2] ≥ ε

4, E [d′ t(wt)] ≤ E[d(wt)−2] 3

, and thus E [Xt − Xt−1] ≥

ε 12.

Here τ is the smallest t for which either Xt ≥ βM or Mt ≤

1 −

R 4M

M0.

Xt ≥ Xt−1 + (d(wt) − 2) − 2d′

t(wt), so we can use this recursively

to get...

20

SLIDE 52

Lemma For t ≤ τ, E [d(wt) − 2] ≥ ε

4, E [d′ t(wt)] ≤ E[d(wt)−2] 3

, and thus E [Xt − Xt−1] ≥

ε 12.

Here τ is the smallest t for which either Xt ≥ βM or Mt ≤

1 −

R 4M

M0.

Xt ≥ Xt−1 + (d(wt) − 2) − 2d′

t(wt), so we can use this recursively

to get...

20

SLIDE 53

Xτ ≥ E [Xτ] +

t≤τ

At +

t≤τ

Bt, where At = d(wt) − E [d(wt)] and Bt = d′

s(wt) − E [d′ t(wt)].

Lemma With probability 1 − o(1), there exists no t ≤ τ for which

s≤t As

r

s≤t Bs are greater than M log log M .

Lemma With probability 1 − o(1), Xτ ≥ βM.

21

SLIDE 54

Xτ ≥ E [Xτ] +

t≤τ

At +

t≤τ

Bt, where At = d(wt) − E [d(wt)] and Bt = d′

s(wt) − E [d′ t(wt)].

Lemma With probability 1 − o(1), there exists no t ≤ τ for which

s≤t As

r

s≤t Bs are greater than M log log M .

Lemma With probability 1 − o(1), Xτ ≥ βM.

21

SLIDE 55

Small caveats

We found bounds on the number of edges in each component,

not vertices!

What about degree 2 vertices?
Degree 2 vertices in components of H(D)
Degree 2 vertices in cyclic components
The case of too many degree 2 vertices

22

SLIDE 56

Small caveats

We found bounds on the number of edges in each component,

not vertices!

What about degree 2 vertices?
Degree 2 vertices in components of H(D)
Degree 2 vertices in cyclic components
The case of too many degree 2 vertices

22

SLIDE 57

Small caveats

We found bounds on the number of edges in each component,

not vertices!

What about degree 2 vertices?
Degree 2 vertices in components of H(D)
Degree 2 vertices in cyclic components
The case of too many degree 2 vertices

22

SLIDE 58

Thank you!

23

SLIDE 59

References

Felix Joos, Guillem Perarnau, Dieter Rautenbach, and Bruce Reed. How to determine if a random graph with a fixed degree sequence has a giant component. Probability Theory and Related Fields, 170(1):263–310, Feb 2018. ISSN 1432-2064. doi: 10.1007/s00440-017-0757-1. URL https://doi.org/10.1007/s00440-017-0757-1. Michael Molloy and Bruce Reed. A critical point for random graphs with a given degree sequence. Random Struct. Algorithms, 6(2-3):161–180, March 1995. ISSN 1042-9832. doi: 10.1002/rsa.3240060204. URL http://dx.doi.org/10.1002/rsa.3240060204.