Information-theoretic locality properties
- f natural language
Richard Futrell
Department of Language Science Department of Computer Science University of California, Irvine @rljfutrell rfutrell@uci.edu
Quantitative Syntax 2019 2019-08-26
1
Information-theoretic locality properties of natural language - - PowerPoint PPT Presentation
Information-theoretic locality properties of natural language Richard Futrell Department of Language Science Department of Computer Science University of California, Irvine @rljfutrell rfutrell@uci.edu Quantitative Syntax 2019 2019-08-26
1
2
2
2
2
(Zipf, 1949; Hockett, 1960; Slobin, 1973; Givón, 1991, 1992; Hawkins, 1994, 2004, 2014; Christiansen & Chater, 2008; Jaeger & Tily, 2011; Fedzechkina et al., 2012; MacDonald, 2013)
3
(Zipf, 1949; Hockett, 1960; Slobin, 1973; Givón, 1991, 1992; Hawkins, 1994, 2004, 2014; Christiansen & Chater, 2008; Jaeger & Tily, 2011; Fedzechkina et al., 2012; MacDonald, 2013)
3
(Zipf, 1949; Hockett, 1960; Slobin, 1973; Givón, 1991, 1992; Hawkins, 1994, 2004, 2014; Christiansen & Chater, 2008; Jaeger & Tily, 2011; Fedzechkina et al., 2012; MacDonald, 2013)
4
4
4
4
4
4
4
5
5
dependency distance minimization, domain minimization, early immediate constituents, principle of head proximity, Behaghel’s First Law, …)
5
dependency distance minimization, domain minimization, early immediate constituents, principle of head proximity, Behaghel’s First Law, …)
5
dependency distance minimization, domain minimization, early immediate constituents, principle of head proximity, Behaghel’s First Law, …)
5
dependency distance minimization, domain minimization, early immediate constituents, principle of head proximity, Behaghel’s First Law, …)
5
dependency distance minimization, domain minimization, early immediate constituents, principle of head proximity, Behaghel’s First Law, …)
5
dependency distance minimization, domain minimization, early immediate constituents, principle of head proximity, Behaghel’s First Law, …)
5
dependency distance minimization, domain minimization, early immediate constituents, principle of head proximity, Behaghel’s First Law, …)
5
dependency distance minimization, domain minimization, early immediate constituents, principle of head proximity, Behaghel’s First Law, …)
5
dependency distance minimization, domain minimization, early immediate constituents, principle of head proximity, Behaghel’s First Law, …)
5
dependency distance minimization, domain minimization, early immediate constituents, principle of head proximity, Behaghel’s First Law, …)
5
dependency distance minimization, domain minimization, early immediate constituents, principle of head proximity, Behaghel’s First Law, …)
6
6
Grodner & Gibson, 2005; Bartek et al., 2011)
6
Grodner & Gibson, 2005; Bartek et al., 2011)
2018; Liu et al., 2018).
6
Grodner & Gibson, 2005; Bartek et al., 2011)
2018; Liu et al., 2018).
6
Grodner & Gibson, 2005; Bartek et al., 2011)
2018; Liu et al., 2018).
6
Grodner & Gibson, 2005; Bartek et al., 2011)
2018; Liu et al., 2018).
(Hawkins, 1994, 2004, 2014; Wasow, 2002)
6
Grodner & Gibson, 2005; Bartek et al., 2011)
2018; Liu et al., 2018).
(Hawkins, 1994, 2004, 2014; Wasow, 2002)
6
Grodner & Gibson, 2005; Bartek et al., 2011)
2018; Liu et al., 2018).
(Hawkins, 1994, 2004, 2014; Wasow, 2002)
7
7
7
7
7
7
7
7
7
7
8
9
9
& Levy, 2013; Hale, 2016):
9
& Levy, 2013; Hale, 2016):
9
& Levy, 2013; Hale, 2016):
9
& Levy, 2013; Hale, 2016):
Smith & Levy (2013). The effect of word predictability on reading time is
9
& Levy, 2013; Hale, 2016):
Smith & Levy (2013). The effect of word predictability on reading time is
9
& Levy, 2013; Hale, 2016):
Smith & Levy (2013). The effect of word predictability on reading time is
9
& Levy, 2013; Hale, 2016):
Levy, 2008)
Smith & Levy (2013). The effect of word predictability on reading time is
9
& Levy, 2013; Hale, 2016):
Levy, 2008)
Smith & Levy (2013). The effect of word predictability on reading time is
9
& Levy, 2013; Hale, 2016):
Levy, 2008)
Smith & Levy (2013). The effect of word predictability on reading time is
10
10
10
10
10
10
10
10
10
10
prediction
prediction
prediction
prediction
prediction
12
12
12
12
where mi is a lossy compression of the context w1,…,i-1, i.e. mi is an approximate epsilon-machine (Feldman & Crutchfield, 1998; Marzen & Crutchfield, 2017).
12
where mi is a lossy compression of the context w1,…,i-1, i.e. mi is an approximate epsilon-machine (Feldman & Crutchfield, 1998; Marzen & Crutchfield, 2017).
12
where mi is a lossy compression of the context w1,…,i-1, i.e. mi is an approximate epsilon-machine (Feldman & Crutchfield, 1998; Marzen & Crutchfield, 2017).
12
where mi is a lossy compression of the context w1,…,i-1, i.e. mi is an approximate epsilon-machine (Feldman & Crutchfield, 1998; Marzen & Crutchfield, 2017).
13
13
memory representation will have less information about words that have been in memory longer.
13
memory representation will have less information about words that have been in memory longer.
mutual information are distant.
13
memory representation will have less information about words that have been in memory longer.
mutual information are distant.
13
memory representation will have less information about words that have been in memory longer.
mutual information are distant.
13
i−1
memory representation will have less information about words that have been in memory longer.
mutual information are distant.
13
i−1
memory representation will have less information about words that have been in memory longer.
mutual information are distant.
13
i−1
memory representation will have less information about words that have been in memory longer.
mutual information are distant.
13
i−1
memory representation will have less information about words that have been in memory longer.
mutual information are distant.
13
i−1
memory representation will have less information about words that have been in memory longer.
mutual information are distant.
13
i−1
memory representation will have less information about words that have been in memory longer.
mutual information are distant.
13
i−1
memory representation will have less information about words that have been in memory longer.
mutual information are distant.
13
i−1
14
14
14
information (de Paiva Alves, 1996; Yuret, 1998)
14
information (de Paiva Alves, 1996; Yuret, 1998)
covariance.
14
information (de Paiva Alves, 1996; Yuret, 1998)
covariance.
information locality effects.
14
information (de Paiva Alves, 1996; Yuret, 1998)
covariance.
information locality effects.
14
information (de Paiva Alves, 1996; Yuret, 1998)
covariance.
information locality effects.
14
Information locality Dependency Locality
15
19
20
20
20
20
20
20
20
20
20
20
20
20
20
21
21
21
21
21
21
21
21
21
21
22
22
22
22
22
22
22
23
24
24
24
24
24
25
25
25
25
25
25
25
26