CS 4501 Machine Learning for NLP
Introduction
Yangfeng Ji
Department of Computer Science University of Virginia
Overview
1. Course Information
2. Basic Linear Algebra
3. Basic Probability Theory
4. Statistical Estimation
About Online
◮ Chime in
◮ Use the “Raise Hand” feature
◮ Send a message via Chat
◮ Yangfeng Ji
◮ Office hour: TBD
◮ Stephanie Schoch
◮ Office hour: TBD
◮ Text classification
◮ Language modeling
◮ Word embeddings
◮ Sequence labeling
◮ Machine translation
◮ Discourse processing, text generation, interpretability in NLP
◮ Final project
◮ 14% × 6 = 84%
◮ 2–3 students per group
◮ Proposal: 4%
◮ Final presentation: 6%
◮ Final project report: 6%
◮ Collaboration is not encouraged
◮ Students are allowed to discuss with their classmates
◮ It should be a team effort
◮ Eisenstein, Natural Language Processing, 2018
◮ Jurafsky and Martin, Speech and Language Processing, 3rd Edition, 2019
◮ Smith, Linguistic Structure Prediction, 2009
◮ Shalev-Shwartz and Ben-David, Understanding Machine Learning: From Theory to Algorithms, 2014
◮ Goodfellow, Bengio and Courville, Deep Learning, 2016
◮ The element in the $i$-th row and the $j$-th column is denoted as $a_{i,j}$
◮ The $i$-th element is denoted as $x_i$
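As a small illustration of this notation, the sketch below stores a matrix as nested Python lists and a vector as a flat list. The names `A`, `x`, `entry`, and `component` and all values are hypothetical; the only assumption is that the slide's indices are 1-based while Python's are 0-based.

```python
# Hypothetical 2x3 matrix A and 3-dimensional vector x (values are made up).
A = [
    [1, 2, 3],
    [4, 5, 6],
]
x = [7, 8, 9]


def entry(matrix, i, j):
    """Return a_{i,j}, the element in row i and column j (1-based)."""
    return matrix[i - 1][j - 1]


def component(vector, i):
    """Return x_i, the i-th element of the vector (1-based)."""
    return vector[i - 1]


a_2_3 = entry(A, 2, 3)   # row 2, column 3
x_1 = component(x, 1)    # first element
print(a_2_3, x_1)
```

Wrapping the index shift in small helpers keeps the code close to the mathematical notation, at the cost of one extra function call per lookup.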
$\|\boldsymbol{x}\|_2^2 = \langle \boldsymbol{x}, \boldsymbol{x} \rangle$
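The identity between the squared $\ell_2$ norm of a vector and its inner product with itself can be checked numerically. The following is a minimal sketch in plain Python; the helper names `inner` and `squared_l2_norm` and the example vector are assumptions made for illustration.

```python
def inner(x, y):
    """Inner product <x, y> = sum_i x_i * y_i for equal-length vectors."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    return sum(xi * yi for xi, yi in zip(x, y))


def squared_l2_norm(x):
    """Squared Euclidean norm, computed directly as sum_i x_i^2."""
    return sum(xi ** 2 for xi in x)


x = [3, 4]  # made-up example vector
print(squared_l2_norm(x), inner(x, x))  # both are 25
```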
◮ the coin will land heads on the next toss
◮ it will rain tomorrow
$\{\,\cdot\,\}_{k=1}^{6}$ are the parameters of this distribution, which is also the
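As a hedged illustration, assume the distribution here describes the outcome of rolling a six-sided die, so there is one parameter per face and the six parameters sum to one. The variable name `theta`, the helper `roll`, and the fair-die values are assumptions, not taken from the slide.

```python
import random
from fractions import Fraction

# Assumed parameters: a fair die, one probability per face k = 1..6.
theta = {k: Fraction(1, 6) for k in range(1, 7)}

# Valid parameters are non-negative and sum to exactly one.
assert all(p >= 0 for p in theta.values())
assert sum(theta.values()) == 1


def roll(params, rng):
    """Draw one face according to the given probabilities."""
    faces = list(params)
    weights = [float(params[k]) for k in faces]
    return rng.choices(faces, weights=weights, k=1)[0]


rng = random.Random(0)  # fixed seed so the sketch is repeatable
samples = [roll(theta, rng) for _ in range(10)]
print(samples)
```

Using `Fraction` for the parameters makes the sum-to-one check exact; the values are converted to floats only when they are passed as sampling weights.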
◮ $P(Y = 0 \mid X = 1) = 0.25$
◮ $P(Y = 1 \mid X = 1) = 0.75$
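To show where conditional probabilities like these can come from, the sketch below computes $P(Y \mid X = 1)$ from joint counts. The counts are hypothetical, chosen only so that the result matches the two values above, and the function name `conditional` is likewise an assumption.

```python
from collections import Counter

# Hypothetical joint counts of (x, y) pairs; not data from the slides.
counts = Counter({(0, 0): 3, (0, 1): 3, (1, 0): 1, (1, 1): 3})


def conditional(counts, x):
    """Return P(Y = y | X = x) for every y, estimated from joint counts."""
    total = sum(c for (xi, _), c in counts.items() if xi == x)
    return {y: c / total for (xi, y), c in counts.items() if xi == x}


p_y_given_x1 = conditional(counts, 1)
print(p_y_given_x1)  # {0: 0.25, 1: 0.75}
```

Conditioning on $X = 1$ keeps only the pairs with that value and renormalizes, which is why the two conditional probabilities sum to one.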
◮ Observations of coin tossing: {0, 1, 1, 0, 0, 1, 0}
$\theta = \frac{1}{n} \sum_{i=1}^{n} x^{(i)}$
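For the coin-toss observations above, a natural estimate is the maximum likelihood estimate under a Bernoulli model, which is the fraction of observations equal to 1. The sketch below assumes that model; the function names `bernoulli_mle` and `log_likelihood` are hypothetical.

```python
import math

observations = [0, 1, 1, 0, 0, 1, 0]  # coin tosses listed on the slide


def bernoulli_mle(xs):
    """Maximum likelihood estimate of theta: the sample mean of the 0/1 data."""
    return sum(xs) / len(xs)


def log_likelihood(theta, xs):
    """Bernoulli log-likelihood: sum_i x log(theta) + (1 - x) log(1 - theta)."""
    return sum(x * math.log(theta) + (1 - x) * math.log(1 - theta) for x in xs)


theta_hat = bernoulli_mle(observations)
print(theta_hat)  # 3/7, about 0.4286

# Sanity check: the estimate scores at least as well as nearby values.
for other in (0.3, 0.4, 0.5):
    assert log_likelihood(theta_hat, observations) >= log_likelihood(other, observations)
```

The loop at the end is only a spot check against a few alternative values, not a proof that the sample mean maximizes the likelihood.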
Kolter, Z. (2015). Linear algebra review and reference.