Some fixes in LM part regarding ngram history length + MLE ngram by uralik · Pull Request #13 · nyu-dl/NLP_DL_Lecture_Note

uralik · 2017-12-30T03:23:55Z

Given the definition of an n-gram, the text is correct, but the formulas always use histories of length n rather than n-1, which is probably a typo. I have also added a short explanation of why the relative-frequency n-gram estimator is optimal from the MLE perspective.
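
For reference, the relative-frequency estimator mentioned above can be written as follows. This is a sketch in the note's notation; the count function $c(\cdot)$ is an assumed symbol and may not match what the lecture note actually uses.

```latex
\begin{align*}
    p(w_k | w_{k-n+1}, \ldots, w_{k-1})
    = \frac{c(w_{k-n+1}, \ldots, w_{k-1}, w_k)}{c(w_{k-n+1}, \ldots, w_{k-1})},
\end{align*}
```

Here $c(\cdot)$ is the number of times the given sequence occurs in the training corpus. Maximizing the corpus log-likelihood subject to the conditional probabilities summing to one for each history yields this ratio, and the history in it has $n-1$ symbols.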

kyunghyuncho · 2017-12-30T03:46:00Z

lecture_note.tex

 on the $n-1$ preceding symbols only, meaning
 \begin{align*}
-    p(w_k | w_{<k}) \approx p(w_k | w_{k-n}, w_{k-n+1}, \ldots, w_{k-1}).
+    % p(w_k | w_{<k}) \approx p(w_k | w_{k-n}, w_{k-n+1}, \ldots, w_{k-1}). % history length should be n-1


Please remove this commented-out line.

kyunghyuncho · 2017-12-30T03:46:10Z

lecture_note.tex

 This results in 
 \begin{align*}
-    p(S) \approx \prod_{t=1}^T p(w_t | w_{t-n}, \ldots, w_{t-1}).
+    p(S) \approx \prod_{t=1}^T p(w_t | w_{t-n+1}, \ldots, w_{t-1}). % history should have n-1 length
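
To make the index change concrete, here is a minimal Python sketch of how the history for each position is sliced under the corrected formula. The function name `ngram_contexts` and the `<s>` padding symbol are illustrative assumptions, not taken from the lecture note.

```python
def ngram_contexts(tokens, n):
    """Yield (history, word) pairs where each history has exactly n-1 symbols."""
    # Assumption: pad with n-1 start symbols so early words get a full history.
    padded = ["<s>"] * (n - 1) + list(tokens)
    for t in range(n - 1, len(padded)):
        # Corresponds to w_{t-n+1}, ..., w_{t-1}: n-1 symbols, not n.
        history = tuple(padded[t - n + 1:t])
        yield history, padded[t]


pairs = list(ngram_contexts(["the", "cat", "sat"], n=3))
```

With `n=3`, every history in `pairs` is a 2-tuple, so each (history, word) pair spans three symbols, which matches the definition of a trigram.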


kyunghyuncho · 2017-12-30T03:47:03Z

lecture_note.tex


 The biggest issue of having an $n$-gram that never occurs in the training corpus
-is that any sentence containing the $n$-gram will be given a zero probability
+is that any sentence containing such $n$-gram will be given a zero probability


such an $n$-gram
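
The zero-probability issue described in this hunk can be reproduced with a small count-based model. The following is a rough sketch, assuming a toy corpus, `<s>` padding, and hypothetical helper names (`train_mle_ngram`, `sentence_prob`); treating an unseen history as probability zero is also an assumption of this sketch.

```python
from collections import Counter


def train_mle_ngram(corpus, n):
    """Count n-grams and their (n-1)-symbol histories over a list of sentences."""
    ngram_counts, history_counts = Counter(), Counter()
    for sentence in corpus:
        padded = ["<s>"] * (n - 1) + list(sentence)
        for t in range(n - 1, len(padded)):
            history = tuple(padded[t - n + 1:t])
            ngram_counts[history + (padded[t],)] += 1
            history_counts[history] += 1
    return ngram_counts, history_counts


def sentence_prob(sentence, n, ngram_counts, history_counts):
    """Product of relative-frequency estimates over the sentence."""
    prob = 1.0
    padded = ["<s>"] * (n - 1) + list(sentence)
    for t in range(n - 1, len(padded)):
        history = tuple(padded[t - n + 1:t])
        denom = history_counts[history]
        if denom == 0:
            # Assumption: an unseen history is treated as probability zero.
            return 0.0
        prob *= ngram_counts[history + (padded[t],)] / denom
    return prob


corpus = [["the", "cat", "sat"], ["the", "dog", "sat"]]
counts = train_mle_ngram(corpus, n=2)
seen = sentence_prob(["the", "cat", "sat"], 2, *counts)
unseen = sentence_prob(["the", "cat", "ran"], 2, *counts)
```

Because the bigram ("cat", "ran") never occurs in the toy corpus, one factor of the product is zero and `unseen` is zero regardless of the other factors.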

Some fixes in LM part regarding ngram history length + MLE ngram

3870bb2

kyunghyuncho reviewed Dec 30, 2017

uralik added 2 commits December 29, 2017 22:50

wipe comments, + such as

0b5ad71

such an

4664255

uralik wants to merge 3 commits into nyu-dl:master from uralik:master
