부스트캠프 AI Tech
Beam Search and BLEU score
말린malin
2022. 10. 23. 08:19
<Beam Search>
- Greedy decoding has no way to undo decisions!
- Exhaustive search : This O(𝑉𝑡)complexity is far too expensive!
<Beam search>
- on each time step of the decoder, we keep track of the 𝑘 most probable partial translations (which we call hypothese)
- k : beam size (in practice around 5 to 10)
- longer hypotheses have lower scores
- 그러므로 Normalize가 필요함
<BLEU score>
- model 2는 100%가 나왔지만 문법적으로 전혀 말이 안 됨. 이 부분을 보완하자!
- BLUE score : precision를 계산, N-gram overlap(연속된 단위로 비교하기)