
[NLP Paper Review] Zero-shot transfer learning with synthesized data for multi-domain dialogue state tracking (2020)

JihyunLee 2021. 9. 2. 00:02

Title : Zero-shot transfer learning with synthesized data for multi-domain dialogue state tracking

Authors : Giovanni Campagna, Agata Foryciarz, Mehrad Moradshahi, Monica S. Lam

Year : 2020

paper : https://arxiv.org/pdf/2005.00891.pdf

code : https://github.com/stanford-oval/genie-toolkit

Review

This paper is about zero/few-shot learning for Dialogue State Tracking (DST). Rather than proposing a new model architecture, it uses an ontology to "synthesize" dialogue data and then trains existing models on that synthesized data alone.
After looking at the structure of the MultiWOZ dataset, the authors concluded empirically (the paper itself uses the word "empirical") that dialogue data follows "patterns". They used the ontology provided with MultiWOZ to generate dialogue data in a rule-based way (the paper says this takes a "few human-hours") and then trained the baseline models on it.
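To make the idea of rule-based synthesis concrete for myself, here is a minimal sketch. Everything in it is hypothetical: the toy ontology, the templates, and the function name `synthesize_dialogue` are my own and are not taken from the paper or from genie-toolkit, whose actual templates are far richer. It only shows why the state annotation comes for free when the utterance is generated from the ontology.

```python
import random

# Toy ontology: domain -> slot -> possible values.
# Hypothetical example, NOT the real MultiWOZ ontology.
ONTOLOGY = {
    "hotel": {"area": ["north", "centre"], "stars": ["3", "4"]},
    "restaurant": {"food": ["italian", "korean"], "area": ["south", "centre"]},
}

# One hand-written user template per (domain, slot).
# "{value}" is filled with a value sampled from the ontology.
TEMPLATES = {
    ("hotel", "area"): "I need a hotel in the {value}.",
    ("hotel", "stars"): "It should have {value} stars.",
    ("restaurant", "food"): "I am looking for {value} food.",
    ("restaurant", "area"): "The restaurant should be in the {value}.",
}


def synthesize_dialogue(domain, rng):
    """Return a list of (user_utterance, dialogue_state) turns for one domain."""
    slots = list(ONTOLOGY[domain])
    rng.shuffle(slots)  # vary the order in which slots are mentioned
    state, turns = {}, []
    for slot in slots:
        value = rng.choice(ONTOLOGY[domain][slot])
        state[f"{domain}-{slot}"] = value
        utterance = TEMPLATES[(domain, slot)].format(value=value)
        # The label is known exactly because we generated the utterance ourselves.
        turns.append((utterance, dict(state)))
    return turns


if __name__ == "__main__":
    for utterance, state in synthesize_dialogue("hotel", random.Random(0)):
        print(utterance, state)
```

Each turn carries the accumulated state, which is the format a DST model is trained to predict.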

The dialogue composition table built in the paper

The models trained on the synthesized data are TRADE and SUMBT. TRADE does not use a pre-trained language model (such as BERT) and can generate values it never saw during training. SUMBT, in contrast, uses BERT and cannot output a value it did not see during training.

Joint and slot accuracy of the models trained on synthesized data.
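For reference, the two metrics can be sketched as below. This is the usual definition as I understand it, not code from the paper: joint accuracy counts a turn as correct only if the whole predicted state matches, while slot accuracy scores each (turn, slot) pair on its own. Representing a state as a dict that simply omits unfilled slots is my own assumption.

```python
def joint_accuracy(predictions, references):
    """Fraction of turns whose predicted state matches the gold state exactly."""
    matches = sum(pred == gold for pred, gold in zip(predictions, references))
    return matches / len(references)


def slot_accuracy(predictions, references, all_slots):
    """Fraction of (turn, slot) pairs predicted correctly.

    A slot missing from a state dict is treated as the value "none".
    """
    correct = 0
    for pred, gold in zip(predictions, references):
        correct += sum(pred.get(s, "none") == gold.get(s, "none") for s in all_slots)
    return correct / (len(references) * len(all_slots))


if __name__ == "__main__":
    slots = ["hotel-area", "hotel-stars"]
    gold = [{"hotel-area": "north"}, {"hotel-area": "north", "hotel-stars": "4"}]
    pred = [{"hotel-area": "north"}, {"hotel-area": "north", "hotel-stars": "3"}]
    print(joint_accuracy(pred, gold))        # 0.5
    print(slot_accuracy(pred, gold, slots))  # 0.75
```

A single wrong slot makes the whole turn wrong for joint accuracy, which is why it is always the lower of the two numbers.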

When the models are trained with the synthesized data, the results are not very different from the original ones. However, when the models are tested in the zero-shot setting (train with the target domain excluded, then test on the target domain), TRADE reaches roughly 1/2 and SUMBT roughly 2/3 of the accuracy they get with full training data.
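The zero-shot split described above can be sketched as a leave-one-domain-out filter. The record format (a dict with a `domains` list) and the function name `zero_shot_split` are hypothetical. The optional `synthesized` argument reflects my reading that the proposed setting adds synthesized target-domain dialogues to the training set; treat that as an assumption rather than the paper's exact recipe.

```python
def zero_shot_split(dialogues, target_domain, synthesized=None):
    """Hold out every real dialogue that touches the target domain.

    dialogues:   list of dicts, each with a "domains" list (hypothetical format)
    synthesized: optional synthesized target-domain dialogues added to training
    """
    train = [d for d in dialogues if target_domain not in d["domains"]]
    test = [d for d in dialogues if target_domain in d["domains"]]
    if synthesized:
        # Only synthesized data represents the target domain during training.
        train = train + list(synthesized)
    return train, test


if __name__ == "__main__":
    data = [
        {"id": 1, "domains": ["hotel"]},
        {"id": 2, "domains": ["taxi"]},
        {"id": 3, "domains": ["hotel", "taxi"]},
    ]
    train, test = zero_shot_split(data, "taxi")
    print([d["id"] for d in train], [d["id"] for d in test])  # [1] [2, 3]
```

Multi-domain dialogues that include the target domain go to the test side here, so no real target-domain turn leaks into training.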

Source: the paper. Results in the zero-shot learning setting. Zero-shot (DM) is the method proposed in the paper.

Judging from this, SUMBT, which uses a pre-trained model, seems to be a better fit(?) for the synthesized data.

Limitation + my thoughts

Unlike the papers I reviewed earlier, this paper does not design a new model. Instead, it synthesizes data and trains existing models on it. The drawback of this approach is:

- A person has to write the dialogue rules. Even if the rules are written well, I doubt whether they would carry over to datasets other than MultiWOZ.

Reading this paper also made me think that work which might look like grunt work(?) can still turn into a good paper when it is done carefully and logically and the results are good (honestly, I'm not sure they are that good..!) 💛
