Sequence-Level Knowledge Distillation

Yoon KimAlexander M. Rush

   Papers with code   Abstract  PDF

Neural machine translation (NMT) offers a novel alternative formulation of translation that is potentially simpler than statistical approaches. However to reach competitive performance, NMT models need to be exceedingly large... (read more)

Benchmarked Models

No benchmarked models yet. Click here to submit a model.