The Evolved Transformer

David R. So, Chen Liang, Quoc V. Le


Recent works have highlighted the strength of the Transformer architecture on sequence tasks while, at the same time, neural architecture search (NAS) has begun to outperform human-designed models. Our goal is to apply NAS to search for a better alternative to the Transformer...
