Motivation
To improve support for Asian languages, we should switch our current model to XLM-RoBERTa.
XLM-RoBERTa is a multilingual model pre-trained on 100 languages, including a wide range of Asian languages (Chinese, Japanese, Korean, Thai, Vietnamese, Hindi, etc.), and consistently outperforms multilingual BERT (mBERT) on cross-lingual benchmarks.
Proposal
- Evaluate XLM-RoBERTa (
xlm-roberta-base and/or xlm-roberta-large) as a replacement for the current model.
- Benchmark performance on Asian language inputs against the current setup.
- Update the model loading / inference code paths to support XLM-RoBERTa.
- Update documentation and any related configs.
References
Motivation
To improve support for Asian languages, we should switch our current model to XLM-RoBERTa.
XLM-RoBERTa is a multilingual model pre-trained on 100 languages, including a wide range of Asian languages (Chinese, Japanese, Korean, Thai, Vietnamese, Hindi, etc.), and consistently outperforms multilingual BERT (mBERT) on cross-lingual benchmarks.
Proposal
xlm-roberta-baseand/orxlm-roberta-large) as a replacement for the current model.References