Skip to content

Switch to XLM-RoBERTa for Asian language support #457

@hanneshapke

Description

@hanneshapke

Motivation

To improve support for Asian languages, we should switch our current model to XLM-RoBERTa.

XLM-RoBERTa is a multilingual model pre-trained on 100 languages, including a wide range of Asian languages (Chinese, Japanese, Korean, Thai, Vietnamese, Hindi, etc.), and consistently outperforms multilingual BERT (mBERT) on cross-lingual benchmarks.

Proposal

  • Evaluate XLM-RoBERTa (xlm-roberta-base and/or xlm-roberta-large) as a replacement for the current model.
  • Benchmark performance on Asian language inputs against the current setup.
  • Update the model loading / inference code paths to support XLM-RoBERTa.
  • Update documentation and any related configs.

References

Metadata

Metadata

Assignees

No one assigned

    Labels

    Mentoring will be providedThis is a harder issue, but not too difficult with a mentoradvancedAn issue that requires major refactoring or changes to the core architecture

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions