❓ Questions and Help
Before asking:
- search the issues.
- search the docs.
What is your question?
If we are using seq2seq model, and use streamAtt or AlignAtt then in streaming mode (audio coming in chunked fashion ) can we use k, v caching?
If yes do you think the cached values remain valid when a new audio chunk is added / concatted to previous accumulated audio?
If no any optimization suggestion to improve inference speed?
Code
What have you tried?
What's your environment?
- fairseq Version (e.g., 1.0 or master):
- PyTorch Version (e.g., 1.0)
- OS (e.g., Linux):
- How you installed fairseq (
pip, source):
- Build command you used (if compiling from source):
- Python version:
- CUDA/cuDNN version:
- GPU models and configuration:
- Any other relevant information:
❓ Questions and Help
Before asking:
What is your question?
If we are using seq2seq model, and use streamAtt or AlignAtt then in streaming mode (audio coming in chunked fashion ) can we use k, v caching?
If yes do you think the cached values remain valid when a new audio chunk is added / concatted to previous accumulated audio?
If no any optimization suggestion to improve inference speed?
Code
What have you tried?
What's your environment?
pip, source):