Service to be run on a serverless runpod.ai instance to translate messages from any language to any language (english by default).
We are using m2m_100_418m, this is a many-to-many model. Metrics on performance are TBD.
Model is cached on the docker image.
- A dockerhub account
- A runpod.ai account
- If you want to use the GA to upload to dockerhub, add the following secrets to GH:
- DOCKER_PASSWORD
- DOCKER_USERNAME
Check the wiki for more detailed information
- Upload the container image to dockerhub (you can use the GA) and then add it as a template to your runpod.ai account.
Alternatively, you can use my image:
leandroalbero/serverless-runpod:latest - Create an API on runpod.ai serverless section, add as many servers as you need and customize the settings to your needs.
- Call the runsync endpoint with the following payload, don't forget to authenticate with your API key (Bearer token):
{
"input":{
"src_lang":"es", # Optional, defaults to auto-detected
"target_lang":"en", # Optional, defaults to EN
"input_text":"Hola mundo, esta es una frase en castellano"
}
}A response will be returned with the translated text to english:
{
"delayTime": 96,
"executionTime": 84,
"id": "sync-d14c2348-19b5-44dd-9657-1995146d2f49",
"output": {
"translated_text": "building 8 stories",
"translation_time": 0.0007581710815429688,
"uses_gpu": True
},
"status": "COMPLETED",
}... TODO
