Quick migration with Docker
Dockerfile
: Defines the container for running the application.requirements.txt
: Lists dependencies needed for the application.banana_config.json
file.
Banana uses a banana_config.json
file which contains things like Idle Timeout, Inference Timeout, and Max Replicas.
Idle Timeout
Runpod allows you to set an Idle Timeout when creating the endpoint. The default value is 5 seconds.
Inference Timeout
Runpod has a similar concept to Inference Timeout. For runs that are take less than 30 seconds to execute, you should use the run_sync
handler. For runs that take longer than 30 seconds to execute, you should use the sync
handler.
Max Replicas
When creating a Worker in Runpod, you can set the max Workers that will scale up depending on the amount of Worker sent to your endpoint. For more information, see Scale Type.