For the **MLOps** engineers: how would you improve the performance and scalability of the REST API in production?