Cloud Run for AI Inference
This course is designed for developers, data scientists, and ML engineers who want to quickly deploy AI inference services on Cloud Run. It suits those who are familiar with cloud-based serverless application deployment but may not have experience running AI inference on Google Cloud serverless products.
The course aims to help developers efficiently deploy and optimize AI inference services on Cloud Run. It includes examples that deploy a model for AI inference with GPUs and integrate generative AI apps with data storage services.
- Use Cloud Run GPUs for AI inference.
- Deploy lightweight language models on Cloud Run for AI inference.
- Optimize AI inference deployments on Cloud Run for performance and cost efficiency.
- Integrate Cloud Run AI inference services with database services on Google Cloud.
Recommended course: Developing Applications with Cloud Run on Google Cloud: Fundamentals