
Scaling Generative AI with Cloudera and NVIDIA: Deploying LLMs with AI
Inference

In this session, discover how to deploy scalable GenAI applications with NVIDIA NIM using the Cloudera AI Inference service. Learn how to manage and optimize AI workloads during the critical deployment phase of the AI lifecycle, focusing on Large Language Models (LLMs).
Why You Should Watch:
- Understand how Cloudera AI Inference with NVIDIA
enables scalable GenAI applications. - Gain insights into the deployment phase of AI which is
critical for operationalizing AI workloads. - See practical demos on deploying LLMs with AI
Inference. - Learn how NVIDIA’s GPU-accelerated infrastructure
enhances performance for AI applications.