Get in Touch

Course Outline

Introduction to Kubeflow

  • Grasping the Kubeflow mission and architectural design
  • Overview of core components and the ecosystem
  • Deployment strategies and platform capabilities

Working with the Kubeflow Dashboard

  • Navigating the user interface
  • Managing notebooks and workspaces
  • Integrating storage solutions and data sources

Kubeflow Pipelines Fundamentals

  • Pipeline structure and component design
  • Creating pipelines with the Python SDK
  • Executing, scheduling, and monitoring pipeline runs

Training ML Models on Kubeflow

  • Distributed training patterns
  • Utilizing TFJob, PyTorchJob, and other operators
  • Resource management and autoscaling within Kubernetes

Model Serving with Kubeflow

  • Overview of KFServing / KServe
  • Deploying models using custom runtimes
  • Managing revisions, scaling, and traffic routing

Managing ML Workflows on Kubernetes

  • Versioning data, models, and artifacts
  • Integrating CI/CD for ML pipelines
  • Security and role-based access control

Best Practices for Production ML

  • Designing reliable workflow patterns
  • Observability and monitoring
  • Troubleshooting common Kubeflow issues

Advanced Topics (Optional)

  • Multi-tenant Kubeflow environments
  • Hybrid and multi-cluster deployment scenarios
  • Extending Kubeflow with custom components

Summary and Next Steps

Requirements

  • A foundational understanding of containerized applications
  • Experience with basic command-line operations
  • Familiarity with Kubernetes concepts

Audience

  • Machine Learning practitioners
  • Data scientists
  • DevOps teams new to Kubeflow
 14 Hours

Number of participants


Price per participant

Testimonials (3)

Upcoming Courses

Related Categories