← All projects

Cedana

Automation Layer for AI Factories

Ops & Infragpucontainerscheckpoint-restorehpcai-infrastructurecomputekubernetes
Cedana screenshot

About

Cedana is a Save/Migrate/Resume (SMR) platform for containerized CPU and GPU workloads. It sits between the Linux kernel and your workloads to checkpoint and migrate container state across instances and cloud vendors. The system is designed to maximize compute utilization, improve reliability, and support policy-based orchestration for AI and HPC workloads.

Problem

Compute resources are wasted due to idle instances and workloads that can't seamlessly migrate across nodes or cloud vendors.

For

AI and HPC engineers managing containerized workloads

How it works

Cedana intercepts between the Linux kernel and containers to save the full state of a workload, enabling live migration and resumption across different instances and vendors via an API.

Business model

unknown

Status

launched

Company

Cedana

Similar projects