lakeFS
lakefs.ioThe Control Plane for AI-Ready Data
Data & Analyticsdata-version-controlobject-storagemlopsdata-engineeringopen-sourceai-infrastructuredata-governance

About
lakeFS is an open-source data version control platform that brings Git-like branching, versioning, and repository semantics to object storage systems. It helps AI and data engineering teams manage data lifecycle, provenance, and access across distributed infrastructure. Teams can test pipeline changes in isolation, ensure reproducibility of training runs, and maintain compliance and governance across data workloads.
Problem
Data teams lack version control, reproducibility, and governance tooling for large-scale object storage and data lakes.
For
AI and data engineering teams at enterprises
How it works
lakeFS wraps existing object storage (S3-compatible) with Git-like branching and versioning, enabling isolated testing, rollback, and full data lineage without copying data.
Business model
open-source
Status
launched
Company
lakeFS