
Datacurve

Frontier coding data for training and evaluating LLMs

AI Tools, llm-training, coding-data, rlhf, fine-tuning, evaluation, foundation-models, data-annotation

About

Datacurve provides high-quality coding datasets for foundation model labs, specializing in post-training and evaluation data. Their offerings include supervised fine-tuning (SFT) datasets, reinforcement learning environments for repo-wide code tasks, and RLHF pipelines with custom model endpoints. They partner with AI teams to identify model weaknesses via private benchmarking and deliver tailored annotation data at scale.

Problem

AI teams lack high-quality, domain-specific coding data needed to train and evaluate large language models effectively.

For

Foundation model labs and AI research teams

How it works

Datacurve benchmarks a client's model to identify gaps, then builds and delivers custom SFT datasets, RL environments, and RLHF data tailored to close those gaps.

Business model

unknown

Status

launched

Company

Datacurve
