Supported Input Formats
Data Repair accepts multiple file formats:
- JSON (.json) - Single JSON object or array
- JSONL (.jsonl) - JSON Lines format, one record per line
- YAML (.yaml, .yml) - YAML format
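To make the difference between these formats concrete, here is a minimal Python sketch that normalizes each one into a list of records. It is an illustration only, not the tool's own loader: the function name `load_records` is hypothetical, and treating a single top-level object as one record is an assumption. The YAML branch relies on the third-party PyYAML package.

```python
import json
from pathlib import Path


def load_records(path):
    """Return a list of records from a .json, .jsonl, .yaml, or .yml file.

    Hypothetical helper for illustration; not part of the Data Repair CLI.
    """
    path = Path(path)
    suffix = path.suffix.lower()
    if suffix not in (".json", ".jsonl", ".yaml", ".yml"):
        raise ValueError(f"Unsupported file extension: {suffix}")

    text = path.read_text(encoding="utf-8")
    if suffix == ".jsonl":
        # JSON Lines: one JSON value per non-empty line.
        return [json.loads(line) for line in text.splitlines() if line.strip()]
    if suffix == ".json":
        data = json.loads(text)
    else:
        import yaml  # PyYAML (third-party), imported only when needed
        data = yaml.safe_load(text)
    # Assumption: a top-level array holds many records, a single object is one.
    return data if isinstance(data, list) else [data]
```

Calling `load_records("data.jsonl")` on a file with two non-empty lines would return a two-element list, while a `.json` file holding a single object would return a one-element list.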
Parameters
| Parameter | Required | Description |
|---|---|---|
| input | Yes | Path to input data file or directory |
| request | Yes | Description of modifications to apply |
| domain | No | Domain context for modification |
| schema_file | No | Path to a local schema file |
| mcp_server_url | No | URL of an MCP server providing the schema |
| reference_doc | No | Reference documentation path |
Providing schema_file or mcp_server_url gives the repair process access to your full function schema.
Execution and Completion
After confirmation, the CLI displays a progress panel showing real-time status as each record is modified. The panel updates dynamically with elapsed time and current processing status.

Directory Input
You can specify a directory to modify multiple files at once. The CLI automatically aggregates all supported files (.json, .jsonl, .yaml, and .yml) within the specified directory, up to 3 levels deep, and processes them together.
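The discovery step described above can be sketched in Python as follows. This is an approximation under stated assumptions, not the CLI's actual implementation: `find_input_files` is a hypothetical name, and the sketch assumes that files directly inside the given directory count as level 1.

```python
from pathlib import Path

SUPPORTED_SUFFIXES = {".json", ".jsonl", ".yaml", ".yml"}
MAX_DEPTH = 3  # Assumption: root/a.json is level 1, root/x/a.json is level 2.


def find_input_files(root, max_depth=MAX_DEPTH):
    """Collect supported files under root, at most max_depth levels deep."""
    root = Path(root)
    found = []
    for path in root.rglob("*"):
        if not path.is_file():
            continue
        if path.suffix.lower() not in SUPPORTED_SUFFIXES:
            continue
        # Number of path components below root, including the file name.
        depth = len(path.relative_to(root).parts)
        if depth <= max_depth:
            found.append(path)
    return sorted(found)
```

Under that depth convention, `root/x/y/c.yaml` would be picked up, while `root/x/y/z/d.yml` would be skipped.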
Output
After a run completes, results are saved under outputs/ as a new run directory, for example:
outputs/modified_data_<run_id>/
- modified_data_with_details.jsonl - Detailed modification results (includes applied changes)
- modified_data.jsonl - Stripped output containing only the modified content
- datapoints/ - One JSON file per sample (expanded view of modified_data.jsonl)
- metadata.json - Run metadata (task type, parameters, primary files, timestamps)
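For downstream use, a run directory with the layout above could be read with a short Python sketch like the one below. The helper names are hypothetical. The sketch relies only on the file names listed above and makes no assumption about the fields inside each record or inside metadata.json.

```python
import json
from pathlib import Path


def read_modified_records(run_dir):
    """Load every record from modified_data.jsonl in a run directory."""
    records = []
    with (Path(run_dir) / "modified_data.jsonl").open(encoding="utf-8") as fh:
        for line in fh:
            if line.strip():  # Skip blank lines between records.
                records.append(json.loads(line))
    return records


def read_run_metadata(run_dir):
    """Load metadata.json from a run directory as a Python object."""
    with (Path(run_dir) / "metadata.json").open(encoding="utf-8") as fh:
        return json.load(fh)
```

For example, `read_modified_records("outputs/modified_data_<run_id>")` would return one Python object per line of the stripped output, with `<run_id>` replaced by the actual run identifier.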
Using /execute
You can also run data-repair non-interactively via /execute with a YAML config.
Prerequisites
- You have a YAML configuration file available.
- You configure a schema source via /configure, or provide one in the YAML config (for example, mcp_server_url).