Data Repair detects and resolves minor errors, inconsistencies, or malformed values in existing data, keeping changes minimal and targeted. Describe what changes you want, and the system intelligently applies them across your data while maintaining consistency and quality.Documentation Index
Fetch the complete documentation index at: https://docs.eigenai.com/llms.txt
Use this file to discover all available pages before exploring further.
Supported Input Formats
Data Repair accepts multiple file formats:- JSON (
.json) - Single JSON object or array - JSONL (
.jsonl) - JSON Lines format, one record per line - YAML (
.yaml,.yml) - YAML format
Parameters
| Parameter | Required | Description |
|---|---|---|
input | Yes | Path to input data file or directory |
request | Yes | Description of modifications to apply |
domain | No | Domain context for modification |
schema_file | No | Path to a local schema file |
mcp_server_url | No | URL of an MCP server providing the schema |
reference_doc | No | Reference documentation path |
Providing
schema_file or mcp_server_url gives the repair process access to your full function schema.Execution and Completion
After confirmation, the CLI displays a progress panel showing real-time status as each record is modified. The panel updates dynamically with elapsed time and current processing status.

Directory Input
You can specify a directory to modify multiple files at once. The CLI automatically aggregates all supported files within the directory (up to 3 levels deep) and processes them together..json, .jsonl, .yaml, and .yml files in the specified directory.
Output
After a run completes, results are saved underoutputs/ as a new run directory, for example:
outputs/modified_data_<run_id>/
modified_data_with_details.jsonl- Detailed modification results (includes applied changes)modified_data.jsonl- Stripped output containing only the modified contentdatapoints/- One JSON file per sample (expanded view ofmodified_data.jsonl)metadata.json- Run metadata (task type, parameters, primary files, timestamps)
Using /execute
You can also run data-repair non-interactively via/execute with a YAML config.
Prerequisites
- You have a YAML configuration file available.
- You configure a schema source via
/configure, or provide one in the YAML config (for example,mcp_server_url).