Pipelines Docs is in beta — content is actively being added.
Platform GuideDatasets & Data Vault

Importing Data

Import datasets from files and external sources into the Data Vault.

Import from file

Upload a CSV, JSON, or JSONL file to create a new dataset.

  1. Navigate to Data Vault and click Import Dataset.
  2. Select the File Upload tab.
  3. Drag and drop your file or click to browse.
  4. Review the preview — column types are auto-detected and can be adjusted using the dropdown in each column header.
  5. Enter a dataset name.
  6. Click Import.

Supported formats

FormatDetails
CSVComma-separated values. First row is treated as headers.
JSONArray of objects ([{...}, {...}]).
JSONLJSON Lines — one JSON object per line.

Limits

  • Maximum file size: 50 MB
  • Maximum rows: 100,000

If the file's columns match a known platform export (e.g., LangSmith, Braintrust, Scale AI, Labelbox), a badge is shown in the preview and the source platform is recorded on the dataset.

Import from HuggingFace

Import datasets directly from the HuggingFace Hub. See HuggingFace Integration for details.