Workspace Structure¶

How Agentomics-ML organizes files during and after execution.

Directory Overview¶

agentomics-ml/
├── datasets/                 # Public train/validation datasets
├── test_datasets/            # Hidden test datasets
└── outputs/                  # Final results

../workspace/runs/<agent_id>/ # Local-mode active workspace
├── run/                      # Current run files
├── best_iteration_snapshot/  # Best iteration snapshot
├── reports/                  # Iteration reports
├── logs/                     # Logs and metrics
└── fallbacks/                # Reserved recovery area

Docker mode uses an internal temporary workspace volume with the same layout and copies it to outputs/<agent_id>/ when the run ends.

datasets/¶

Public datasets use split folders:

datasets/my_dataset/
├── train/
│   ├── input/
│   └── labels.csv
├── validation/             # Optional
│   ├── input/
│   └── labels.csv
├── supplementary/          # Optional: dataset-level source materials
│   └── README.md           # Optional: describes the supplementary materials
├── metadata.json           # Optional if task type is supplied at preparation
└── dataset_description.md  # Optional domain information

Hidden test data uses a matching separate root:

test_datasets/my_dataset/
└── test/
    ├── input/
    └── labels.csv

Each unprepared labels.csv must include id and label columns. Preparation rewrites labels in place with id and numeric_label, then writes metadata.json with "prepared": true. Only train and validation are supported under datasets/; only test is supported under test_datasets/. The input/ interface is recorded at preparation time, must match across all splits, and must not be modified during a run.

After preparation, the public dataset directory is the agent-facing dataset:

datasets/my_dataset/
├── train/
│   ├── input/
│   └── labels.csv          # id,numeric_label
├── validation/
│   ├── input/
│   └── labels.csv          # id,numeric_label
├── supplementary/          # Dataset-level source materials, if provided
├── dataset_description.md
└── metadata.json           # includes "prepared": true

Hidden test data is also prepared in place before final evaluation:

test_datasets/my_dataset/
└── test/
    ├── input/
    └── labels.csv          # id,numeric_label

The agent never sees test_datasets/ during training.

Active Workspace¶

Active execution area. In local mode this is ../workspace/runs/<agent_id>/; in Docker mode it is the temporary /workspace volume.

run/¶

Current run working directory:

<workspace_root>/run/
├── shared/
│   ├── .conda/                  # Shared Conda environment
│   ├── config.json
│   ├── environment.yml
│   └── splits/
├── current_iteration/
│   ├── current_step/            # Active step workspace
│   └── runtime_info/
├── iteration_0/                 # Archived iteration
├── iteration_1/
└── ...

best_iteration_snapshot/¶

Best iteration snapshot:

<workspace_root>/best_iteration_snapshot/
├── model_training/
│   ├── train.py
│   └── training_artifacts/
├── model_inference/
│   └── inference.py
├── runtime_info/
├── environment.yml
└── .conda/

Updated whenever a new best iteration is achieved.

fallbacks/¶

Reserved recovery area:

<workspace_root>/fallbacks/

This directory may be empty for normal runs.

run/shared/splits/¶

Versioned train/validation split folders:

<workspace_root>/run/shared/splits/
└── split_0/
    ├── train/
    │   ├── input/
    │   └── labels.csv
    ├── validation/
    │   ├── input/
    │   └── labels.csv
    └── mini_train/
        ├── input/
        └── labels.csv

Each time the agent changes the train/validation split, a new split_<n>/ folder is created. Iteration outputs record which split version they used. The input/ structure must match the original recorded structure across all splits and must not be modified. The mini_train/ folder is a small subset of training data (at most 100 samples) used for quick script validation.

reports/¶

Iteration reports are written here during runs. These are copied to outputs/<agent_id>/reports/ after completion.

logs/¶

Logs and auxiliary artifacts (metrics, run logs) are stored here and copied to outputs/<agent_id>/logs/.

outputs/¶

Final results after run completion:

outputs/<agent_id>/
├── best_iteration_snapshot/           # Best iteration artifacts
│   ├── model_training/
│   │   ├── train.py
│   │   └── training_artifacts/
│   ├── model_inference/
│   │   └── inference.py
│   ├── runtime_info/
│   ├── environment.yml
│   └── .conda/
├── run/                      # All iterations + data splits
│   ├── shared/
│   │   ├── config.json
│   │   └── splits/
│   │       └── split_0/
│   │           ├── train/
│   │           │   ├── input/
│   │           │   └── labels.csv
│   │           ├── validation/
│   │           │   ├── input/
│   │           │   └── labels.csv
│   │           └── mini_train/
│   │               ├── input/
│   │               └── labels.csv
│   ├── iteration_0/
│   ├── iteration_1/
│   └── ...
├── reports/
│   ├── markdown/
│   │   ├── run_report_iter_0.md
│   │   ├── run_report_iter_1.md
│   │   └── ...
│   └── pdf/
│       ├── iteration_0.pdf
│       ├── iteration_1.pdf
│       └── plots/
├── logs/                     # Logs and metrics
└── README.md                 # Run summary

File Notes¶

Iteration contents and artifact names can vary by run. Use <step_id>/output.json inside each archived iteration or best iteration snapshot as the structured source of truth for step outputs. Use outputs/<agent_id>/README.md for the most accurate per-run details.

Cleanup¶

Remove Specific Run¶

rm -rf outputs/<agent_id>

Clean Workspace¶

rm -rf ../workspace/runs/<agent_id>

Clean Everything¶

rm -rf outputs/*
rm -rf ../workspace/*
rm -rf datasets/*
rm -rf test_datasets/*

Docker Volumes¶

In Docker mode, workspace is mounted as a volume:

Code repository: read-only
Workspace: Read-write
Outputs: Read-write

This isolates agent execution from the host system.

Understanding Outputs - Using output files
Running Inference - Using trained models