System Improvement Roadmap

1. Purpose

This document tracks the highest-value engineering and product improvements for the StableSteering system itself.

It focuses on:

It does not focus on research questions or study design. Those belong in:

2. Current Baseline

The current MVP already includes:

The next phase is not about making the system merely functional. It is about making it faster to trust, easier to extend, and safer to operate in real research workflows.

3. Priority Levels

4. P0: Near-Term System Priorities

4.1 Strengthen real-backend end-to-end coverage

Why it matters:

Implementation notes:

Success signal:

4.2 Add export packaging for session trace bundles

Why it matters:

Implementation notes:

Success signal:

4.3 Improve runtime diagnostics depth

Why it matters:

Implementation notes:

Success signal:

4.4 Harden trace and export path hygiene

Why it matters:

Implementation notes:

Success signal:

5. P1: Workflow and UX Improvements

5.1 Build true mode-specific feedback controls

Why it matters:

Implementation notes:

Success signal:

5.2 Improve replay and trace navigation

Why it matters:

Implementation notes:

Success signal:

5.3 Add richer async job visibility

Why it matters:

Implementation notes:

Success signal:

5.4 Improve frontend resilience

Why it matters:

Implementation notes:

Success signal:

5.5 Add richer elicitation modes and UI workflows

Why it matters:

Implementation notes:

Success signal:

6. P1: Performance Improvements

6.1 Reduce repeated pipeline warm-up cost

Why it matters:

Implementation notes:

Success signal:

6.2 Improve database structure for future growth

Why it matters:

Implementation notes:

Success signal:

6.3 Optimize artifact lifecycle

Why it matters:

Implementation notes:

Success signal:

6.4 Add a synthetic-data generation pipeline

Why it matters:

Implementation notes:

Success signal:

6.5 Expand steering support to more diffusion pipelines

Why it matters:

Implementation notes:

Success signal:

6.6 Add stronger sampler families

Why it matters:

Implementation notes:

Success signal:

6.7 Add stronger preference-model implementations

Why it matters:

Implementation notes:

Success signal:

6.8 Add steering-dimension selection methods

Why it matters:

Implementation notes:

Success signal:

7. P1: Synthetic Data Infrastructure and Tooling

7.1 Support anchor-seeking synthetic-user simulation

Why it matters:

Implementation notes:

Success signal:

7.2 Support diversity-seeking synthetic-user simulation

Why it matters:

Implementation notes:

Success signal:

7.3 Add synthetic corpus management

Why it matters:

Implementation notes:

Success signal:

7.4 Add synthetic-data quality checks

Why it matters:

Implementation notes:

Success signal:

8. P2: Architecture and Scale Improvements

8.1 Add a pluggable storage layer for shared deployments

Why it matters:

Implementation notes:

Success signal:

8.2 Improve release automation

Why it matters:

Implementation notes:

Success signal:

8.3 Add API schema snapshots

Why it matters:

Implementation notes:

Success signal:

9. Milestone View

Milestone A: Operator Trust

Milestone B: Better Interactive Use

Milestone C: Hardening and Release Maturity

10. Suggested Execution Order

  1. expand real-backend end-to-end validation
  2. package session trace bundles for export
  3. improve diagnostics depth
  4. build mode-specific feedback UI
  5. add richer elicitation modes and critique-aware UI
  6. refine async progress states
  7. improve replay and trace navigation
  8. add steering-dimension selection and sweep tooling
  9. add stronger sampler families
  10. add stronger preference-model implementations
  11. reduce pipeline warm-up cost
  12. normalize high-value SQLite query paths
  13. add artifact retention and cleanup tooling
  14. add anchor-seeking synthetic-data pipeline
  15. add diversity-seeking synthetic-data pipeline
  16. build synthetic corpus management and quality checks
  17. expand generation contracts for multiple diffusion pipeline types
  18. add image-prompt steering support
  19. add inpainting steering support
  20. add ControlNet steering support
  21. harden release automation
  22. prepare shared-storage evolution
  23. add API schema snapshots

11. Summary

The main engineering goal is no longer “make the MVP exist.” That is done.

The next system phase should make StableSteering: