• Variable-length generation up to 6:20, at per-second granularity

  • Full song composition on-device (first open model to do this)

  • Audio inpainting: single-segment editing, multi-segment editing, and causal continuation (extending a track beyond its original endpoint) LoRA training support with published documentation

  • Fully licensed training data

  • Commercial use permitted under the Community License

The model family:

  • Stable Audio 3.0 Small SFX — on-device sound effects, up to 2 min

  • Stable Audio 3.0 Small — full music composition on-device, up to 2 min

  • Stable Audio 3.0 Medium — higher musicality and longer tracks, up to 6:20