candle-annotator/openspec/changes
Marko Djordjevic b9beea1574 fix(ml): add _model_swap_lock to prediction reads for thread-safe model access
In /predict and /predict/batch endpoints, grab the model reference under
_model_swap_lock before running inference. Inference itself runs outside
the lock (using a local variable) to avoid blocking model swaps during
potentially slow computation.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-18 11:26:33 +01:00
..
archive sync: ml-ui-connection delta specs to main specs 2026-02-18 10:21:05 +01:00
code-review-fix fix(ml): add _model_swap_lock to prediction reads for thread-safe model access 2026-02-18 11:26:33 +01:00