candle-annotator

Author	SHA1	Message	Date
Marko Djordjevic	ff15adc847	feat: add SHA256 model integrity check before joblib.load() Add verify_model_checksum() that validates model files against a models/checksums.sha256 manifest before loading. Fails open when manifest is missing or file not listed (backward compat), raises HTTP 500 on hash mismatch. Created empty manifest placeholder. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-18 11:25:14 +01:00
Marko Djordjevic	b7f9b2e04d	security: replace exception details with generic error in ML service HTTP responses Replace all instances of `detail=str(e)`, `detail=f"...{exc}"`, and similar patterns that exposed internal exception messages to HTTP clients in services/ml/app/main.py. All exception details are now logged server-side only via logger.error(), while clients receive a generic "Internal server error" message. Fixes 9 handlers across predict, batch predict, pattern detection, training start, training runs fetch, training run delete, dataset info, build dataset, and model load endpoints. Mark task 5.1 as completed in tasks.md. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-18 11:23:41 +01:00
Marko Djordjevic	5f569d9134	feat(ml): add API key authentication via FastAPI Depends() on all endpoints except /health - Import Header, Depends, Security from fastapi - Add verify_api_key dependency: reads API_KEY env var, checks X-API-Key header, raises HTTP 401 if key mismatch; fail-open if env var not set - Apply Depends(verify_api_key) to all 14 non-health endpoints - /health endpoint remains unauthenticated for liveness probes - Mark task 3.2 as complete in tasks.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-18 11:04:41 +01:00
Marko Djordjevic	26ff80a682	fix: implement CORS origins configuration via environment variable - Replace hardcoded allow_origins=['*'] with dynamic configuration - Read CORS_ORIGINS environment variable (comma-separated list) - Default to 'http://localhost:3000' if CORS_ORIGINS is not set - Support multiple origins by splitting and stripping whitespace from env var	2026-02-18 11:02:00 +01:00
Marko Djordjevic	67dd7aa2f0	security: validate run_id format and add path containment check in ML service - Add `import re` to services/ml/app/main.py - In POST /model/load: validate run_id matches ^[a-zA-Z0-9_-]+$ before DB lookup; use Path.resolve() + directory containment check before loading model artifact - In DELETE /training/runs/{run_id}: validate run_id matches ^[a-zA-Z0-9_-]+$ before any processing; use Path.resolve() + directory containment check before deleting model artifact - Both endpoints return HTTP 400 with {"detail": "Invalid run_id format"} on invalid input - Mark task 2.2 as completed in openspec/changes/code-review-fix/tasks.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-18 11:00:19 +01:00
Marko Djordjevic	d3dcfcea7d	feat: auto-build training dataset from DB annotations before training - Add build_dataset_from_db() that exports candles from DB, runs feature engineering, and ingests span annotations into labeled CSV - Call it automatically in _run_training_background before training starts - Add POST /training/build-dataset endpoint for standalone use - Add Next.js proxy route /api/training/build-dataset - Update TrainingPanel: remove dataset-missing block on Start Training, show informational message that dataset builds automatically	2026-02-18 00:24:39 +01:00
Marko Djordjevic	b4956f3fb9	fix: call init_db() on ML service startup to create training_runs table	2026-02-18 00:08:23 +01:00
Marko Djordjevic	6ef102cf21	fix: training panel stuck button, stale runs on startup, add delete model	2026-02-17 22:23:50 +01:00
Marko Djordjevic	2a02669222	feat: add FastAPI model/load endpoint and all Next.js proxy routes (tasks 2-4)	2026-02-17 18:47:04 +01:00
Marko Djordjevic	b8e649e333	feat: add FastAPI pattern detection endpoints (Section 1) - Extract CDL pattern detection logic into services/ml/app/patterns.py with TALIB_PATTERNS dict, get_available_patterns(), validate_pattern_names(), and detect_patterns(candles, pattern_names) functions - Add GET /patterns/available endpoint returning all 54 supported CDL pattern names with display names - Add POST /patterns/detect endpoint accepting {candles, patterns}, running selected CDL functions, returning span annotations with source "talib" - Add input validation: reject invalid pattern names with HTTP 400, treat empty patterns list as "run all" Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-17 18:34:14 +01:00
Marko Djordjevic	f7b154f866	fix(ml): extract class labels from model when metadata is missing The model info returned empty labels array because the pkl file has no metadata dict. Now extracts labels from model.classes_ or model.model.classes_ as fallback.	2026-02-15 22:18:16 +01:00
Marko Djordjevic	be09cef098	fix(ml): add missing numpy import in main.py	2026-02-15 22:08:41 +01:00
Marko Djordjevic	40d6d1739e	fix(ml): add windowed feature flattening for inference parity The model was trained on 94-candle sliding windows flattened to 2820 features (94 candles x 30 features). Inference was sending raw per-candle features (27 columns). Changes: - Rewrite preprocessing to return (X, window_times) tuple - Add sliding window creation with correct feature ordering - Fill missing columns (average, barCount) with 0 for feature parity - Fill NaN from indicator warmup with 0 instead of dropping rows - Always compute all indicators (including MFI) for feature parity - Update predict and batch predict endpoints for new signature	2026-02-15 22:07:06 +01:00
Marko Djordjevic	f850728d44	fix(api): add GET /api/charts/[id] and fix batch prediction - Add GET handler to /api/charts/[id] route to fetch chart metadata - Fix batch prediction to use regular /predict endpoint with database candles - Remove /predict/batch usage (was designed for file-based predictions) - Make volume field optional in CandleData model (database candles don't have volume) - Convert timestamps to ISO dates for batch requests Known issue: TA-Lib indicators failing with 'input array type is not double' - May need to ensure candle data is float64/double type before processing	2026-02-15 21:49:22 +01:00
Marko Djordjevic	6d0d67e39b	fix(ml): make pair and timeframe optional in PredictRequest - Change pair and timeframe fields from required to optional - Frontend only sends candles array, not pair/timeframe metadata - These fields are only used for logging, not prediction logic - Update logging to handle None values with 'unknown' fallback - Fixes 422 validation error on /predict endpoint	2026-02-15 21:43:14 +01:00
Marko Djordjevic	3a83fd38e9	feat(ml): implement FastAPI inference service with model loading, preprocessing, and prediction endpoints	2026-02-15 14:29:07 +01:00

16 commits