candle-annotator

Author	SHA1	Message	Date
Marko Djordjevic	2e02d155af	feat: add Zod schema validation to training/start route (task 4.4) Validates model_type as a non-empty string using .safeParse(); returns HTTP 400 with error details on invalid input. Marks task 4.4 as done. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-18 11:14:00 +01:00
Marko Djordjevic	4cffc223b3	feat: add Zod schema validation to model/load route Validate run_id in POST /api/model/load using Zod: - run_id must be a non-empty string matching /^[a-zA-Z0-9_-]+$/ - Returns HTTP 400 with error details if validation fails - Validated data is forwarded to the inference service Marks task 4.3 as complete in tasks.md. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-18 11:13:13 +01:00
Marko Djordjevic	5c399037c3	feat: add Zod validation to predict/batch route (task 4.2) Add BatchPredictRequestSchema with Zod to validate pair, timeframe, start_date, and end_date fields. Returns HTTP 400 with flattened error details on invalid input. Forward only validated data to the inference service. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-18 11:12:38 +01:00
Marko Djordjevic	3361236d3f	feat: add Zod schema validation to predict API route - Add CandleSchema validating time, open, high, low, close (number) and optional volume - Add PredictRequestSchema validating pair (non-empty string), timeframe (non-empty string), candles array - Use safeParse() and return HTTP 400 with error details on invalid input - Forward only validated data to the inference service - Mark task 4.1 as done in tasks.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-18 11:11:58 +01:00
Marko Djordjevic	c023702644	feat: add API_KEY to .env.example with placeholder and instructions - Add API_KEY environment variable with placeholder value 'change_me_to_a_strong_random_key' - Include helpful comment explaining its purpose: authentication between Next.js and ML service - Provide command for generating strong random value: openssl rand -hex 32 - Mark task 3.4 as completed	2026-02-18 11:06:47 +01:00
Marko Djordjevic	4a3e4a48ba	feat: forward X-API-Key header from Next.js proxy routes to ML service All 12 Next.js API routes that proxy requests to the ML service (INFERENCE_API_URL / localhost:8001) now include the X-API-Key header read from process.env.API_KEY. Affected routes: - /api/predict - /api/predict/batch - /api/model/info - /api/model/load - /api/training/start - /api/training/runs - /api/training/runs/[run_id] (DELETE) - /api/training/dataset-info - /api/training/active - /api/training/build-dataset - /api/patterns/available - /api/patterns/detect Marks task 3.3 as complete in openspec/changes/code-review-fix/tasks.md. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-18 11:06:18 +01:00
Marko Djordjevic	5f569d9134	feat(ml): add API key authentication via FastAPI Depends() on all endpoints except /health - Import Header, Depends, Security from fastapi - Add verify_api_key dependency: reads API_KEY env var, checks X-API-Key header, raises HTTP 401 if key mismatch; fail-open if env var not set - Apply Depends(verify_api_key) to all 14 non-health endpoints - /health endpoint remains unauthenticated for liveness probes - Mark task 3.2 as complete in tasks.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-18 11:04:41 +01:00
Marko Djordjevic	577bb2e56e	feat: add API key auth middleware for /api/* routes (task 3.1) - Create src/middleware.ts with Next.js middleware - Reads API_KEY env var and checks X-API-Key header on all /api/* routes - Skips auth for /api/health endpoint - Fails open (with warning) when API_KEY is not configured - Returns 401 Unauthorized when key is missing or mismatched - Mark task 3.1 as complete in tasks.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-18 11:02:51 +01:00
Marko Djordjevic	0cd21887e4	mark: task 2.5 complete (CORS configuration)	2026-02-18 11:02:10 +01:00
Marko Djordjevic	94bc5768d1	feat: add file type validation to upload endpoint - Validate filename ends with .csv (case-insensitive) - Validate MIME type is text/* or application/csv or text/csv - Return HTTP 400 with error message if validation fails - Mark task 2.4 as complete	2026-02-18 11:01:28 +01:00
Marko Djordjevic	0e239dc3da	security: add file size (10MB) and row count (500k) limits to upload route - Reject uploads larger than 10MB before reading file content - Reject CSVs with more than 500,000 data rows after parsing - Checks placed as early as possible in the handler flow - Mark task 2.3 as done in tasks.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-18 11:01:02 +01:00
Marko Djordjevic	67dd7aa2f0	security: validate run_id format and add path containment check in ML service - Add `import re` to services/ml/app/main.py - In POST /model/load: validate run_id matches ^[a-zA-Z0-9_-]+$ before DB lookup; use Path.resolve() + directory containment check before loading model artifact - In DELETE /training/runs/{run_id}: validate run_id matches ^[a-zA-Z0-9_-]+$ before any processing; use Path.resolve() + directory containment check before deleting model artifact - Both endpoints return HTTP 400 with {"detail": "Invalid run_id format"} on invalid input - Mark task 2.2 as completed in openspec/changes/code-review-fix/tasks.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-18 11:00:19 +01:00
Marko Djordjevic	870f92d208	feat: add run_id format validation in DELETE training/runs endpoint Validate that run_id matches /^[a-zA-Z0-9_-]+$ regex before interpolating into the API URL. Returns HTTP 400 with 'Invalid run_id format' error if validation fails. This prevents potential URL injection attacks and invalid identifier usage.	2026-02-18 10:58:54 +01:00
Marko Djordjevic	4e5ce321b9	chore: bind ML service port to 127.0.0.1:8001:8001 for localhost-only access - Changed ML service port binding from 8001:8001 to 127.0.0.1:8001:8001 in docker-compose.yml - Marks task 1.8 as complete in tasks.md	2026-02-18 10:58:31 +01:00
Marko Djordjevic	c327ba3370	bind: MLflow port to 127.0.0.1:5000:5000 in docker-compose.yml Changes: - Updated docker-compose.yml MLflow service port binding from 5000:5000 to 127.0.0.1:5000:5000 to restrict access to localhost only for security - Marked task 1.7 as complete in tasks.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-18 10:58:11 +01:00
Marko Djordjevic	9efa1dbbcc	fix: Bind PostgreSQL port to 127.0.0.1:5432:5432 for localhost-only access - Changed PostgreSQL service port binding from 5432:5432 to 127.0.0.1:5432:5432 in docker-compose.yml - This restricts PostgreSQL to listen only on localhost, improving security by preventing access from other interfaces - Marked task 1.6 as completed Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-18 10:57:55 +01:00
Marko Djordjevic	e3469ec39f	fix: replace hardcoded DB credentials with env var interpolation in docker-compose.yml All DATABASE_URL values and postgres service env vars now use \${POSTGRES_USER}, \${POSTGRES_PASSWORD}, \${POSTGRES_DB} interpolation instead of hardcoded ml_user/ml_password/candle_annotator values. Also updated pg_isready healthcheck to use the same env vars. Closes task 1.5. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-18 10:57:31 +01:00
Marko Djordjevic	9bc82b822c	security: remove credential SQL comments and add DATABASE_URL fail-fast check - Remove hardcoded SQL comments containing 'ml_user' and 'ml_password' - Remove fallback default credentials in DATABASE_URL construction - Add fail-fast validation: raise RuntimeError if DATABASE_URL env var is missing or empty - Mark task 1.4 as complete in code-review-fix/tasks.md	2026-02-18 10:56:49 +01:00
Marko Djordjevic	55ee9c936a	fix: replace real credentials in .env.example with placeholders - Replace ml_password with change_me_to_a_strong_password placeholder - Replace ml_user with your_db_user placeholder - Mark task 1.3 as completed in tasks.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-18 10:56:23 +01:00
Marko Djordjevic	099f334fe9	chore: mark task 1.2 as completed	2026-02-18 10:55:56 +01:00
Marko Djordjevic	4ba1327a53	task 1.1: add .env to .gitignore and untrack from git	2026-02-18 10:55:24 +01:00
Marko Djordjevic	c0f5654450	sync: ml-ui-connection delta specs to main specs	2026-02-18 10:21:05 +01:00
Marko Djordjevic	12a9603fce	feat: add TalibPatternPanel, TrainingPanel, ModelSelector UI components (tasks 5-8) - TalibPatternPanel: pattern checkboxes, detect button, results summary, clear-all and per-pattern delete - TrainingPanel: model type selector, dataset info, start training, polling, run history - ModelSelector: dropdown of completed runs, wired into PredictionPanel for model switching - page.tsx: integrate all three panels into sidebar, wire callbacks (model load, annotations refresh) - tasks.md: mark all 39 tasks complete	2026-02-17 18:55:52 +01:00
Marko Djordjevic	2a02669222	feat: add FastAPI model/load endpoint and all Next.js proxy routes (tasks 2-4)	2026-02-17 18:47:04 +01:00
Marko Djordjevic	b8e649e333	feat: add FastAPI pattern detection endpoints (Section 1) - Extract CDL pattern detection logic into services/ml/app/patterns.py with TALIB_PATTERNS dict, get_available_patterns(), validate_pattern_names(), and detect_patterns(candles, pattern_names) functions - Add GET /patterns/available endpoint returning all 54 supported CDL pattern names with display names - Add POST /patterns/detect endpoint accepting {candles, patterns}, running selected CDL functions, returning span annotations with source "talib" - Add input validation: reject invalid pattern names with HTTP 400, treat empty patterns list as "run all" Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-17 18:34:14 +01:00
Marko Djordjevic	38df874255	chore: archive ml-db-consolidation change and sync specs - Archived change to openspec/changes/archive/2026-02-17-ml-db-consolidation/ - Created new postgres-data-layer spec with PostgreSQL connection, schema definitions, Drizzle migrations, npm deps, and SQLite migration requirements - Updated docker-deployment spec: Docker Compose now PostgreSQL-based (postgres dependency, ml-data volume, DATABASE_URL); env vars updated (DATABASE_URL added, DATABASE_PATH removed); database persistence updated to PostgreSQL volumes; health check updated to PostgreSQL - Updated ml-training spec: added database name scenario (candle_annotator) and new direct annotation data access requirement Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-17 18:22:28 +01:00
Marko Djordjevic	0e8dcc6707	chore: archive line-rectangle-annotations change and sync specs - Archived change to openspec/changes/archive/2026-02-17-line-rectangle-annotations/ - Updated annotation-tools spec: added rectangle tool mode, TrendLine plugin rendering, line hit testing, line selection handles; updated line drawing and delete requirements; removed SVG overlay rendering - Created new rectangle-annotation spec with full requirements for rectangle drawing, rendering, hit testing, selection, deletion, and database storage Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-17 18:16:49 +01:00
Marko Djordjevic	d1557a3846	fix: resolve numpy type conversion issues in ML service data access - Convert numpy.int64 to Python int before passing to SQLAlchemy queries - Prevents psycopg2.ProgrammingError: can't adapt type 'numpy.int64' - Applied to get_candles(), get_span_annotations(), and get_point_annotations() - All ML service database access tests now passing successfully	2026-02-17 14:10:21 +01:00
Marko Djordjevic	5377431c9d	test: verify Next.js application works with PostgreSQL - Updated .env to use DATABASE_URL instead of DATABASE_PATH - Tested all API endpoints: health, charts, candles, span annotations - Confirmed JSONB fields work correctly (geometry, sub_spans, model_prediction) - All 2,836 rows accessible via API - Database connection pooling working correctly	2026-02-17 14:02:22 +01:00
Marko Djordjevic	bfe437857b	feat: add Python migration script and successfully test SQLite to PostgreSQL data migration - Created scripts/migrate-sqlite-to-postgres.py as alternative to TypeScript version - Handles all type conversions: timestamps, booleans, and JSONB fields - Successfully migrated all 2,836 rows from SQLite to PostgreSQL - Verified data integrity: all 6 tables migrated correctly - Charts: 1, Candles: 2,592, Annotations: 4, Span annotations: 223	2026-02-17 14:01:21 +01:00
Marko Djordjevic	5f70f13da3	feat: migrate from SQLite to PostgreSQL - complete schema and API updates - Remove better-sqlite3, add pg driver - Convert schema to PostgreSQL types (serial, timestamp, boolean, jsonb) - Generate fresh PostgreSQL migrations - Update database connection layer with pg.Pool - Fix all API routes: remove JSON.parse/stringify, use native timestamps and booleans - Update drizzle.config.ts and .env.example for PostgreSQL	2026-02-17 13:43:06 +01:00
Marko Djordjevic	2cde02b722	docs: mark all line-rectangle-annotations tasks as complete	2026-02-16 12:14:22 +01:00
Marko Djordjevic	73e07c9050	feat: complete SVG overlay removal and line endpoint dragging (tasks 6.1-7.3)	2026-02-16 12:13:29 +01:00
Marko Djordjevic	aea1791122	feat: complete rectangle annotation tool (tasks 4.1-5.2) - Add rectangle primitive management in CandleChart - Handle chart switching with proper primitive cleanup - Implement rectangle selection via hitTest - Add rectangle deletion in delete tool - Add rectangle tool button to Toolbox - Wire rectangle tool with toggle behavior	2026-02-16 11:58:49 +01:00
Marko Djordjevic	82fd5ce819	feat: wire up drawing interaction for line and rectangle tools (tasks 3.1-3.5) - Add drawing state and preview primitive refs - Implement two-click drawing flow via subscribeClick - Add crosshair move handler to update preview in real-time - Add Escape key handler to cancel drawing - Manage TrendLine primitives for saved line annotations	2026-02-16 11:55:06 +01:00
Marko Djordjevic	bec0aeb6ca	feat: enhance TrendLine plugin and create RectangleDrawingPrimitive - Add hitTest, setSelected, attached/detached lifecycle methods to TrendLine - Add preview mode support with dashed lines and reduced opacity - Draw selection handles on endpoints when selected - Create RectangleDrawingPrimitive plugin with full ISeriesPrimitive implementation - Support preview mode, selection, hit testing, and autoscaling for rectangles - Set z-order to bottom for rectangles to render behind candlesticks Tasks completed: 1.1-1.4, 2.1-2.7	2026-02-16 11:51:07 +01:00
Marko Djordjevic	28e3f83cf7	archive: candle-backend change complete	2026-02-16 11:44:53 +01:00
Marko Djordjevic	7e0579f65d	sync: migrate delta specs to main openspec/specs - Added 5 new capabilities: feature-engineering, annotation-ingestion, ml-training, ml-inference, prediction-ui - Updated 2 existing capabilities: backend-api, span-annotation - All specs synced from openspec/changes/candle-backend/specs/	2026-02-16 11:44:34 +01:00
Marko Djordjevic	65f00e6ce7	feat: complete prediction UI feedback tasks (11.2, 11.4, 11.5) - Implement disagreement visual highlighting with distinct colors - Yellow highlight for 'missed_by_human' predictions - Orange for 'label_mismatch' disagreements - Warning icon on disagreement markers - Add click-to-convert prediction feedback - Click disagreement predictions to create span annotations - Auto-fill with predicted label and times - Set source as 'model_confirmed' or 'model_corrected' - Add dismiss action for false positive predictions - Alt+Click or Ctrl+Click to dismiss predictions - Saves negative annotation with label 'O' - Records original prediction in model_prediction field - Filter predictions when 'Show only disagreements' is enabled	2026-02-16 11:40:55 +01:00
Marko Djordjevic	21f184aa8d	feat(ui): implement disagreement detection, prediction summary, loading states, and update documentation - Add disagreement detection logic comparing human annotations vs predictions - Display prediction summary in PredictionPanel (agreements/disagreements) - Wire up 'Show only disagreements' filter toggle - Add loading overlay during prediction fetching - Update docker-compose.yml with healthchecks for all services - Update DEPLOYMENT.md with comprehensive ML service setup instructions - Update README.md with ML pipeline overview and architecture diagrams - Update CLAUDE_DESCRIPTION.md with v3.0.0 ML integration details Remaining tasks (11.2, 11.4, 11.5) deferred - core functionality complete	2026-02-15 16:34:02 +01:00
Marko Djordjevic	952eb7413c	feat(ui): add prediction chart rendering with histogram overlay, markers, filtering, and visibility toggle - Add histogram series to CandleChart for per-bar prediction colors (15% opacity) - Add series markers showing label name and confidence % at prediction span starts - Implement confidence threshold filtering for both histogram and markers - Implement label type filtering from PredictionPanel checkboxes - Implement prediction layer visibility toggle (show/hide) - Add getVisibleCandles method to CandleChartHandle for on-demand prediction fetching - Pass prediction state props from page.tsx to CandleChart Tasks 10.1-10.5 complete.	2026-02-15 16:26:17 +01:00
Marko Djordjevic	28ebe2c5d1	feat(ui): add prediction state management and PredictionPanel component - Create prediction type definitions in src/types/predictions.ts - Add prediction state management to page.tsx with caching - Implement PredictionPanel component with: - Master visibility toggle - Model info display (name, version, type, metrics) - Action buttons (Run on Visible, Predict All) - Confidence threshold slider - Label filter checkboxes with per-class metrics - Disagreement filter toggle - Prediction summary display - Model server offline banner - Add on-demand and batch prediction fetching - Implement prediction caching by chart and model version - Add health polling for inference API (30s interval when offline) - Ensure annotation tools work independently of prediction API Tasks completed: 9.1-9.5, 12.1-12.3 (59/78 total)	2026-02-15 16:20:07 +01:00
Marko Djordjevic	bb1b6d573f	feat(api): add span annotation export and feedback loop support - Add GET /api/span-annotations/export endpoint for ML pipeline JSON/CSV export - Add source and model_prediction fields to span_annotations schema - Update POST endpoint to accept source (human/model/human_correction) and model_prediction metadata - Support negative annotations (label 'O' for user corrections to model predictions) - Create migration 0005 for new schema fields Completes tasks 8.1-8.4 of candle-backend change	2026-02-15 14:35:31 +01:00
Marko Djordjevic	205021e810	feat(api): add Next.js proxy routes for ML inference service	2026-02-15 14:30:09 +01:00
Marko Djordjevic	3a83fd38e9	feat(ml): implement FastAPI inference service with model loading, preprocessing, and prediction endpoints	2026-02-15 14:29:07 +01:00
Marko Djordjevic	f4c0f9a836	feat(ml): implement training stage with MLflow tracking and model wrappers - Create RandomForestModel and XGBoostModel wrappers with class weight support - Implement temporal and random train/val/test splitting - Add MLflow experiment tracking with full parameter and metric logging - Create evaluation module for confusion matrix, feature importance, and classification reports - Implement model training with sklearn/xgboost flavor logging and optional registry registration - Store training run metadata in PostgreSQL - Wire training stage into pipeline.py orchestrator - Support both RandomForest and XGBoost models with configurable hyperparameters	2026-02-15 14:22:19 +01:00
Marko Djordjevic	16763b967e	feat(ml): implement annotation ingestion with windowed/BIO encoding and TA-Lib patterns	2026-02-15 12:28:58 +01:00
Marko Djordjevic	fd29ab91e0	feat(ml): implement feature engineering pipeline - Create pipeline.py with CLI argument parsing for running stages - Implement TA-Lib indicator computation with multi-output support - Add candle feature extraction (body_size, wicks, ratios, etc.) - Create custom feature loader with dynamic module import - Wire all feature engineering stages with NaN handling - Tasks completed: 2.2, 2.3, 3.1, 3.2, 3.3, 3.4, 3.5	2026-02-15 12:22:59 +01:00
Marko Djordjevic	ea339a54a7	feat(ml): add database schema, config parser, and DVC setup - Initialize DVC with local storage backend (task 1.6) - Create PostgreSQL schema for training_runs table (task 1.7) - Add SQLAlchemy database connection setup (task 1.8) - Create Pydantic config models for pipeline.yaml (task 2.1) - Add migration runner for database setup - Fix pyproject.toml package discovery config	2026-02-15 12:08:53 +01:00
Marko Djordjevic	1a653c5866	feat: add ML service scaffolding with Python FastAPI, Docker, and MLflow setup	2026-02-15 11:58:31 +01:00

1 2 3 4

185 commits