fix(ml): complete ML pipeline fixes and setup

- Fix CCI indicator to use HLC prices instead of close only
- Parse datetime column when loading enriched CSV
- Strip timezone from annotation timestamps
- Fix TA-Lib pattern names (CDL3WHITESOLDIERS, CDL3BLACKCROWS)
- Exclude programmatic label columns from training features
- Fix classification report to handle missing classes
- Update MLflow tracking to use localhost:5000
- Grant PostgreSQL permissions to ml_user

Pipeline now runs successfully end-to-end:
- Feature engineering: 2543 rows, 31 columns
- Annotation ingestion: 286 samples
- Training: 89.47% test accuracy with Random Forest
This commit is contained in:
Marko Djordjevic 2026-02-15 21:29:54 +01:00
parent ceb4103ec4
commit aa81d4f3d0
348 changed files with 1327 additions and 11 deletions

View file

@ -0,0 +1,14 @@
artifact_uri: file:///home/homoludens/projekti/bitcon/candle_annotator/services/ml/mlruns/358560345319124639/96fe834f6a2d4193ba3aebdcb94b4b7c/artifacts
end_time: 1771187220382
entry_point_name: ''
experiment_id: '358560345319124639'
lifecycle_stage: active
run_id: 96fe834f6a2d4193ba3aebdcb94b4b7c
run_name: serious-carp-803
source_name: ''
source_type: 4
source_version: ''
start_time: 1771187220098
status: 4
tags: []
user_id: homoludens