- Fix CCI indicator to use HLC prices instead of close only - Parse datetime column when loading enriched CSV - Strip timezone from annotation timestamps - Fix TA-Lib pattern names (CDL3WHITESOLDIERS, CDL3BLACKCROWS) - Exclude programmatic label columns from training features - Fix classification report to handle missing classes - Update MLflow tracking to use localhost:5000 - Grant PostgreSQL permissions to ml_user Pipeline now runs successfully end-to-end: - Feature engineering: 2543 rows, 31 columns - Annotation ingestion: 286 samples - Training: 89.47% test accuracy with Random Forest
15 lines
240 B
YAML
15 lines
240 B
YAML
channels:
|
|
- conda-forge
|
|
dependencies:
|
|
- python=3.13.5
|
|
- pip
|
|
- pip:
|
|
- mlflow==3.9.0
|
|
- cloudpickle==3.1.2
|
|
- numpy==2.4.2
|
|
- pandas==2.3.3
|
|
- psutil==7.2.2
|
|
- pyarrow==22.0.0
|
|
- scikit-learn==1.8.0
|
|
- scipy==1.17.0
|
|
name: mlflow-env
|