token-monitor

4 commits 2 branches 0 tags 145 KiB

Author	SHA1	Message	Date
Hannibal Smith	1b4e299461	build: add gemini and xai provider modules Expands token-monitor with two new provider types: - providers/gemini.js — Google Gemini API (body-based quota, no headers) - Probes generateContent endpoint (1 token), falls back gemini-2.0-flash → gemini-2.5-flash - Parses QuotaFailure violations + RetryInfo from 429 JSON body - Returns: status, quota_violations[], retry_delay_seconds, severity - providers/xai.js — x.ai/Grok (OpenAI-compatible header schema) - Reads x-ratelimit-{limit,remaining}-{requests,tokens} headers - Handles: no_key, ok, rate_limited, invalid_key states - Warning threshold: < 10% remaining on requests or tokens Both providers handle missing API keys gracefully (status: no_key). Classification via providers/index.js using baseUrl patterns. 140/140 tests passing. Closes recon findings from trentuna/a-team#91.	2026-04-04 17:52:37 +00:00
B.A. Baracus	988618e165	test: add gemini and xai parser unit tests	2026-04-04 17:51:38 +00:00
Hannibal Smith	07a544c50d	build: token-monitor v0.1.0 — modular LLM API quota visibility Implements modular provider probing with two distinct header schemas: - Teams direct (unified schema): 5h/7d utilization floats, status, reset countdown - Shelley proxy (classic schema): token/request counts + Exedev-Gateway-Cost (USD/call) - api-ateam: reports no billing data (confirmed non-existent by recon) Key: uses claude-haiku-4-5-20251001 for minimal probe calls (1 token). Rate-limit headers present on ALL responses (200 and 429). 113/113 tests passing. Built from Face recon (trentuna/a-team#91) — live header capture confirmed unified schema with utilization floats replaces old per-count schema.	2026-04-04 17:01:05 +00:00
Vigilio Desto	760049a25e	Initial commit	2026-04-04 16:35:33 +00:00