Changelog
All notable changes to Agent Probe.
2025-03-15
Guided Tour, User Management & Chat Filters
v0.4.0- Added interactive guided tour for new users
- User management modal for admin users (approve / reject signups)
- Chat panel filter improvements — search and filter test results
- Improved dashboard responsive layout
2025-03-10
Profile Menu, API Keys & Webhook Improvements
v0.3.0- Profile menu with settings, API key management, and password change
- API key generation (
ap-xxxx...format) for programmatic access - Signup approval workflow — new users require admin approval
- Webhook panel improvements with cron scheduling
- JWT-based authentication system
2025-02-25
Test Comparison & History
v0.2.0- Test history panel — view and reload past test runs
- Side-by-side test comparison (select 2 tests to compare)
- Evaluator-level diff analysis between test runs
- Improved real-time streaming of test results
- Configuration strip showing active test parameters
- Turkish language support for dashboard UI
2025-02-10
Initial Release
v0.1.0- 6-layer test pyramid with 16 evaluators
- Golden Datasets in English and Turkish (MMLU, TruthfulQA, BBQ, ToxiGen, JailbreakBench, PAWS, AdvGLUE, NeedleBench)
- Split-screen dashboard: chat + real-time evaluation
- 300+ LLM models via OpenRouter integration
- Manual and automated test modes
- Docker deployment with Google Cloud Run support
- JSON and HTML report generation