Changelog

All notable changes to Agent Probe.

2025-03-15

Guided Tour, User Management & Chat Filters

v0.4.0
  • Added interactive guided tour for new users
  • User management modal for admin users (approve / reject signups)
  • Chat panel filter improvements — search and filter test results
  • Improved dashboard responsive layout
2025-03-10

Profile Menu, API Keys & Webhook Improvements

v0.3.0
  • Profile menu with settings, API key management, and password change
  • API key generation (ap-xxxx... format) for programmatic access
  • Signup approval workflow — new users require admin approval
  • Webhook panel improvements with cron scheduling
  • JWT-based authentication system
2025-02-25

Test Comparison & History

v0.2.0
  • Test history panel — view and reload past test runs
  • Side-by-side test comparison (select 2 tests to compare)
  • Evaluator-level diff analysis between test runs
  • Improved real-time streaming of test results
  • Configuration strip showing active test parameters
  • Turkish language support for dashboard UI
2025-02-10

Initial Release

v0.1.0
  • 6-layer test pyramid with 16 evaluators
  • Golden Datasets in English and Turkish (MMLU, TruthfulQA, BBQ, ToxiGen, JailbreakBench, PAWS, AdvGLUE, NeedleBench)
  • Split-screen dashboard: chat + real-time evaluation
  • 300+ LLM models via OpenRouter integration
  • Manual and automated test modes
  • Docker deployment with Google Cloud Run support
  • JSON and HTML report generation