Hindsight Benchmarks
Industry benchmarks and model leaderboard for Hindsight — the memory layer for AI agents.
Industry Benchmarks
Model Leaderboards
Leaderboard
Retain Leaderboard
Ranked LLMs for retain() and observation consolidation — fact extraction quality, speed, cost, and reliability.
14
Models
Top model
View leaderboard
Leaderboard
Reflect Leaderboard
Ranked LLMs for the reflect operation in Hindsight.
14
Models
Top model
View leaderboard
Leaderboard
Reranker Leaderboard
Ranked rerankers for recall() — which reranker surfaces the most relevant facts first.
10
Rerankers
Top model
View leaderboard
Leaderboard
Embeddings Leaderboard
Ranked embedding models for Hindsight — affects both retain() storage and recall() retrieval quality.
7
Models
Top model
View leaderboard