How good is Crowkis agent memory, really? The LoCoMo and LongMemEval numbers
We ran Crowkis memory against two public, hostile retrieval benchmarks — SNAP's LoCoMo and LongMemEval — on a laptop with no cloud calls. Here are the recall numbers, by question type, with the reranker on and off.
Read it →