Suite coverage
Elo ladder evaluation against classic engines, plus tactical puzzle sets and positional endgame studies.
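As a rough illustration of how a ladder result becomes a rating, the sketch below applies the standard Elo expected-score and update formulas. The function names, the K-factor of 20, and the sample ratings are assumptions chosen for the example; they do not describe the suite's actual configuration.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of player A against player B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_rating(rating: float, opponent: float, score: float, k: float = 20.0) -> float:
    """Return the new rating after one game.

    score is 1.0 for a win, 0.5 for a draw, 0.0 for a loss.
    k (the K-factor) is an assumed value, not one taken from the suite.
    """
    return rating + k * (score - expected_score(rating, opponent))


def run_ladder(start_rating: float, games: list[tuple[float, float]], k: float = 20.0) -> float:
    """Apply updates in order over (opponent_rating, score) pairs."""
    rating = start_rating
    for opponent, score in games:
        rating = update_rating(rating, opponent, score, k)
    return rating


if __name__ == "__main__":
    # Hypothetical ladder: three rungs of increasing strength.
    ladder = [(1400.0, 1.0), (1600.0, 0.5), (1800.0, 0.0)]
    print(round(run_ladder(1500.0, ladder), 1))
```

Updating after every game keeps the sketch short; a real ladder may instead fit a single performance rating over a whole batch of games against each engine.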
Benchmark
Strategic planning benchmarks that evaluate search depth, memory, and move-by-move compositional reasoning.
Sharpening endgame conversion rates while keeping explanations accessible to human analysts.