The SherlockBench Problem-sets
SherlockBench problem-sets are just files written in Clojure which can be loaded by the API server. Each has it's own repo in GitHub.
Irene1 Problem-Set
This is our newest problem-set. It is very small to make it easy to run and is designed to test fluid intelligence.
GitHub Link: sherlockbench-problems-irene1
Mycroft1 Problem-Set
This contains a lot of problems which smart models sometimes get right.
GitHub Link: sherlockbench-problems-mycroft1
Sherlock1 Problem-Set
This is the problem-set we use on the Leaderboard. It has a mixture of easy and hard problems.
GitHub Link: sherlockbench-problems-sherlock1
Moriarty1 Problem-Set
This problem-set is designed for testing frontier reasoning models which get very high scores on Sherlock1.
GitHub Link: sherlockbench-problems-moriarty1
InterroBench Problem-Set
This is a port of the version 6 problem-set from the old InterroBench benchmark.
GitHub Link: sherlockbench-problems-interrobench6