My benchmark for large language models:
A benchmark of ~100 tests for language models, collected from actual questions
I've asked of language models in the last year.
My Research Idea Logfile, 2016-2019:
A description of how I keep track of my research ideas,
with my complete log from when I started it in 2016 through to the end of 2019.