Tatsunori Hashimoto recommended on on 1/23/2024This is a pretty interesting multi-domain perplexity eval. Especially the multi-domain evals seem like a useful resource for thinking about 'are there tradeoffs between different domain perplexities', and 'whats the right perplexity mix to optimize for?'