Generative AI
Large Language Models (LLMs)
Models
Proprietary
Open-weights
Evaluation and Benchmarks
LLM benchmark results:
Other overviews of LLM benchmarks:
Summarization:
Question answering / explaining concepts / trivia questions:
- HC3 (explaining concepts / question answering): https://huggingface.co/datasets/Hello-SimpleAI/HC3
- TriviaQA
Multiple choice: