Generative AI

Table of contents

Large Language Models (LLMs)

Large Language Models (LLMs)

Models

| Model | Layers | Total Params | Active Params Per Token | Total Experts | Active Experts Per Token | Context Length | | | ------------ | ---------- | ---------------- | --------------------------- | ----------------- | ---------------------------- | ------------------ | --- | | gpt-oss-120b | 36 | 117B | 5.1B | 128 | 4 | 128k | | | gpt-oss-20b | 24 | 21B | 3.6B | 32 | 4 | 128k | |

Proprietary

Open-weights

Prompting

asgeirtj/system_prompt_leaks. Leaked system prompts.

Evaluation and Benchmarks

LLM benchmark results:

AI benchmarking hub (Epoch AI)

Other overviews of LLM benchmarks:

Summarization:

Question answering / explaining concepts / trivia questions:

Explaining concepts / question answering: https://huggingface.co/datasets/Hello-SimpleAI/HC3
TriviQA

Multiple choice:

sciq (HuggingFace
- Multiple choice questions on science
Measuring Massive Multitask Language Understanding (MMLU)
- Multiple choice questions across 57 different subjects, ranging from STEM to social sciences