Current study materials

[2510.20817] KL-Regularized Reinforcement Learning is Designed to Mode Collapse

I Figured Out How to Engineer Emergence - by Erik Hoel

A Retrospective on Active Inference

Variational inference - Princeton cos597C 2011

Collective Intelligence with LLMs - by CIP

Datasets

The Actuary's Final Word - by Ben Recht - arg min

Severity: Strong vs Weak | Error Statistics Philosophy

Guillotine: Hypervisors for Isolating Malicious AIs - guillotine.pdf

Stephen Shenker: Chaos, Black Holes, and Quantum Mechanics - YouTube

Building and evaluating alignment auditing agents

[2502.01492] Develop AI Agents for System Engineering in Factorio

Common Elements of Frontier AI Safety Policies

[2507.20964] Core Safety Values for Provably Corrigible Agents

Introduction to deep learning with applications to stochastic control and games - YouTube

Learning the natural history of human disease with generative transformers | Nature

How to Make the Future Better: Concrete Actions for Flourishing

Claude Code: Behind-the-scenes of the master agent loop

Deriving Muon

  • core numerical methods derived from an exact theoretical principle
  • contrast with popular optimizers like Adam, which have more heuristic origins

David Ha’s early work: ōtoro.net

Molnar: From Frequencies to Coverage: Rethinking What “Representative” Means

Molnar: Don’t fix your imbalanced data

GPT-oss from the Ground Up - by Cameron R. Wolfe, Ph.D.

Gemma 3 270M: Can Tiny Models Learn New Tasks?

Neuronpedia

Building CERN for AI - An institutional blueprint - Centre for Future Generations

Process knowledge is crucial to economic development

The Artificiality of Alignment - by jessica dai - Reboot

[2503.05336v3] Toward an Evaluation Science for Generative AI Systems

Data Provenance Initiative

The Big LLM Architecture Comparison

FIIR

On the criteria to be used in decomposing systems into modules - 361598.361623.pdf

SciCode - SciCode Benchmark

Aurora GPT - Argonne National Lab

Evaluation Framework for AI Systems in "the Wild" | alphaXiv

Raft implemented in Go, Eli Bendersky

AI and the Everything in the Whole Wide World Benchmark

Tracing the thoughts of a large language model \ Anthropic

[2401.17173] Zero-Shot Reinforcement Learning via Function Encoders

Severe deviation in protein fold prediction by advanced AI: a case study | Scientific Reports

MouseGPT: A Large-scale Vision-Language Model for Mouse Behavior Analysis - 2025.03.27.645630v1.full.pdf

[2502.01706] Comply: Learning Sentences with Complex Weights inspired by Fruit Fly Olfaction

[2503.20511] From reductionism to realism: Holistic mathematical modelling for complex biological systems

Learning with not Enough Data Part 1: Semi-Supervised Learning | Lil'Log


Demystifying Chains, Trees, and Graphs of Thoughts

Sequential decision making - Kevin Murphy, DeepMind

Strategic Foundation Models - Large_Language_Models__Foundation_Models_and_Game_Theory___Research_Manifesto (16).pdf

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

International AI safety report - International_AI_Safety_Report_2025_accessible_f.pdf

7B Model and 8K Examples: Emerging Reasoning with Reinforcement Learning is Both Effective and Efficient

A Recipe for Training Neural Networks

[2309.16177] Navigating the Noise: Bringing Clarity to ML Parameterization Design with O(100) Ensembles


Evergreen re-reads

The Feynman Lectures on Physics

[2502.15657] Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path?

Physics of Language Models - Allen Zhu

Lecture Videos | Introduction to Algorithms | Electrical Engineering and Computer Science | MIT OpenCourseWare

Bertsekas - RL courses and book - mit.edu/~dimitrib/RLbook.html

Stanford CS336 | Language Modeling from Scratch

Marin

Introduction | RLHF Book by Nathan Lambert

A Little Bit of Reinforcement Learning from Human Feedback

Causal Artificial Intelligence Book

[1807.02811] A Tutorial on Bayesian Optimization

CSES - CSES Problem Set - Tasks

Statistical Significance, p-Values, and the Reporting of Uncertainty - imbens-2021-statistical-significance-p-values-and-the-reporting-of-uncertainty.pdf

There is only one model - by Jack Morris - Token for Token

Jared Kaplan: ContemporaryMLforPhysicists (pdf)

  • Starting on page 55, the Architectures section covers the structure and componentes of deep neural networks from a (mathematical and statistical) modeling perspective.

Tips for Empirical Alignment Research — AI Alignment Forum