Current study materials

(pinned section)

Frank, M. C., Braginsky, M., Cachia, J., Coles, N. A., Hardwicke, T. E., Hawkins, R. D., Mathur, M. B., & Williams, R. 2025. Experimentology: An Open Science Approach to Experimental Psychology Methods. Stanford University. https://doi.org/10.25936/3JP6-5M50. (Also published by MIT Press, ISBN 978-0-262-55256-1)

From Entropy to Epiplexity: Rethinking Information for Computationally Bounded Intelligence - 2601.03220v1.pdf

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization - 2601.05242v1.pdf

Categorization is baked into the brain, Barrett & Miller, 2026

Reasoning Models Generate Societies of Thought, Kim et al., Google (2026)

Embedded Universal Predictive Intelligence: a coherent framework for multi-agent learning - 2511.22226v1.pdf

the Google team’s github: paradigms-of-intelligence repositories

Improving the Effectiveness of Potential-Based Reward Shaping in Reinforcement Learning

[1802.06070] Diversity is All You Need: Learning Skills without a Reward Function

Fundamentals - build/cement understanding

Better gradient descent:

AI agents: collective intelligence, evolutionary frameworks, biology, physics, complex systems / emergent phenomena

Renormalization and effective lagrangians - ScienceDirect

Undecidability of the spectral gap | Nature

BICAN: A cell census of the developing human brain

Patterns, Predictions, and Actions (2025) - by Ben Recht

Active Inference: The Free Energy Principle in Mind, Brain, and Behavior | Books Gateway | MIT Press

Active Inference - book_9780262369978.pdf
concepts:
- predictive processing: brains act as approximately Bayesian prediction machines
- active inference: tools of state evaluation (sensors + processing) are the same tools that enable state change (agency / action)
naive interpretation:
- predictive loss depends on a real-world objective (reality)
- active inference depends on model-internal objective (agent goals)

[2510.20817] KL-Regularized Reinforcement Learning is Designed to Mode Collapse

I Figured Out How to Engineer Emergence - by Erik Hoel

[2510.02649] Engineering Emergence

A Retrospective on Active Inference

Variational inference - Princeton cos597C 2011

[2108.13083] An Introduction to Variational Inference

Hot take: The end of mech interp as we know it? A Pragmatic Vision for Interpretability — AI Alignment Forum

Collective Intelligence with LLMs - by CIP

The limits of falsifiability: Dimensionality, measurement thresholds, and the sub-Landauer domain in biological systems

[2502.01706] Comply: Learning Sentences with Complex Weights inspired by Fruit Fly Olfaction

Rethinking Generative AI and LLMs with Fruit Flies & Complex Math [Breakdowns]

[2503.20511] From reductionism to realism: Holistic mathematical modelling for complex biological systems

AI - mix

[2510.21890] The Principles of Diffusion Models

[2510.25781] A Practitioner's Guide to Kolmogorov-Arnold Networks

Datasets

CS281A 101425 - datasets - CS281A_101425_datasets.pdf

The Actuary's Final Word - by Ben Recht - arg min

The Actuary’s Final Word on Algorithmic Decision Making

Severity: Strong vs Weak | Error Statistics Philosophy

Stephen Shenker: Chaos, Black Holes, and Quantum Mechanics - YouTube

Building and evaluating alignment auditing agents

[2502.01492] Develop AI Agents for System Engineering in Factorio

Common Elements of Frontier AI Safety Policies

[2507.20964] Core Safety Values for Provably Corrigible Agents

Introduction to deep learning with applications to stochastic control and games - YouTube

Learning the natural history of human disease with generative transformers | Nature

Dawn of A New Era of Primary Prevention in Medicine

Claude Code: Behind-the-scenes of the master agent loop

Deriving Muon

core numerical methods derived from an exact theoretical principle
contrast with popular optimizers like Adam, which have more heuristic origins

David Ha’s early work: ōtoro.net

Molnar: From Frequencies to Coverage: Rethinking What “Representative” Means

Molnar: Don’t fix your imbalanced data

[lcamtuf: Minkowski dimension](How many dimensions is this? - lcamtuf’s thing)

GPT-oss from the Ground Up - by Cameron R. Wolfe, Ph.D.

Gemma 3 270M: Can Tiny Models Learn New Tasks?

Neuronpedia

Building CERN for AI - An institutional blueprint - Centre for Future Generations

Process knowledge is crucial to economic development

Data Provenance Initiative

github

The Big LLM Architecture Comparison

FIIR

On the criteria to be used in decomposing systems into modules - 361598.361623.pdf

SciCode - SciCode Benchmark

Aurora GPT - Argonne National Lab

[2401.17173] Zero-Shot Reinforcement Learning via Function Encoders

Severe deviation in protein fold prediction by advanced AI: a case study | Scientific Reports

MouseGPT: A Large-scale Vision-Language Model for Mouse Behavior Analysis - 2025.03.27.645630v1.full.pdf

Learning with not Enough Data Part 1: Semi-Supervised Learning | Lil'Log

Demystifying Chains, Trees, and Graphs of Thoughts

Sequential decision making - Kevin Murphy, DeepMind

Strategic Foundation Models - Large_Language_Models__Foundation_Models_and_Game_Theory___Research_Manifesto (16).pdf

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

7B Model and 8K Examples: Emerging Reasoning with Reinforcement Learning is Both Effective and Efficient

A Recipe for Training Neural Networks

[2309.16177] Navigating the Noise: Bringing Clarity to ML Parameterization Design with O(100) Ensembles

AI benchmarks and evals

Reading today's open-closed performance gap, Nathan Lambert

OII | Study identifies weaknesses in how AI systems are evaluated

AI and the Everything in the Whole Wide World Benchmark

Evaluation Framework for AI Systems in "the Wild" | alphaXiv

splattered with iffy claims but also some directionally-interesting commentary: AI is Hitting a Measurement Wall - by Devansh

AI safety & alignment

International AI safety report - International_AI_Safety_Report_2025_accessible_f.pdf

How to Make the Future Better: Concrete Actions for Flourishing

[2503.05336v3] Toward an Evaluation Science for Generative AI Systems

Guillotine: Hypervisors for Isolating Malicious AIs - guillotine.pdf

The Artificiality of Alignment - by jessica dai - Reboot

Mech interp

Tracing the thoughts of a large language model \ Anthropic

Sparsify: A mechanistic interpretability research agenda — AI Alignment Forum

Evergreen re-reads

The Feynman Lectures on Physics

[2502.15657] Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path?

Superintelligent Agents Pose Catastrophic Risks — Yoshua Bengio lecture - YouTube

Physics of Language Models - Allen Zhu

ICML 2024 Tutorial: Physics of Language Models - YouTube

Lecture Videos | Introduction to Algorithms | Electrical Engineering and Computer Science | MIT OpenCourseWare

Bertsekas - RL courses and book - mit.edu/~dimitrib/RLbook.html

Stanford CS336 | Language Modeling from Scratch

lecture videos (2025) - youtube

Marin

Introduction | RLHF Book by Nathan Lambert

A Little Bit of Reinforcement Learning from Human Feedback

Causal Artificial Intelligence Book

[1807.02811] A Tutorial on Bayesian Optimization

CSES - CSES Problem Set - Tasks

Statistical Significance, p-Values, and the Reporting of Uncertainty - imbens-2021-statistical-significance-p-values-and-the-reporting-of-uncertainty.pdf

There is only one model - by Jack Morris - Token for Token

Ilya’s talk on compression as intelligence

Jared Kaplan: ContemporaryMLforPhysicists (pdf)

Starting on page 55, the Architectures section covers the structure and componentes of deep neural networks from a (mathematical and statistical) modeling perspective.

Tips for Empirical Alignment Research — AI Alignment Forum

Compute

C is not a low-level language, David Chisnall

Raft implemented in Go, Eli Bendersky