Data

Designing Data-Intensive Applications (DDIA) — an O’Reilly book by Martin Kleppmann (The Wild Boar Book)

CMU 15-445/645 :: Intro to Database Systems (Fall 2023; Andy Pavlo, Jignesh Patel)

Schemas

Schema.org

mlcommons/croissant: Croissant is a high-level format for machine learning datasets that brings together four rich layers.

Databases

Analyses

Database of Databases

PostgreSQL

What I Wish Someone Told Me About Postgres | ChallahScript

Privacy

Distributed Aggregation Protocol for Privacy Preserving Measurement

Divvi Up

Hestia Labs - Switzerland - the nice data co

Collections

Nomic Atlas

mfarre/fineVideo

Data, Lu Wang, CSE, University of Michigan

LittleSis - Profiling the powers that be

Art

The Metropolitan Museum of Art

Collection Online | Stedelijk Museum Amsterdam

Home | Musée Rodin

Enjoy the Museum from Home - Van Gogh Museum

About SMK Open | SMK – National Gallery of Denmark in Copenhagen (Statens Museum for Kunst)

AI

Data Vampires: Going Hyperscale (Episode 1) - Tech Won't Save Us | Podcast on Spotify

Data quality

Introducing the Data Measurements Tool: an Interactive Tool for Looking at Datasets

Know Your Data

Machine learning datasets

Hugging Face – The AI community building the future.

Machine Learning Datasets | Papers With Code

[2307.00682] Tools for Verifying Neural Models' Training Data

Data IRL

The Great Data Integration Schlep - by Sarah Constantin

Reflections on Palantir - Nabeel S. Qureshi

The “it” in AI models is the dataset. – Non_Interactive – Software & ML