Tidy Data

Tidy datasets (, ) have the following structure:

A dataset is a collection of values (numbers, strings etc). Every value belongs to a variables and an observation.

Every variables contains all values that measure the same underlying attribute (e.g. some metric, score, temperature).

An observation contains all values measured on the same unit (e.g. person, day etc) across all attributes.

Some common patterns of messy (non-tidy) datasets:

Emacs 29.4 (Org mode 9.6.15)