← Data catalog
Self-hosted — Download

Old Faithful Eruptions

Eruption durations and waiting times for the Old Faithful geyser — the classic two-cluster scatter plot.

Each row is one eruption of Old Faithful in Yellowstone National Park: duration of the eruption and the wait until the next one.

Small, famous, and perfect: the scatter plot has two visible clusters, the correlation is strong, and prediction via regression actually works. This is the dataset where prediction “clicks” for most students.

Used in

Course style guide rule: cite the dataset name, provider, and the date you accessed it — e.g. "Old Faithful Eruptions. Course data catalog, courses.langd0n.com. Accessed [date]." If the entry names an upstream source (a city portal, a public agency), cite that too.