Data catalog
Every dataset my courses use, catalogued like a proper data portal: what it is, where it came from, what license it carries, and which course uses it. Self-hosted files download straight from this site's storage; external entries are badged so you know a click leaves the site.
Bluebikes Stations (May 2026), self-hosted: every Bluebikes station, with name, coordinates, docks, and municipality. The join partner for the trips data.
A workable sample of Boston Bluebikes bike-share trips from September 2021 — start/end stations, timestamps, and rider type.
The live, continuously updated 311 dataset on Analyze Boston — for when the 2025 snapshot isn't enough.
A year of Bostonians asking the city for help — every 311 service request from 2025, with type, neighborhood, and resolution timestamps.
Energy and water use reported by Boston's large buildings under BERDO — sustainability data with policy teeth.
Long-run daily temperature observations — seasonality, smoothing, and long-term trends.
A synthetic stream of health-event records for the Epidemic Engine — the raw material for DS-551's ingestion and streaming pipelines.
Top 50 actors by total US box-office gross, with per-film averages and their biggest movie.
Birth weight, gestation, and maternal health for 1,174 mother-baby pairs from the Child Health and Development Studies — the classic causality-vs-correlation dataset.
Player name, team, position, and salary for the 2015–16 NBA season — histograms with a long right tail.
Eruption durations and waiting times for the Old Faithful geyser — the classic two-cluster scatter plot.
Housing violations, complaints, and inspections for Boston rental properties — civic data with real housing-justice questions in it.
Compensation for every San Francisco city employee in 2015 — job titles, salaries, overtime, and benefits.
Average SAT scores and participation rates by US state — the textbook example of a lurking variable.
Highest-grossing films with unadjusted and inflation-adjusted gross — a lesson about units hiding inside a fun dataset.
Departure delays for United flights out of SFO — thousands of rows for sampling and the law of averages.
Birth data for US presidents — a tiny table for early table operations and date arithmetic.
Name, city, height, and completion year for notable US skyscrapers — heights, eras, and city skylines in one table.
A 2026 snapshot of the world's billionaires — name, net worth, industry, and citizenship. Great for rankings, group-bys, and skeptical questions about wealth data.
Annual world population estimates — the simplest possible time series for first plots and growth rates.
Missing something from class? The catalog is curated — if a file you need isn't here, ask on Piazza and it'll get added.