01 — Data Acquisition & Exploratory Analysis

Ingest aggregated natural gas consumption by postal code, regional heating degree day (HDD) data, and Municipal Property Assessment Corporation (MPAC) property tax roll summaries. Filter to residential consumers and explore the relationship between gas consumption and HDD.

1.1 — Environment Setup

1.2 — Generate Synthetic Gas Consumption Data

Simulate 150 postal codes × 24 months of residential natural gas consumption (3,600 monthly records). Each postal code is assigned a true thermal slope (gas use per HDD), a baseload (gas use independent of temperature), and a customer count.
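A minimal sketch of how such a simulation could be set up, using only the standard library. The monthly HDD profile, the parameter ranges, the noise level, the units, and the names `MONTHLY_HDD` and `simulate_gas` are illustrative assumptions, not values taken from this notebook.

```python
import random

N_POSTAL_CODES = 150
N_MONTHS = 24

# Assumed regional HDD profile (Jan..Dec), repeated for the second year.
MONTHLY_HDD = [720, 640, 540, 330, 160, 40, 5, 10, 90, 280, 450, 650]


def simulate_gas(seed=42):
    """Return one record per postal code per month (hypothetical schema)."""
    rng = random.Random(seed)
    rows = []
    for i in range(N_POSTAL_CODES):
        postal_code = f"PC{i:03d}"              # placeholder identifier
        slope = rng.uniform(0.15, 0.45)         # assumed: m3 per HDD per customer
        baseload = rng.uniform(30.0, 80.0)      # assumed: m3 per month per customer
        customers = rng.randint(20, 400)        # assumed range of customer counts
        for month in range(N_MONTHS):
            hdd = MONTHLY_HDD[month % 12]
            per_customer = baseload + slope * hdd
            noise = rng.gauss(1.0, 0.05)        # 5% multiplicative noise (assumed)
            gas_m3 = max(0.0, customers * per_customer * noise)
            rows.append({
                "postal_code": postal_code,
                "month_index": month,
                "hdd": hdd,
                "customers": customers,
                "gas_m3": gas_m3,
                "true_slope": slope,
                "true_baseload": baseload,
            })
    return rows


rows = simulate_gas()
```

Keeping the true slope and baseload on each record makes it possible to check later how well a fitted regression recovers them.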

1.3 — Assemble Gas Consumption DataFrame

1.4 — Generate MPAC Property Tax Roll Summary

Synthetic building stock characteristics per postal code: footprint, storeys, structure type, basement indicator.
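One possible way to generate such a summary is sketched below. The structure type categories, value ranges, basement share, and field names are assumptions chosen for illustration; they do not reflect the actual MPAC schema.

```python
import random

# Assumed categories; the real roll may use different labels.
STRUCTURE_TYPES = ["detached", "semi-detached", "row", "low-rise apartment"]


def simulate_mpac(n_postal_codes=150, seed=7):
    """Return one synthetic building stock record per postal code."""
    rng = random.Random(seed)
    summary = []
    for i in range(n_postal_codes):
        summary.append({
            "postal_code": f"PC{i:03d}",                          # placeholder identifier
            "footprint_m2": round(rng.uniform(60.0, 220.0), 1),   # assumed range
            "storeys": rng.choice([1, 1.5, 2, 3]),                # assumed options
            "structure_type": rng.choice(STRUCTURE_TYPES),
            "has_basement": rng.random() < 0.8,                   # assumed 80% share
        })
    return summary


mpac = simulate_mpac()
```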

1.5 — Exploratory Analysis: Gas vs. HDD
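As a rough illustration of a first check on the gas-HDD relationship, the sketch below computes the Pearson correlation between monthly HDD and gas use for a single postal code. The helper `pearson_r` and the sample values are hypothetical; the gas series is constructed to be exactly linear in HDD, so it only demonstrates the calculation.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant series")
    return sxy / sqrt(sxx * syy)


# Illustrative values only: one year of HDD and a noiseless linear gas response.
hdd = [720, 640, 540, 330, 160, 40, 5, 10, 90, 280, 450, 650]
gas = [60.0 + 0.3 * h for h in hdd]

r = pearson_r(hdd, gas)
```

With noisy consumption data the coefficient falls below 1, and postal codes with a weak correlation may be worth inspecting before any regression is fitted.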

1.6 — Distribution of Customer Counts and Building Types

1.7 — Monthly HDD Pattern

1.8 — Save Intermediate Data


Next: Notebook 02 runs the HDD regression per postal code, normalizes by building stock, and produces the thermal intensity metric.