Metrics matrix creation logic from database #17

juaristi22 · 2025-07-29T12:16:28Z

Fix #20
Fix #13
Fix #14
Fix #18
Fix #1

This matrix creation will now integrate geography hierarchical information as long as the database contains ucgid_str as its variable and "in" as the constraint operation @baogorek

codecov · 2025-07-29T12:31:12Z

Codecov Report

❌ Patch coverage is 85.60606% with 19 lines in your changes missing coverage. Please review.
⚠️ Please upload report for BASE (main@37690e7). Learn more about missing BASE report.

Files with missing lines	Patch %	Lines
...engine_data/calibration/metrics_matrix_creation.py	85.49%	19 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##             main      #17   +/-   ##
=======================================
  Coverage        ?   42.85%           
=======================================
  Files           ?       10           
  Lines           ?      679           
  Branches        ?        0           
=======================================
  Hits            ?      291           
  Misses          ?      388           
  Partials        ?        0

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

baogorek

I won't call this a review yet, just some comments. The code feels very performant, which is great. I didn't fully go through all the logic. Some of my comments from the scaling PR will apply here too, so perhaps it would make sense to handle that one first and then return.

src/policyengine_data/calibration/metrics_matrix_creation.py

tests/test_matrix_creation.py

src/policyengine_data/calibration/metrics_matrix_creation.py

into maria/matrix_creation

…ng test)

docs/calibration.md

tests/test_normalise_keys.py

src/policyengine_data/calibration/dataset_duplication.py

src/policyengine_data/calibration/calibrate.py

baogorek

I'm really excited to be in this repo. Great stuff here, especially the functions for handling the database constraints.

As I'm trying to understand the code, I'm looking for one end-to-end example that incorporates the state stacking and uses it. I'm in the Jupyter notebook and perhaps getting close with:

sim = Microsimulation(dataset = national_level_calibrated_dataset)
np.unique(sim.calculate('ucgid_str').values, return_counts=True)

But I still see that California has more samples, and I thought that state stacking meant we'd sample the same amount from every state.

Might want to turn down the logger in the python notebook:

import logging

# Option 1: just lower that logger
logger = logging.getLogger("microcalibrate.calibration")
logger.setLevel(logging.ERROR)

Also, I think this was triggered by running the notebook code. Any idea what happened?

baogorek@T14-Gen6:~/devl/policyengine-data$ ls                                                                                                            
 Alabama_calibration.csv                        Iowa_calibration_sparse.csv             'North Dakota_calibration_sparse.csv'                             
 Alabama_calibration_sparse.csv                 Kansas_calibration.csv                   Ohio_calibration.csv                                             
 Alaska_calibration.csv                         Kansas_calibration_sparse.csv            Ohio_calibration_sparse.csv                                      
 Alaska_calibration_sparse.csv                  Kentucky_calibration.csv                 Oklahoma_calibration.csv                                         
 Arizona_calibration.csv                        Kentucky_calibration_sparse.csv          Oklahoma_calibration_sparse.csv                                  
 Arizona_calibration_sparse.csv                 LICENSE                                  Oregon_calibration.csv                                           
 Arkansas_calibration.csv                       Louisiana_calibration.csv                Oregon_calibration_sparse.csv                                    
 Arkansas_calibration_sparse.csv                Louisiana_calibration_sparse.csv         Pennsylvania_calibration.csv                                     
 California_calibration.csv                     Maine_calibration.csv                    Pennsylvania_calibration_sparse.csv                              
 California_calibration_sparse.csv              Maine_calibration_sparse.csv             pyproject.toml
 changelog_entry.yaml                           Makefile                                 README.md
 CHANGELOG.md                                   Maryland_calibration.csv                'Rhode Island_calibration.csv'
 changelog.yaml                                 Maryland_calibration_sparse.csv         'Rhode Island_calibration_sparse.csv'
 Colorado_calibration.csv                       Massachusetts_calibration.csv           'South Carolina_calibration.csv'
 Colorado_calibration_sparse.csv                Massachusetts_calibration_sparse.csv    'South Carolina_calibration_sparse.csv'
 Connecticut_calibration.csv                    Michigan_calibration.csv                'South Dakota_calibration.csv'
 Connecticut_calibration_sparse.csv             Michigan_calibration_sparse.csv         'South Dakota_calibration_sparse.csv'
 Dataset_stacked.h5                             Minnesota_calibration.csv                src
 Dataset_state_level.h5                         Minnesota_calibration_sparse.csv         Tennessee_calibration.csv
 Delaware_calibration.csv                       Mississippi_calibration.csv              Tennessee_calibration_sparse.csv
 Delaware_calibration_sparse.csv                Mississippi_calibration_sparse.csv       tests
'District of Columbia_calibration.csv'          Missouri_calibration.csv                 Texas_calibration.csv
'District of Columbia_calibration_sparse.csv'   Missouri_calibration_sparse.csv          Texas_calibration_sparse.csv
 docs                                           Montana_calibration.csv                 'United States_calibration.csv'
 download                                       Montana_calibration_sparse.csv           Untitled.ipynb
 Florida_calibration.csv                        Nebraska_calibration.csv                 Utah_calibration.csv
 Florida_calibration_sparse.csv                 Nebraska_calibration_sparse.csv          Utah_calibration_sparse.csv
 full_calibration.csv                           Nevada_calibration.csv                   uv.lock
 full_calibration_sparse.csv                    Nevada_calibration_sparse.csv            Vermont_calibration.csv
 Georgia_calibration.csv                       'New Hampshire_calibration.csv'           Vermont_calibration_sparse.csv
 Georgia_calibration_sparse.csv                'New Hampshire_calibration_sparse.csv'    Virginia_calibration.csv
 Hawaii_calibration.csv                        'New Jersey_calibration.csv'              Virginia_calibration_sparse.csv
 Hawaii_calibration_sparse.csv                 'New Jersey_calibration_sparse.csv'       Washington_calibration.csv
 Idaho_calibration.csv                         'New Mexico_calibration.csv'              Washington_calibration_sparse.csv
 Idaho_calibration_sparse.csv                  'New Mexico_calibration_sparse.csv'      'West Virginia_calibration.csv'
 Illinois_calibration.csv                      'New York_calibration.csv'               'West Virginia_calibration_sparse.csv'
 Illinois_calibration_sparse.csv               'New York_calibration_sparse.csv'         Wisconsin_calibration.csv
 Indiana_calibration.csv                       'North Carolina_calibration.csv'          Wisconsin_calibration_sparse.csv
 Indiana_calibration_sparse.csv                'North Carolina_calibration_sparse.csv'   Wyoming_calibration.csv
 Iowa_calibration.csv                          'North Dakota_calibration.csv'            Wyoming_calibration_sparse.csv

src/policyengine_data/calibration/dataset_duplication.py

tests/test_calibration/test_calibration.py

tests/test_calibration/test_dataset_duplication.py

baogorek · 2025-08-18T17:15:55Z

tests/test_calibration/test_dataset_duplication.py

+    ucgid_values = sim.calculate("ucgid").values
+    # The system returns enum names as strings, so compare with the name
+    assert all(val == california_ucgid.name for val in ucgid_values)
+


It just feels wierd to calculate ucgid and have the value be "CA" onscreen.

In [77]: sim.calculate('ucgid') Out[77]: value weight 0 CA 4709.080566 1 CA 4709.080566 2 CA 4709.080566

Agreed! I thought making ucgid an Enum was a great idea but maybe it simply introduces unnecessary complexity... maybe the alternative geography identifying variables should be simple strings instead of Enums?

Let's run this by Max tomorrow. I know he wanted us to move away from in and I imagine just a string will suffice. However, I'm currently using the Enum to build the initial hierarchy for the strata (which I thought was a nice source of truth), so I'll have to get it somewhere else if it's no longer an Enum.

juaristi22 · 2025-08-18T19:02:05Z

Any idea what happened?

Yep, microcalibrate's Calibration function has a parameter called csv_path which, when is not None, saves the calibration log to it. If regularization with L0 is enabled, it will save and additional _sparse.csv file. Because we iterate through every state, with 51, will save 102 csvs. I was doing this to explore how each state was calibrating with the dashboard, but I get it can be a lot. I disabled it by default so your directory doesnt clutter.

baogorek

The Jupyter notebook is working really nicely now. There are still a few outstanding issues that we need to figure out, but I see no reason to block the merge. Nice work!

tests/test_calibration/test_calibration.py

baogorek · 2025-08-20T03:25:10Z

tests/test_calibration/test_dataset_duplication.py

+    ucgid_values = sim.calculate("ucgid").values
+    # The system returns enum names as strings, so compare with the name
+    assert all(val == california_ucgid.name for val in ucgid_values)
+


Let's run this by Max tomorrow. I know he wanted us to move away from in and I imagine just a string will suffice. However, I'm currently using the Enum to build the initial hierarchy for the strata (which I thought was a nice source of truth), so I'll have to get it somewhere else if it's no longer an Enum.

juaristi22 added 3 commits July 29, 2025 14:15

matrix creation logic

26a9e5e

add sqlalchemy dependency

16a0f46

fix initialization

c030803

update "in" operation to check for string matches

f725fba

juaristi22 requested a review from baogorek July 29, 2025 17:07

baogorek reviewed Jul 29, 2025

View reviewed changes

src/policyengine_data/calibration/metrics_matrix_creation.py Outdated Show resolved Hide resolved

src/policyengine_data/calibration/metrics_matrix_creation.py Outdated Show resolved Hide resolved

tests/test_matrix_creation.py Outdated Show resolved Hide resolved

juaristi22 added 2 commits July 30, 2025 13:25

download database from huggingface

fff2e60

update test to also use db downloading logic

db5798e

juaristi22 requested a review from baogorek July 30, 2025 11:55

add stratum constraint filtering option

047c8fd

baogorek reviewed Aug 6, 2025

View reviewed changes

src/policyengine_data/calibration/metrics_matrix_creation.py Show resolved Hide resolved

juaristi22 added 18 commits August 6, 2025 16:19

Merge branch 'main' of https://github.com/PolicyEngine/policyengine-data

0fdf959

into maria/matrix_creation

adding note

ff23f25

update import path

8c3bf86

lint

aabee8a

Merge branch 'main' of https://github.com/PolicyEngine/policyengine-data

e8f37d2

into maria/matrix_creation

conversion between dataset classes

b183c6f

initial stab at state-level calibration logic

2506ee9

update key normalisation to take more than one start_index

5b8ab95

adding calibration function for all areas in a geography level (pendi…

24ba55e

…ng test)

debugged state-level calibration

00b38ac

state level calibration works except age mapping

75adba6

handled age entity mapping in constraint application

9cfce9d

fixed state-level calibration

f15c720

state and national calibration for age targets

db10495

fixing bug in microsims for when converting between dataset class types

803516f

Fix database True/False to be handled as bool instead of str

709eda0

more testing coverage

b108de3

add function for calibrating all geo levels at once

5ab75d7

create tests and document calibration

f9c5cfa

juaristi22 requested review from MaxGhenis, baogorek and nikhilwoodruff August 14, 2025 14:58

juaristi22 added 3 commits August 15, 2025 13:13

update database link to enable calibration in ci

96a2c99

update calibration test to use online database

2467e4f

add is_greater_than to be able to process snap

1611052

nikhilwoodruff reviewed Aug 18, 2025

View reviewed changes

docs/calibration.md Outdated Show resolved Hide resolved

tests/test_normalise_keys.py Outdated Show resolved Hide resolved

src/policyengine_data/calibration/dataset_duplication.py Show resolved Hide resolved

src/policyengine_data/calibration/calibrate.py Outdated Show resolved Hide resolved

juaristi22 added 3 commits August 18, 2025 13:36

update documentation

feaa022

remove -us Microsimulation dependencies

af9f536

update calibration docs with recent changes

c03c4ca

baogorek requested changes Aug 18, 2025

View reviewed changes

included target uprating

b1426b0

remove automatic saving of calibration log csvs

5480f94

juaristi22 requested a review from baogorek August 18, 2025 19:06

baogorek approved these changes Aug 20, 2025

View reviewed changes

juaristi22 merged commit ea40502 into main Aug 20, 2025
4 checks passed

Metrics matrix creation logic from database #17

Metrics matrix creation logic from database #17

Uh oh!

Conversation

juaristi22 commented Jul 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Jul 29, 2025

Codecov Report

Uh oh!

baogorek left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

baogorek left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

baogorek Aug 18, 2025

Choose a reason for hiding this comment

Uh oh!

juaristi22 Aug 18, 2025

Choose a reason for hiding this comment

Uh oh!

baogorek Aug 20, 2025

Choose a reason for hiding this comment

Uh oh!

juaristi22 commented Aug 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

baogorek left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

baogorek Aug 20, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

juaristi22 commented Jul 29, 2025 •

edited

Loading

juaristi22 commented Aug 18, 2025 •

edited

Loading