Project: Migrate data.gov data from Harvard Innovation Lab to HDV #407

@sbarbosadataverse

Description

Objective:

  • Move the 300K datasets rescued from data.gov by the Harvard Innovation Lab into a collection on HDV
    Requires a Harvard faculty owner

Data details:

Our data is here, with a readme. Let me know if there are parts of it we could expand:

https://source.coop/harvard-lil/gov-data

The metadata.csv.zip and metadata.jsonl.zip files have metadata about all of the datasets we collected.

We also now have a statically-hosted browser for the data, described here:

https://lil.law.harvard.edu/blog/2025/10/10/welcome-to-lil-s-data-gov-archive-search/

I’m adding our developer Chris Setzer as well. If other data formats would be helpful, we’d be happy to consider them.

I think we have a great deal of CDC data, but you would have to analyze the metadata files or use the hosted browser to check what’s in there.

Thanks,
Jack
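
For sizing, one way to check how much CDC data is in the collection is to scan the metadata file locally. The following is a minimal sketch, assuming metadata.jsonl.zip has already been downloaded from the source.coop repository and that its members are JSON Lines files with a `.jsonl` suffix. The record schema is not described in this issue, so the sketch matches search terms against each whole serialized record instead of a named field. The function name `count_matching_records` and the search terms are hypothetical.

```python
import io
import json
import zipfile
from collections import Counter


def count_matching_records(zip_path, terms):
    """Return (total_records, Counter of records containing each term).

    Assumption: every member ending in ".jsonl" holds one JSON object per line.
    Matching is case-insensitive and runs over the whole serialized record,
    because the field names in the metadata are not known here.
    """
    lowered = [term.lower() for term in terms]
    counts = Counter()
    total = 0
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            if not name.endswith(".jsonl"):
                continue  # skip directories and any non-JSONL members
            with archive.open(name) as raw:
                # Stream line by line so the file is never loaded whole.
                for line in io.TextIOWrapper(raw, encoding="utf-8"):
                    line = line.strip()
                    if not line:
                        continue
                    record = json.loads(line)
                    # Re-serialize so escaped characters are normalized.
                    haystack = json.dumps(record, ensure_ascii=False).lower()
                    total += 1
                    for term, low in zip(terms, lowered):
                        if low in haystack:
                            counts[term] += 1
    return total, counts
```

Calling `count_matching_records("metadata.jsonl.zip", ["CDC", "Centers for Disease Control"])` would return the total record count and a per-term match count. A plain substring match will overcount (for example, "CDC" inside an unrelated identifier), so the numbers are a rough estimate and should be confirmed against the hosted browser.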

Contact:

Metadata

Projects

Status

SPRINT- NEEDS SIZING

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests
