Skip to content

Common-Longitudinal-ICU-data-Format/CLIF

Repository files navigation

Common Longitudinal ICU data Format (CLIF)

Official Website to the CLIF Consortium - CLIF Consortium

License

Maturity Level

CLIF Project Workflow Documentation

Introduction

Multicenter critical care research often relies on sharing sensitive patient data across sites, requiring complex data use agreements (DUAs) and yielding redundant work to account for diverse data infrastructures. Common data models (CDMs) like the Observational Medical Outcomes Partnership can allow for standardized analysis scripts, mitigating these challenges but requiring advanced data engineering skills and risking a loss of crucial granular clinical information. To overcome these barriers, we present the Common Longitudinal ICU Format (CLIF), designed specifically for observational studies of critically ill patients across multiple centres. Through CLIF, we aim to streamline data organization into a longitudinal format and establish standard vocabularies to facilitate standardized analysis scripts and improve data readability for researchers and clinicians.

The CLIF consortium, comprising critical care researchers from eight US academic health systems, collaboratively developed CLIF's schema, clinical vocabularies, and "proof of concept" datasets. CLIF’s tables emphasize care processes, clinical outcomes, and granular clinical physiology measures.

This README file contains detailed instructions on how to set up your heathcare system's EHR data into the Relational CLIF format. The repository also provides a detailed data dictionary for each table in the R-CLIF database, along with the required limited vocabulary defined by clinical experts in the consortium.

Relational CLIF

In an iterative and ongoing fashion, we developed CLIF's schema, contents, and limited clinical vocabularies through collective discussion to consensus. The consortium's broad use case of clinical research on critically ill patients focused the decision-making on care processes, outcomes, and granular measures of clinical physiology. The primary development outcomes were (1) an initial schema, (2) a limited vocabulary set for important clinical variables, and (3)"proof of concept" datasets to demonstrate usability and interoperability.

To develop a structured relational database, we initiated a comprehensive data collection and cleaning effort at the eight health systems. We designed CLIF as an encounter-centric relational database with a clinically determined limited vocabulary for vitals, laboratory names, medications, patient locations, and respiratory device names. By consensus, we determined that CLIF would prioritize (1) completeness of routine clinical data, (2) temporal granularity, and (3) consistently measured clinical outcomes. The entity-relationship diagram from relational CLIF is presented below as a human-readable and clinically meaningful flow of information. Tables are organized into clinically relevant column categories (demographics, objective measures, respiratory support, orders and inputs-outputs)

Updated ERD and data dictionary

CLIF table POCs

Each CLIF table has an assigned clinician who serves as the primary point of contact for any inquiries related to the schema or common data elements specific to that table.

Tables Point of Contact Email GitHub Username
adt Nicholas Ingraham, MD ingra107@umn.edu ingra107
clinical_notes_facts
clinical_notes_text
clinical_trial
code_status Nathan Mesfin, MD mesfin@umn.edu mesfi005
crrt_therapy William Parker, MD wparker@uchicago.edu 08wparker
ecmo_mcs Shan Guleria, MD shan.guleria@uchicagomedicine.org shanguleria
hospital_diagnosis
hospitalization Nicholas Ingraham, MD ingra107@umn.edu ingra107
intake_output
intermittent_dialysis Jay Koyner, MD jkoyner@uchicago.edu
invasive_hemodynamics
key_icu_orders
labs Catherine Gao, MD catherine.gao@northwestern.edu cloverbunny
medication_admin_continuous Chad Hochberg, MD chochbe1@jh.edu chochbe1
medication_admin_intermittent Anna Barker, MD, PhD baanna@med.umich.edu baanna23
medication_orders Anna Barker, MD, PhD baanna@med.umich.edu baanna23
microbiology_culture Kevin Buell, MBBS kevin.buell@uchicagomedicine.org kevingbuell
microbiology_nonculture Kevin Buell, MBBS kevin.buell@uchicagomedicine.org kevingbuell
microbiology_susceptibility Kevin Buell, MBBS kevin.buell@uchicagomedicine.org kevingbuell
output Shan Guleria, MD shan.guleria@uchicagomedicine.org shanguleria
patient Pat Lyons, MD lyonspa@ohsu.edu plyons
patient_assessments Snigdha Jain, MD snigdha.jain@yale.edu snigdhajainyale
patient_diagnosis
patient_procedures J.C. Rojas, MD juan_rojas@rush.edu sajor2000
place_based_index
position Chad Hochberg, MD chochbe1@jh.edu chochbe1
provider Nicholas Ingraham, MD ingra107@umn.edu ingra107
respiratory_support Nicholas Ingraham, MD ingra107@umn.edu ingra107
therapy_details William Parker, MD, PhD & Bhakti Patel, MD wparker@uchicago.edu; bpatel@bsd.uchicago.edu 08wparker
transfusion
validated_diagnosis
vitals Catherine Gao, MD catherine.gao@northwestern.edu cloverbunny

Data Architecture

One of CLIF's key contributions is an open-source web application for converting relational databases into longitudinal datasets, selecting study-specific variables, and generating code in multiple programming languages. This tool enables streamlined data processing and supports cross-center comparisons without requiring pooled patient-level data.

For more information about our data architecture and tools, please visit our website.

Current team members

Our team of clinicians, data scientists, and collaborators can be found on our website.

About

Documentation for the Common Longitudinal ICU data Format (CLIF) data standard for critical care illness research

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors