Development of an improved in-hospital mortality predictor based on SOFA

CLIF VERSION

2.1

Objective

The purpose of this project is to develop a more robust, statistically sound model for in-hospital mortality using the variables outlined by the Sequential Organ Failure Assessment (SOFA) score.

Required CLIF tables and fields

Please refer to the online CLIF data dictionary, ETL tools, and specific table contacts for more information on constructing the required tables and fields.

1. patient: patient_id, race_category, ethnicity_category, sex_category

2. hospitalization: patient_id, hospitalization_id, admission_dttm, discharge_dttm, age_at_admission, death_dttm

3. vitals: hospitalization_id, recorded_dttm, vital_value - vital_category == 'map'|'spo2'|'weight_kg'

4. labs: hospitalization_id, lab_result_dttm, lab_category, lab_value - lab_category == 'bilirubin_total'|'creatinine'|'platelet_count'|'pao2_arterial'

5. medication_admin_continuous: hospitalization_id, admin_dttm, med_name, med_category, med_dose, med_dose_unit - med_group == 'vasoactives'

6. respiratory_support: hospitalization_id, recorded_dttm, device_category, mode_category, tracheostomy, fio2_set, lpm_set, resp_rate_set, peep_set, resp_rate_obs

Cohort identification

The study population included all adults (age >= 18 years) that were admitted to the intensive care unit and identified as having been on life support for at least six hours. Life support was defined as receiving vasoactive medications, invasive or non-invasive mechanical ventilation, or high-flow/facemask oxygen therapy for hypoxic respiratory failure. In addition, the years selected are beginning of 2018 through the end of 2023.

Expected Results

A well-calibrated in-hospital mortality classification model that performs better than SOFA for critical resource allocation.

Detailed Instructions for running the project

1. Update `config/config.json`

Follow instructions in the config/README.md file for detailed configuration steps.

2. Set up the project environment

Run 00_renv_restore.R to set up the project environment. renv::init() in the command line also works.

3. Run code

Please read items 1-5 carefully. Run code in the following order:

#!/bin/bash

# This script runs the R scripts in the correct order.
Rscript code/0a_respiratory_support_waterfall.R
Rscript code/01_cohort_identification.R
Rscript code/02_feature_set_processing.R
Rscript code/03_table1.R
Rscript code/04_model_training.R

0a_respiratory_support_waterfall.R. This script runs Nick Ingraham's respiratory waterfall algorithm which will horizontally fill in various device categories. Requires lookup-table device_category_to_conversion.csv.
01_cohort_identification.R. Please remove the comment on line 98 in order to select the correct dates. This script creates the cohort dataframe. This script also outputs data needed to create a STROBE diagram. Make sure to specify the correct dates to select from clif_hospitalization.
02_feature_set_processing.R. This script creates the feature set needed to train/test the model.
03_table1.R. This script outputs data needed to create a Table 1. It also outputs a very basic STROBE diagram.
04_model_training.R This script trains the models and outputs the results.
05_training_visuals.R. This script is still a work in progress. It provides tables to create confusion matrices, calibration plots, and allocation efficiency plots for each training model object.

4. Exporting Results

Please read this next section carefully.

Script #	Output	Destination	Safe to Export?
`0a`	`clif_respiratory_support_processed.parquet`	output/intermediate	No
`01`	1. `inclusion_table.csv` 2. `sipa_clif_cohort.parquet`	1. output/exportable 2. output/intermediate	1. Yes 2. No
`02`	`sipa_features.parquet`	output/final	No
`03`	1. `strobe_diagram.png` 2. `table1.csv`	1. output/exportable 2. output/exportable	1. Yes 2. Yes
`04`	1. `best_model.rds` or `best_model.txt` 2. `.txt` and `.rds` files for every model object - Data in rds files is set to NULL before saving	1. output/exportable/models 2. output/exportable/models	1. Yes 2. Yes*

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
code		code
config		config
lookup-tables		lookup-tables
models		models
outlier-thresholds		outlier-thresholds
output		output
renv		renv
utils		utils
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
renv.lock		renv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Development of an improved in-hospital mortality predictor based on SOFA

CLIF VERSION

Objective

Required CLIF tables and fields

Cohort identification

Expected Results

Detailed Instructions for running the project

1. Update `config/config.json`

2. Set up the project environment

3. Run code

4. Exporting Results

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Development of an improved in-hospital mortality predictor based on SOFA

CLIF VERSION

Objective

Required CLIF tables and fields

Cohort identification

Expected Results

Detailed Instructions for running the project

1. Update config/config.json

2. Set up the project environment

3. Run code

4. Exporting Results

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

1. Update `config/config.json`

Packages