Chapter 4 Introduction to CRASH-2
A key principle of IDA is not to touch the research questions, so the research aim and statistical analysis plan need to be in place before IDA commences. IDA may lead to an update or refinement of the analysis plan. To demonstrate the workflow and content of IDA, we created a hypothetical research aim and a corresponding statistical analysis plan, which are described in more detail in the section Crash2_SAP.Rmd.
The hypothetical research aim for IDA is to develop a multivariable model for early death (death within 28 days from injury) using nine independent variables of mixed type (continuous, categorical, semicontinuous), with the primary aim of prediction and a secondary aim of describing the association of each variable with the outcome.
A prediction model was developed and validated on this data set in “Predicting early death in patients with traumatic bleeding”, Perel et al., BMJ 2012, [supplement available at]. The assumed research aim is in line with that prediction model.
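The outcome definition above can be turned into a binary indicator. The following is a minimal sketch and not the authors' code. It assumes that the dates of randomisation and death are stored as `trandomised` and `ddeath` (the names used in the data set listings in Section 4.2), that time is counted in days from randomisation, and that "within 28 days" means at most 28 days. The toy values are invented. Coding patients without a recorded death date as 0 is also an assumption, although it is consistent with the listing in Section 4.2.2, where `earlydeath` has no missing values while `time2death` has 17121.

```r
# Toy records: column names follow the data set listing, values are invented.
toy <- data.frame(
  trandomised = as.Date(c("2008-01-10", "2008-02-01", "2008-03-15")),
  ddeath      = as.Date(c("2008-01-12", NA, "2008-05-01"))
)

# Assumption: time to death is counted in whole days from randomisation.
toy$time2death <- as.integer(toy$ddeath - toy$trandomised)

# Assumption: patients without a recorded death date count as
# "no early death" (0), so the indicator has no missing values.
toy$earlydeath <- as.integer(!is.na(toy$time2death) & toy$time2death <= 28L)

print(toy)
```

In this sketch the indicator is complete even though the time to death is missing for survivors, which is the pattern visible in the analysis data set.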
4.1 CRASH-2 Description
Clinical Randomisation of an Antifibrinolytic in Significant Haemorrhage (CRASH-2) was a large randomised, placebo-controlled trial of the effects of antifibrinolytic treatment on death and transfusion requirement among trauma patients with, or at risk of, significant haemorrhage. The study is described at the original trial website. A public version of the data set is available from a repository of public data sets hosted by Vanderbilt University’s Department of Biostatistics (Prof. Frank Harrell Jr.).
The data set includes 20,207 patients and 44 variables.
Note: In contrast to the analysis described in Perel et al., the public version of the data set does not include the variables describing the economic region and the treatment allocation. In addition, the public data set contains 20,207 patients, whereas the research paper reports that 20,127 patients were included in the study.
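The size of the discrepancy can be checked with a line of arithmetic. The snippet below is only a sketch, and its variable names are hypothetical. The difference of 80 happens to equal the number of missing values listed for `outcomeid` and several outcome variables in Section 4.2.1. Whether those records account for the discrepancy is an assumption that the source does not confirm.

```r
# Patient counts quoted in the note above.
n_public <- 20207L  # rows in the public data set
n_paper  <- 20127L  # patients reported in the research paper

# Number of records that are in the public file but not counted in the paper.
n_difference <- n_public - n_paper
print(n_difference)
```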
4.2 Crash2 dataset contents
4.2.1 Source dataset
We refer to the data set available online here as the source data set.
The contents of the source data set are displayed below. This data set is stored in the data-raw folder of the project directory.
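The listing in this section has the layout of the output of `contents()` from the Hmisc package, which appears in the session info in Section 4.3. As a rough illustration of what such a listing reports, the base R sketch below collects label, units, number of levels, class, storage mode and number of missing values for each variable. The helper `describe_contents()` and the toy data frame are hypothetical and only loosely mimic the real output. In the commented lines, the file name is an assumption.

```r
# Hypothetical helper: summarise variable-level metadata of a data frame,
# loosely mimicking the columns of the listing in this section.
describe_contents <- function(df) {
  get_attr <- function(x, which) {
    value <- attr(x, which, exact = TRUE)
    if (is.null(value)) "" else as.character(value)[1]
  }
  data.frame(
    Name    = names(df),
    Labels  = vapply(df, get_attr, character(1), which = "label"),
    Units   = vapply(df, get_attr, character(1), which = "units"),
    Levels  = vapply(df, function(x) if (is.factor(x)) nlevels(x) else NA_integer_,
                     integer(1)),
    Class   = vapply(df, function(x) class(x)[1], character(1)),
    Storage = vapply(df, storage.mode, character(1)),
    NAs     = vapply(df, function(x) sum(is.na(x)), integer(1)),
    row.names = NULL,
    stringsAsFactors = FALSE
  )
}

# Toy data with invented values; only sbp carries a label and units.
toy <- data.frame(
  age = c(34L, NA, 51L),
  sex = factor(c("male", "female", "male")),
  sbp = c(110L, 95L, NA)
)
attr(toy$sbp, "label") <- "Systolic Blood Pressure"
attr(toy$sbp, "units") <- "mmHg"

toy_contents <- describe_contents(toy)
print(toy_contents)

# With the real data and Hmisc installed, the call might look like this
# (the file name "crash2.rds" is an assumption):
# crash2 <- readRDS(here::here("data-raw", "crash2.rds"))
# Hmisc::contents(crash2)
```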
Data frame:crash2
20207 observations and 44 variables, maximum # NAs:17121
Name | Labels | Units | Levels | Class | Storage | NAs |
---|---|---|---|---|---|---|
entryid | Unique Numbers for Entry Forms | | | integer | integer | 0 |
source | Method of Transmission of Entry Form to CC | | 5 | | integer | 0 |
trandomised | Date of Randomization | | | Date | double | 0 |
outcomeid | Unique Number From Outcome Database | | | integer | integer | 80 |
sex | | | 2 | | integer | 1 |
age | | | | | integer | 4 |
injurytime | Hours Since Injury | | | numeric | double | 11 |
injurytype | | | 3 | | integer | 0 |
sbp | Systolic Blood Pressure | mmHg | | integer | integer | 320 |
rr | Respiratory Rate | /min | | integer | integer | 191 |
cc | Central Capillary Refille Time | s | | integer | integer | 611 |
hr | Heart Rate | /min | | integer | integer | 137 |
gcseye | Glasgow Coma Score Eye Opening | | | integer | integer | 732 |
gcsmotor | Glasgow Coma Score Motor Response | | | integer | integer | 732 |
gcsverbal | Glasgow Coma Score Verbal Response | | | integer | integer | 735 |
gcs | Glasgow Coma Score Total | | | integer | integer | 23 |
ddeath | Date of Death | | | Date | double | 17121 |
cause | Main Cause of Death | | 7 | | integer | 17118 |
scauseother | Description of Other Cause of Death | | 227 | | integer | 0 |
status | Status of Patient at Outcome if Alive | | 3 | | integer | 3169 |
ddischarge | Date of discharge, transfer to other hospital or day 28 from randomization | | | Date | double | 3185 |
condition | Condition of Patient at Outcome if Alive | | 5 | | integer | 3251 |
ndaysicu | Number of Days Spent in ICU | | | numeric | double | 182 |
bheadinj | Significant Head Injury | | | integer | integer | 80 |
bneuro | Neurosurgery Done | | | integer | integer | 80 |
bchest | Chest Surgery Done | | | integer | integer | 80 |
babdomen | Abdominal Surgery Done | | | integer | integer | 80 |
bpelvis | Pelvis Surgery Done | | | integer | integer | 80 |
bpe | Pulmonary Embolism | | | integer | integer | 80 |
bdvt | Deep Vein Thrombosis | | | integer | integer | 80 |
bstroke | Stroke | | | integer | integer | 80 |
bbleed | Surgery for Bleeding | | | integer | integer | 80 |
bmi | Myocardial Infarction | | | integer | integer | 80 |
bgi | Gastrointestinal Bleeding | | | integer | integer | 80 |
bloading | Complete Loading Dose of Trial Drug Given | | | integer | integer | 80 |
bmaint | Complete Maintenance Dose of Trial Drug Given | | | integer | integer | 80 |
btransf | Blood Products Transfusion | | | integer | integer | 80 |
ncell | Number of Units of Red Call Products Transfused | | | numeric | double | 9963 |
nplasma | Number of Units of Fresh Frozen Plasma Transfused | | | integer | integer | 9964 |
nplatelets | Number of Units of Platelets Transfused | | | integer | integer | 9964 |
ncryo | Number of Units of Cryoprecipitate Transfused | | | integer | integer | 9964 |
bvii | Recombinant Factor VIIa Given | | | integer | integer | 374 |
boxid | Treatment Box Number | | | integer | integer | 0 |
packnum | Treatment Pack Number | | | integer | integer | 0 |
Variable | Levels |
---|---|
source | telephone |
 | telephone entered manually |
 | electronic CRF by email |
 | paper CRF enteredd in electronic CRF |
 | electronic CRF |
sex | male |
 | female |
injurytype | blunt |
 | penetrating |
 | blunt and penetrating |
cause | bleeding |
 | head injury |
 | myocardial infarction |
 | stroke |
 | pulmonary embolism |
 | multi organ failure |
 | other |
scauseother | |
 | Acute Hypoxia |
 | ACUTE LUNG INJURY |
 | Acute Pulmonary Oedema |
 | Acute Renal Failure |
 | ACUTE RESPIRATORY DISTRESS SYNDROME (ARDS) |
 | acute respiratory failure |
 | acute respiratory failure+sepsis |
 | air amboli (embolism) |
 | Air embolism caused by penetrating lung trauma |
 | ... |
status | discharged |
 | still in hospital |
 | transferred to other hospital |
condition | no symptoms |
 | minor symptoms |
 | some restriction in lifestyle but independent |
 | dependent, but not requiring constant attention |
 | fully dependent, requiring attention day and night |
4.2.2 Updated analysis dataset
Additional metadata are added to the source data set, and the modified data set is written back to the data folder. Metadata are added for the following variables:
- age - add label “Age” and unit “years”.
- injury time - add unit “hours”.
- total Glasgow coma score - add unit “points”.
At this stage we also select the variables of interest to take into the IDA phase, dropping the variables that are not checked in IDA.
As a cross-check, we display the contents again to confirm that the additional metadata were added, and then write the changes to the data folder in the file “data/a_crash2.rds”.
Input object size: 1221480 bytes; 12 variables 20207 observations
New object size: 1223272 bytes; 12 variables 20207 observations
Input object size: 1546808 bytes; 14 variables 20207 observations
New object size: 1385720 bytes; 14 variables 20207 observations
Data frame:a_crash2
20207 observations and 14 variables, maximum # NAs:17121
Name | Labels | Units | Levels | Class | Storage | NAs |
---|---|---|---|---|---|---|
entryid | Unique Numbers for Entry Forms | | | integer | integer | 0 |
trandomised | Date of Randomization | | | Date | double | 0 |
ddeath | Date of Death | | | Date | double | 17121 |
age | Age | years | | integer | integer | 4 |
sex | Sex | | 2 | | integer | 1 |
sbp | Systolic Blood Pressure | mmHg | | integer | integer | 320 |
hr | Heart Rate | /min | | integer | integer | 137 |
rr | Respiratory Rate | /min | | integer | integer | 191 |
gcs | Glasgow Coma Score Total | points | | integer | integer | 23 |
cc | Central Capillary Refille Time | s | | integer | integer | 611 |
injurytime | Hours Since Injury | hours | | numeric | double | 11 |
injurytype | Injury type | | 3 | | integer | 0 |
time2death | | | | | integer | 17121 |
earlydeath | Death within 28 days from injury | | | integer | integer | 0 |
Variable | Levels |
---|---|
sex | male |
 | female |
injurytype | blunt |
 | penetrating |
 | blunt and penetrating |
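The steps of this section (adding metadata, selecting variables, and writing the result) can be sketched in base R as follows. This is an illustration under stated assumptions and not the book's code. The "Input object size" lines above resemble the messages printed by `upData()` from Hmisc, which the authors may have used, whereas this sketch stores labels and units as plain attributes. The toy data frame, its values, and the choice of `boxid` as the dropped variable are invented, and a temporary file stands in for “data/a_crash2.rds”.

```r
# Toy stand-in for the source data; values are invented.
crash2_toy <- data.frame(
  entryid    = 1:3,
  age        = c(34L, NA, 51L),
  injurytime = c(1.5, 3, 0.5),
  gcs        = c(15L, 9L, 13L),
  boxid      = c(101L, 102L, 103L)
)

# Add the metadata listed in this section as attributes.
attr(crash2_toy$age, "label")        <- "Age"
attr(crash2_toy$age, "units")        <- "years"
attr(crash2_toy$injurytime, "units") <- "hours"
attr(crash2_toy$gcs, "units")        <- "points"

# Keep only the variables carried into IDA; boxid stands in for a dropped one.
# List-style indexing keeps the column attributes intact.
keep_vars <- c("entryid", "age", "injurytime", "gcs")
a_crash2_toy <- crash2_toy[keep_vars]

# Write the result and read it back as a cross-check that metadata survive.
out_file <- tempfile(fileext = ".rds")  # stand-in for "data/a_crash2.rds"
saveRDS(a_crash2_toy, out_file)
reloaded <- readRDS(out_file)

print(attr(reloaded$age, "units"))
print(names(reloaded))
```

Saving to the RDS format rather than CSV is what lets the labels and units travel with the data, since attributes are preserved in the serialised object.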
4.3 Section session info
## R version 4.1.3 (2022-03-10)
## Platform: x86_64-w64-mingw32/x64 (64-bit)
## Running under: Windows 10 x64 (build 17763)
##
## Matrix products: default
##
## locale:
## [1] LC_COLLATE=English_Austria.1252 LC_CTYPE=English_Austria.1252
## [3] LC_MONETARY=English_Austria.1252 LC_NUMERIC=C
## [5] LC_TIME=English_Austria.1252
##
## attached base packages:
## [1] stats graphics grDevices utils datasets methods base
##
## other attached packages:
## [1] Hmisc_4.6-0 Formula_1.2-4 survival_3.2-13 lattice_0.20-45
## [5] forcats_0.5.1 stringr_1.4.0 dplyr_1.0.8 purrr_0.3.4
## [9] readr_2.1.2 tidyr_1.2.0 tibble_3.1.6 ggplot2_3.3.5
## [13] tidyverse_1.3.1 here_1.0.1
##
## loaded via a namespace (and not attached):
## [1] httr_1.4.2 sass_0.4.1 jsonlite_1.8.0
## [4] splines_4.1.3 modelr_0.1.8 bslib_0.3.1
## [7] assertthat_0.2.1 latticeExtra_0.6-29 cellranger_1.1.0
## [10] yaml_2.3.5 pillar_1.7.0 backports_1.4.1
## [13] glue_1.6.2 digest_0.6.29 checkmate_2.0.0
## [16] RColorBrewer_1.1-2 rvest_1.0.2 colorspace_2.0-3
## [19] htmltools_0.5.2 Matrix_1.4-0 pkgconfig_2.0.3
## [22] broom_0.7.12 haven_2.4.3 bookdown_0.25
## [25] scales_1.1.1 jpeg_0.1-9 tzdb_0.2.0
## [28] htmlTable_2.4.0 generics_0.1.2 ellipsis_0.3.2
## [31] withr_2.5.0 nnet_7.3-17 cli_3.2.0
## [34] magrittr_2.0.2 crayon_1.5.1 readxl_1.3.1
## [37] evaluate_0.15 fs_1.5.2 fansi_1.0.3
## [40] xml2_1.3.3 foreign_0.8-82 data.table_1.14.2
## [43] tools_4.1.3 hms_1.1.1 lifecycle_1.0.1
## [46] munsell_0.5.0 reprex_2.0.1 cluster_2.1.2
## [49] compiler_4.1.3 jquerylib_0.1.4 rlang_1.0.2
## [52] grid_4.1.3 rstudioapi_0.13 htmlwidgets_1.5.4
## [55] base64enc_0.1-3 rmarkdown_2.13 gtable_0.3.0
## [58] DBI_1.1.2 R6_2.5.1 gridExtra_2.3
## [61] lubridate_1.8.0 knitr_1.38 fastmap_1.1.0
## [64] utf8_1.2.2 rprojroot_2.0.2 stringi_1.7.6
## [67] Rcpp_1.0.8.3 vctrs_0.3.8 rpart_4.1.16
## [70] png_0.1-7 dbplyr_2.1.1 tidyselect_1.1.2
## [73] xfun_0.30