Skip to main content
  • Original research
  • Open access
  • Published:

Simulating the methodological bias in the ATLS classification of hypovolemic shock: a critical reappraisal of the base deficit renaissance

Abstract

Background

The Advanced Trauma Life Support classification (ATLS) of hypovolemic shock is a widely used teaching and treatment reference in emergency medicine, but oversimplifies clinical reality. A decade ago, a landmark study compared vital parameters to base deficit (BD) in trauma patients. The investigators concluded that BD had higher accuracy to detect the need for early blood product administration. BD was subsequently introduced in the ATLS shock classification and has since been widely accepted as a laboratory standard for hypovolemia. The aim of this study is to investigate whether a methodological bias may have inadvertently contributed to the study’s results and interpretation.

Methods

In the current study, we replicate the original study by simulating a cohort of trauma patients with randomly generated data and applying the same methodological strategies. First, a predefined correlation between all predictor variables (vital parameters and BD) and outcome variable (transfusion) was set at 0.55. Then, in accordance with the methods of the original study we created a composite of ATLS parameters (highest class amongst heart rate, systolic blood pressure, and Glasgow Coma Scale) and compared it with BD for resulting transfusion quantity. Given the preset correlations between predictors and outcome, no predictor should exhibit a stronger association unless influenced by methodological bias.

Results

Applying the original imbalanced grouping and composite allocation strategies caused a systematic overestimation of shock class for traditional ATLS parameters, favoring the association between BD and transfusion. This effect persisted when the correlation between BD and transfusion was set substantially worse (rho = 0.3) than the correlation between ATLS parameters and transfusion (rho = 0.8).

Conclusions

In this fully reproducible simulation, we confirm the inadvertent presence of methodological bias. It is physiologically reasonable to include a metabolic parameter to classify hypovolemic shock, but more evidence is needed to support widespread and preferred use of BD.

Background

The Advanced Trauma Life Support (ATLS) shock classification is an international clinical reference standard and teaching instrument for hypovolemic shock severity [1]. Traditionally, the classification was based on a combination of vital signs including heart rate (HR), systolic blood pressure (SBP), and Glasgow Coma Scale (GCS). However, only 9.3% of trauma patients are in a single shock class based on their vital signs [2]. In an effort to improve and expand the ATLS classification, Mutschler and colleagues compared base deficit (BD) to traditional ATLS parameters. BD is a surrogate marker of metabolic acid–base status and may therefore reflect tissue hypoxemia in hypovolemic shock. Although previous studies described BD’s use as prognosticator and measure of resuscitative effort, this was the largest comparison with traditional vital parameters. In a landmark publication, the investigators concluded that BD may be superior for identification of transfusion requirements [3]. Consequently, BD was incorporated into the updated ATLS classification of hypovolemic shock.

A decade later, BD has been widely embraced as a singular laboratory standard for assessment of acute hypovolemia, permeating clinical practice in emergency medicine and critical care. At least 59% of surveyed anesthesia and critical care clinicians use BD to guide (intraoperative) fluid management [4]. This is an important development, as a point-of-care laboratory parameter is undoubtedly a valuable addition to a clinician’s diagnostic toolkit. However, BD’s widespread and preferred use prompts a closer examination of the available evidence.

In this study, we critically reappraise the “renaissance of BD” [3]. We hypothesize that there is a methodological bias in the head-to-head comparison between BD and traditional vital parameters caused by imbalanced grouping and composite allocation.

Methods

Context and aims

The original publication compared BD with traditional ATLS parameters (HR, SBP, and GCS) for prediction of transfusion quantity. The study population was an observational cohort of 16,305 multitrauma patients derived from a German trauma register. Four shock classes (I-IV) were defined with modifications and numeric interpretations for HR, SBP, GCS, and BD (Table 1) [1, 2, 6].

Table 1 Interpretation of the ATLS shock classification of hypovolemic shock by Mutschler and colleagues using modifications and numeric interpretations [1, 2, 6].

Patients were categorized in shock classes corresponding to their HR, SBP, GCS, and BD upon arrival at the emergency department. Then, a composite ATLS score with classes I through IV was created corresponding to the shock classes of HR, SBP, and GCS. However, the majority of patients could not be placed in a single shock class because the shock class assigned to each individual vital parameter was different in 90.7% of cases. Thus, the composite ATLS score value was assigned to the shock class corresponding to the highest shock class amongst traditional vital parameters.

Finally, the quantity of transfused blood products during admission was collected. The composite ATLS-derived shock class was compared with the BD-derived shock class for transfusion quantity. Patients classified in shock class II, III and IV by BD received significantly more transfusions than patients classified by the composite ATLS score. The investigators concluded that “BD-based classification displayed a higher accuracy for discriminating the need for early blood products than the current ATLS classification of hypovolemic shock”.

We hypothesize that there is a systematic bias embedded in the methodological approach. The subjective cut-off selection causes imbalanced data groups, whilst composite allocation to the highest shock class causes complementary overestimation of patients’ shock class. This weakens the association between composite ATLS score and administered blood products. To illustrate, a young patient with stress and pain due to fractured bones may have a HR of 145, but otherwise normal vitals. A patient with a moderate traumatic brain injury may have a GCS of 11, but otherwise normal vitals. The grouping and composite allocation strategies will place both patients in shock class IV, although neither is likely to receive transfusion products.

To validate this hypothesis, we replicate findings of the original study without gathering any patient data. Instead, we used a simple computer simulation with generated data and apply the same methodology. Data simulation allows regulation of correlations between variables, enabling a focused investigation of potential methodological bias independent of signals from patient data. If all predictors (HR, SBP, GCS, and BD) are preset at equal correlation with the outcome (transfusion), no individual predictor should demonstrate a stronger association with the outcome. Therefore, if applying the methodology from the original study results in one predictor showing a significantly stronger association, it would indicate the presence of bias in the methodology. All simulations were performed using Python (v3.8, Jupyter Notebook) language for computing with a statistical suite of libraries. We adhered to a framework for design and reporting of simulation studies where applicable [5].

Data simulation

We conducted a fully reproducible simulation study. First, we generated traditional ATLS vital parameters, BD, and transfusion outcome (units of packed red blood cells) in normal distributions based on average values and distributions as presented in the original cohort (supplemental material 1 and 2). Then, a copula was created to establish and regulate correlations of the joint multivariate distribution [7]. All correlations in the covariance matrix were initially set to 0.55 to simulate equal association between all variables. This correlation coefficient was selected as it represents the center of ‘moderate’ correlation and corresponds to a low number of patients in the same shock class [3, 8].

All distributions were truncated to physiological and realistic limits. For example, transfusion variable cannot be below zero and heart rate typically does not exceed 220. Moreover, GCS was rounded to integers and limited between 3 and 15. Its distribution was simulated trimodally with modes appearing at 3, 7, and 15, based on expected distributions from previous literature [9].

Traditional ATLS variables and BD were transformed into categorical shock class variables using the same thresholds as described in the original publication. The original publication classified a GCS of 15 into shock class I or II, corresponding to ‘normal’ or ‘mildly anxious’ mental status. We pragmatically, and randomly, redistributed half into class I and half into class II. Then, a composite ATLS score was created using the highest shock class allocation method from the original publication. As in the original publication, the BD and composite ATLS shock class were compared for transfusion quantity using a student’s t-test. Full code is available in supplemental material 3.

Finally, we changed the equivalent correlation to a weak correlation between BD and transfusion (rho = 0.3) and a strong correlation between HR, SBP, and GCS and transfusion (rho = 0.8) and repeated the same analysis. Moreover, to test whether the bias upholds independent of BD we compared HR shock class and highest allocation composite shock class of BD, SBP, and GCS using the same analysis with both correlation methods.

Results

Figure 1 shows distributions of simulated variables. Thirteen percent of patients were categorized in the same shock class for each traditional ATLS parameter. Supplemental material 4 shows the imbalanced distribution of cases across shock classes.

Fig. 1
figure 1

The distributions of simulated data. The y-axis represents value counts in the histograms. Transfusion in units packed red blood cells

Figure 2 shows the comparison of the composite ATLS score and BD for equal and unequal correlations. Note that BD seems to be superior for the identification of transfusion requirements even when its correlation with transfusion quantity is substantially lower. When comparing HR versus a composite of BD, SBP, and GCS, similar results occur in favor of HR (supplemental material 5).

Fig. 2
figure 2

The transfusion quantity as a function of composite ATLS parameters and BD with equal correlations A and unequal correlations B. The post-processing Spearman correlation coefficient in equal correlations between transfusion and HR, SBP, GCS, and BD was 0.526, − 0.527, − 0.511, and 0.529, respectively. The composite ATLS score contains HR, SBP, and GCS. The post-processing Spearman correlation coefficient in unequal correlations between transfusion and HR, SBP, GCS, and BD was 0.791, − 0.787, − 0.765, and 0.272, respectively

Discussion

In this simulation study we compare traditional ATLS vital parameters and BD for transfusion quantity using methodology described by a previous landmark publication. All variables were generated with an equal underlying correlation with transfusion quantity. First, despite these conditions, applying the original methodology produces a significant effect in favor of BD over traditional ATLS parameters, confirming the presence of a bias. Second, this effect persists when the correlation between transfusion and BD is set substantially lower than the correlation between transfusion and traditional ATLS parameters, emphasizing its magnitude. Last, the effect persists when comparing HR to a composite of BD, SBP, and GCS, indicating its independence from BD.

This study highlights how methodological choices may inadvertently influence results and conclusions of medical research. Allocating imbalanced data to higher composite shock classes leads to shock class overestimation and erroneous variable comparison. However, in clinical practice it is reasonable to be wary of outlier vital parameters in higher shock classes; initial overtreatment may be less hazardous than undertreatment.

Notably, despite being equivalent in root correlation, the categorized GCS group shows a consistent negative skew, whilst the other variables all show a positive skew. This can be attributed to grouping particularities. First, more than half of the original population has a GCS < 12, corresponding to shock class IV. Second, a GCS of 15 could be either class I or II depending on unspecified qualitative characteristics. Third, all parameters were measured at the emergency department, except for GCS when the patient was intubated at the scene, automatically classifying as shock class IV. These grouping-related factors consistently drive high shock class allocation for GCS without necessitating transfusion. Last, a low GCS in a patient with traumatic brain injury may not indicate ongoing systemic bleeding, potentially confounding the study results. Unfortunately, the original study did not clarify whether this specific subgroup of patients was excluded from the analysis.

Importantly, the relationship between BD and patient acuity is physiologically and epidemiologically sound. This study only identifies the methodological bias in the head-to-head comparison between BD and vital parameters in this specific landmark paper. Ultimately, resuscitation strategy should not be based on a singular ‘superior’ single-parameter, but rather on the integration of multiple parameters, context, repeated evaluation, and clinical judgement.

Limitations

This study contains artificially generated data based on pre-specified averages, distributions, and correlations. In addition, it was truncated and post-processed to resemble natural data, marginally affecting correlations. We have responsibly selected, and accounted for, pre-specified context, but unexpected post-processing bias may still impact results. Moreover, in this study 13.92% of patients were classified in a joint shock class, compared to 9.3% in the original publication. Since imbalanced data drives allocation bias, we suspect that methodological bias may be even more pronounced in the original data. Last, this study was performed to investigate potential methodological bias, but is unable to determine the true underlying correlations between traditional ATLS parameters, BD, and transfusion quantity. We therefore look forward to a secondary analysis of original data or new data to resolve these matters.

Conclusions

In this transparent and fully reproducible simulation we demonstrate the presence of a methodological bias in a landmark publication comparing ATLS parameters. The methodological bias causes favorable interpretation of BD’s value in shock classification compared to traditional parameters. Preferred use of BD over other vital parameters as a standard measure for acute hypovolemia cannot be justified using the analysis methods of the original study. Research with robust methods is needed to draw definitive conclusions on the “renaissance of BD”.

Availability of data and materials

The datasets generated and analyzed during the current study are available in the supplemental material section.

Abbreviations

ATLS:

Advanced trauma life support

BD:

Base deficit

GCS:

Glasgow coma scale

HR:

Heart rate

SBP:

Systolic blood pressure

References

  1. 10th Edition of the advanced trauma life support® (ATLS®) student course manual. Chicago (IL): American College of Surgeons; 2018.

  2. Mutschler M, Nienaber U, Brockamp T, Wafaisade A, Wyen H, Peiniger S, Paffrath T, Bouillon B, Maegele M, TraumaRegister DGU. A critical reappraisal of the ATLS classification of hypovolaemic shock: does it really reflect clinical reality? Resuscitation. 2013;84(3):309–13. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.resuscitation.2012.07.012.

    Article  CAS  PubMed  Google Scholar 

  3. Mutschler M, Nienaber U, Brockamp T, Wafaisade A, Fabian T, Paffrath T, Bouillon B, Maegele M, TraumaRegister DGU. Renaissance of base deficit for the initial assessment of trauma patients: a base deficit-based classification for hypovolemic shock developed on data from 16,305 patients derived from the TraumaRegister DGU®. Crit Care. 2013;17(2):R42. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/cc12555.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Heldeweg MLA, Stohlmann JAH, Loer SA. Base excess and lactate for guidance of peri-operative fluid management: a survey of anaesthetists, residents and intensive care physicians attending 2022 ESAIC in Milan. Eur J Anaesthesiol. 2023;40(8):610–2. https://doiorg.publicaciones.saludcastillayleon.es/10.1097/EJA.0000000000001850.

    Article  PubMed  Google Scholar 

  5. Burton A, Altman DG, Royston P, Holder RL. The design of simulation studies in medical statistics. Stat Med. 2006;25(24):4279–92. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/sim.2673.

    Article  PubMed  Google Scholar 

  6. Davis JW, Parks SN, Kaups KL, Gladen HE, O’Donnell-Nicol S. Admission base deficit predicts transfusion requirements and risk of complications. J Trauma. 1996;41(5):769–74. https://doiorg.publicaciones.saludcastillayleon.es/10.1097/00005373-199611000-00001.

    Article  CAS  PubMed  Google Scholar 

  7. D’Angelo GM, Weissfeld LA. Application of copulas to improve covariance estimation for partial least squares. Stat Med. 2013;32(4):685–96. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/sim.5533.

    Article  PubMed  Google Scholar 

  8. Schober P, Boer C, Schwarte LA. Correlation coefficients: appropriate use and Interpretation. Anesth Analg. 2018;126(5):1763–8. https://doiorg.publicaciones.saludcastillayleon.es/10.1213/ANE.0000000000002864.

    Article  PubMed  Google Scholar 

  9. Di Saverio S, Gambale G, Coccolini F, Catena F, Giorgini E, Ansaloni L, Amadori N, Coniglio C, Giugni A, Biscardi A, Magnone S, Filicori F, Cavallo P, Villani S, Cinquantini F, Annicchiarico M, Gordini G, Tugnoli G. Changes in the outcomes of severe trauma patients from 15-year experience in a Western European trauma ICU of Emilia Romagna region (1996–2010). A population cross-sectional survey study. Langenbecks Arch Surg. 2014;399(1):109–26. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00423-013-1143-9.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

Not applicable.

Funding

No funding was received for the conceptualization, design, data generation, analysis or preparation of this manuscript.

Author information

Authors and Affiliations

Authors

Contributions

MLAH, TTRH, JAHS, PJFJ, and RB were responsible for the conception of the work. MLAH, TTRH, LAS, and PS were responsible for the design of the work, as well as the generation and analysis of the data. MLAH was responsible for the first draft of the manuscript. Subsequently all authors provided critical revisions for until the final manuscript was completed. All authors read and approved the final manuscript and ensured that questions related to the accuracy or integrity of any part of the work were investigated and resolved.

Corresponding author

Correspondence to M. L. A. Heldeweg.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

None.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Heldeweg, M.L.A., Heldeweg, T.T.R., Stohlmann, J.A.H. et al. Simulating the methodological bias in the ATLS classification of hypovolemic shock: a critical reappraisal of the base deficit renaissance. Scand J Trauma Resusc Emerg Med 32, 104 (2024). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13049-024-01276-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13049-024-01276-0

Keywords