- Research
- Open access
- Published:
Creation and validation of a roadside rescue skills scale for training pre-hospital medical teams: the RoadRes-Q scale
Scandinavian Journal of Trauma, Resuscitation and Emergency Medicine volume 33, Article number: 56 (2025)
Abstract
Background
Road traffic injuries are the leading cause of death among young people worldwide. While advances in vehicle safety have reduced some of the risks, the speed and quality of pre-hospital care are critical to prevent fatalities. In France, patients are cared for by medical teams and firefighters who must work together as closely as possible to ensure the best possible survival rate. However, there is a lack of standardised scales to assess the performance of these multidisciplinary teams. This study aimed to create and validate a roadside rescue skills assessment scale, the RoadRes-Q scale, for healthcare teams.
Methods
We used a two-round Delphi method to develop the RoadRes-Q scale. A panel of 9 international roadside rescue experts, including 7 firefighters and 2 engineers in road rescue equipment, agreed to participate. The scale covers five key areas: healthcare provider protection, site securing, vehicle securing, first aid delivery, and patient extrication. The final version was tested during two one-day simulation-based training sessions, each involving 22 participants: 6 healthcare staff, 14 firefighters, and 2 simulated victims. Assessors completed the scale during and after each scenario, focusing on internal consistency and inter-observer reliability.
Results
The RoadRes-Q scale consists of 60 items. Internal consistency was excellent (Cronbach’s alpha of 0.86), indicating that items were non-redundant and consistently measured the required competencies. However, inter-observer reliability was low (intra-class correlation coefficient of 0.48), suggesting variability between assessors. Satisfaction among participants to the simulation-based training courses was high, and their knowledge increased.
Conclusions
The RoadRes-Q scale proved to be a valid and reliable scale for evaluating both technical and non-technical skills. While internal consistency was strong, improvements are needed in inter-observer reliability. Structured training for assessors and video-based assessments could enhance reproducibility. The RoadRes-Q scale has the potential for assessing the quality and safety of care provided by healthcare teams in roadside rescue situations.
Registration
As the study did not involve interventional research or patient participation, ethics committee approval was not required, but it received approval from the scientific referents of the Faculty of Medicine of Poitiers, and participants provided informed consent for using their anonymised data.
Introduction
Each year, approximately 1.2 million people lose their lives in road traffic accidents, making it the leading cause of death among children and young adults aged 5–29. In addition, between 20 and 50 million people sustain non-fatal injuries, many of which result in long-term disabilities. These incidents also lead to significant economic burdens, affecting individuals, their families, and nation as a whole. The costs stem from medical treatment and the loss of productivity, both from those killed or disabled and from family members who must take time off work or school to provide care. Road traffic accidents account for 3% of the gross domestic product in most countries, reflecting both direct costs, such as medical expenses and emergency response; and indirect costs, including loss of productivity and long-term disabilities [1].
Despite improvements in vehicle safety and road infrastructures, the speed and quality of pre-hospital care remain critical, particularly in preventing fatalities. About 50% of deaths occur within minutes of the accident, either at the scene or in route to the hospital; 15% of fatalities occur between one- and four-hours post-collision, and the remaining 35% occur more than four hours after the event [2]. This highlights the urgent need for rapid and efficient pre-hospital interventions to mitigate the consequences of trauma.
In France, emergency medical call centres (SAMU—one per county) and their mobile emergency and resuscitation units (SMUR) play a central role in treating the most critical patients outside the hospital. Established since 1956 by the Ministery of Health, each SMUR team traditionally includes an emergency physician, a nurse, and an ambulance driver [3]. The French “stay and play” model contrasts sharply with the “scoop and run” approach used by Anglo-Saxon paramedics, where the priority is to transport the patient quickly to the nearest hospital with minimal on-site care [4, 5]. In contrast, the French model emphasises comprehensive care at the scene, utilising advanced technical equipment to stabilise patients before transport.
In parallel, French firefighters are highly versatile, tasked with a range of responsibilities that include firefighting, environmental risk management and emergency personal assistance, the latter being their most frequent mission [6]. Firefighters are often the first responders at the scene of road accidents, and specialised technical teams are deployed for complex rescues, such as roadside extrications. These operations require specialised vehicles, equipment, and techniques to safely remove patients trapped in vehicles, underscoring the technical complexity and coordination required during such emergencies. Given the crucial roles of both emergency medical teams and firefighters in these situations, the need for a shared culture and standardised training is essential. A common framework would ensure that both medical and technical teams operate seamlessly together, optimising patient outcome. The success of such high-stakes interventions depends on clear communication, leadership, and the ability to work under pressure [7]. Poor coordination or communication breakdowns can delay critical care, leading to worsen patient outcome [8]. This makes it essential to not only provide thorough training, but also develop reliable ways to evaluate the performance of these multidisciplinary teams working together.
Training methods for emergency responders have traditionally focused on theoretical knowledge and practical skills. However, in recent years, simulation-based training has emerged as a key educational pathway, offering realistic, controlled environments where teams can practice handling real-world scenarios without risk to patients [9, 10]. Despite its many advantages, simulation-based training for medical teams in roadside rescue remains rare. Furthermore, the absence of a standardised assessment scale means that there is no reliable way to measure the effectiveness of this training. Developing such a scale is a great challenge, as it requires capturing both technical and non-technical skills, such as communication and leadership, which are crucial for successful rescues. It will enable educators to not only assess but also enhance the skills of rescue teams, ultimately improving patient outcome.
To address this complexity, this study aims to develop a valid and reliable assessment scale specifically designed for medical teams participating in roadside rescue scenarios. It serves as a teaching scale during multidisciplinary simulation training sessions, allowing for objective evaluation of both technical and non-technical skills such as teamwork, leadership, and decision-making under pressure.
Methods
Study design and setting
This study was conducted in collaboration with the firefighters of the Vienne County (Service Départemental d'Incendie et de Secours [SDIS] 86), recognised as a national benchmark in France for their expertise in managing road traffic accidents. Their advanced techniques in accident scene safety, vehicle security, first aid, and patient extrication have significantly influenced the development of national guidelines [11,12,13,14]. Since 2018, these firefighters have been working closely with the emergency medical team from the Poitiers University Hospital to develop a shared set of guidelines for roadside rescue operations. The primary aim of this collaboration was to enhance the skills of all rescue actors—physicians, nurses, ambulance drivers, and firefighters—by creating a unified framework that focuses on the roadside rescue environment. Although the framework does not cover trauma-specific medical care, it provides clear protocols for managing the scene, securing vehicles, coordinating rescue efforts, and safely extricating patients.
The guidelines, co-authored by two emergency physicians and two firefighters with specialised expertise in roadside rescue, were built on both practical experience and a review of the relevant literature. The reference manual they produced covers the full sequence of roadside rescue operations, including general rescue procedures, vehicle-specific considerations, scene and vehicle security, patient management, and extrication techniques. Additionally, the manual also provides details on beaconing methods, patient discharge protocols, and specific equipment descriptions to standardise practices across all rescue teams. Although this manual is not yet publicly available, it is currently being updated and formatted for wider dissemination, with plans for a future printed publication in both French and English.
The present study employed a two-round Delphi method to develop a scale for assessing the competencies of medical teams in roadside rescue scenarios, called the RoadRes-Q scale. We recruited a panel of international experts. The Delphi method aimed to ensure a consensus-driven approach to the content of the assessment scale, which was then validated for internal consistency and inter-rater reliability during two one-day multi-disciplinary road rescue simulation-training sessions.
Creation of the RoadRes-Q scale
Content drafting prior to the first Delphi round
The draft of the RoadRes-Q scale was grounded in a thorough review of existing guidelines and referential data on roadside rescue interventions. This process was structured using the Delphi method to ensure a systematic and consensus-driven approach [15, 16]. The initial framework of competencies and criteria was derived from established best practices in emergency and prehospital care, and organised into five main sections reflecting key operational domains:
-
Personal protection of responders: This section covered the essential actions and precautions to ensure the safety of the medical team at the scene.
-
Site security: This section explored criteria related to managing the safety of the overall scene, including controlling traffic flow, preventing additional hazards, and ensuring a secure working environment for all teams involved.
-
Securing vehicles: This category focused on actions necessary to stabilise vehicles involved in the accident, preventing further movement or risk to both the responders and the victims.
-
First-aid intervention: This section addressed the initial medical interventions, including patient triage, immediate care, inter-service communication, and ensuring continuity of care between different emergency services.
-
Patient extrication: This final category outlined the steps for safely extricating patients from vehicles, coordinating their transport, and ensuring their protection during the process.
Each of these sections was subdivided into specific, measurable actions that aligned with real-world expectations during roadside interventions. A core working group, consisting of three senior emergency physicians and two experienced firefighters, reviewed and refined this initial list of competencies to ensure its practical relevance. This preliminary version of the RoadRes-Q scale served as the foundation for the first round of the Delphi method, ensuring that all items presented to the expert panel were organised logically and represented the critical aspects of roadside rescue interventions.
Expert panel selection
The experts were selected based on their expertise in prehospital emergency care and road traffic rescue operations, from among the authors of leading articles, trauma recommendations, from trauma networks, international judges at road rescue competitions and engineers from companies developing road rescue equipment. Each expert had at least five years of experience in their respective fields. Invitations were sent via email, and participants were informed of the purpose and structure of the Delphi method, which required their input over multiple rounds. Their consent to take part in the study was clearly expressed in their reply email.
Delphi process
After the initial version of the RoadRes-Q scale was developed, it was sent to the panel of experts for review. Each expert was asked to rate the relevance of each item on a 7-point Likert scale, ranging from 0 ("not at all relevant") to 6 ("very relevant"). Additionally, a free-text field was provided for experts to offer comments or suggestions for improving the items and/or adding some ones. The median score for each item was calculated after the first round of responses. Items with a median score below 4 were eliminated from the scale. For items with median scores between 4 and 5, revisions were made based on the feedback received to improve their clarity and relevance, with the aim of making these items "highly relevant" to the experts. Items that received a median score of 6 were retained without any modifications. Following this, a second version of the RoadRes-Q scale was created and distributed to the experts who participated in the first round for further assessment. During this second round, the median score for each item was recalculated, and items that received a median score between 5 and 6 were retained in the final version.
Psychometric testing
Simulation training sessions
The RoadRes-Q scale was tested for two one-day sessions of multidisciplinary roadside rescue simulation training at the Vienne fire department training centre in Valdivienne, France. Prior to taking part, participants were given the road rescue guidelines and encouraged to consult them. Each one-day session involved 16 participants, comprising 14 firefighters and 2 simulated victims. Firefighters were split into four teams, with two first responders' ambulance teams and two roadside rescue teams. Each team was made up of 3 participants, which corresponds to the usual crew in real life. One rescue operations commander commanded both an ambulance team and a road rescue team. Six healthcare staffs were split into two SMUR teams, each consisting of an emergency physician, a nurse, and an ambulance driver. All participants were divided into two equal groups for each simulation scenario, i.e. one ambulance team, one roadside rescue team, one rescue operations commander, one SMUR team and one simulated victim. Six observers evaluated the medical team performance during the first one-day training session, and four the second one-day session. Each training day included two or three scenarios. Each scenario was carried out simultaneously and independently, allowing all participants to complete both scenarios. The day began with a reminder on key elements of the guidelines, including scene safety procedures, vehicle stabilisation techniques, initial patient assessment, coordinated team communication, and extrication protocols. This was followed by an introduction to specific rescue equipment, a hands-on airbag activation workshop, and the distribution of a cognitive aid sheet to assist with decision-making during the scenarios. The session continued with a pre-briefing led by a simulation training expert followed, introducing supervisors, outlining their roles, and reviewing the principles and benefits of simulation in improving patient care [17, 18]. The importance of learner engagement was emphasised, as were the limitations of simulation in fully replicating real-world situations. Participants were reminded that simulation offers a safe environment to practice and refine skills.
Each simulation session started with a briefing introducing the context, after which the teams began their operations. Scenarios ended when the simulated patient was safely extracted from the vehicle and stabilised for transport. If patient extraction took more than 40 min, a simulated cardiac arrest occurred, triggering an emergency extraction. Throughout the scenario, observers completed the RoadRes-Q scale to document the team's performance. Each simulation was followed by a 30-min debriefing session, using the after-action review (AAR) method, which is commonly employed by the US Army [19,20,21,22]. This structured debriefing process allowed participants to reflect on their actions, discuss areas for improvement, and consolidate their learning from the day.
RoadRes-Q scale application and validation
The RoadRes-Q scale was employed during each simulation session. Observers included three emergency physicians trained in simulation and debriefing, an academic educator and practitioner specialised in pediatrics care and simulation, a lieutenant from firefighter department with expertise in roadside rescue, a firefighter instructor with experience in first aid training, two ED nurses trained in simulation teaching, and two emergency medicine residents. Each observer received one hour's training during a preparatory meeting for the simulation days. These observers were responsible for completing the RoadRes-Q scale both during and immediately after each simulation scenario. They were all blinded to the evaluations of the others. The data collected through these assessments were used for the statistical analysis to study the validity and the reliability and consistency of the scale.
Each item was evaluated using a 3-point scale, where 0 represented “Not done”, 1 indicated “Done partially, unexpectedly, or inappropriately”, and 2 signified “Done correctly and on time”. For scenarios where an item could not be evaluated, a “Not Attributable” option was provided. This scoring system was based on the methodology of the Team Average Performance Assessment Scale (TAPAS) [23, 24]. Furthermore, key items in the RoadRes-Q scale highlighted specific actions that were expected when the medical team was the first to arrive at the scene of the accident, ensuring that the assessment reflected real-world expectations and operational priorities.
Evaluation of training sessions
Demographic criteria of the medical teams involved in this study were analysed based on several key variables, including gender, age, years of experience in the ED, the approximate number of roadside rescue interventions performed, and prior roadside rescue training. Age was categorised into four groups (25–30, 30–35, 35–40, and > 40 years). The number of years of experience in the ED was grouped into three categories (1–5 years, 6–10 years, and > 10 years). Participants reported the approximate number of roadside rescue interventions they had performed, categorised into 1–9, 10–20, and > 20 interventions, allowing for an evaluation of practical experience within the team. Finally, participants were asked whether they had previously attended a roadside rescue training program, with responses classified as “yes, in Poitiers”, “yes, in another centre”, or “no”.
The quality of the training sessions was evaluated using the first two levels of the Kirkpatrick's pyramid: learners' satisfaction and gain in Knowledge, Skills and Attitudes (KSA). Learners' satisfaction was evaluated using a Likert scale from 1 to 5 (1 = ”strongly disagree” to 5 = ”strongly agree”), which measured self-assessment participants for the interest, the organisation of the training, its theoretical and practical relevance, and their overall satisfaction. Gain in knowledge was assessed through a 23-item multiple-choice questionnaire (MCQ), which was based on the developed roadside rescue guidelines. Medical teams completed the questionnaire online before (pre-test) and after (post-test) the training sessions.
Clinical skills were assessed using the TAPAS tool. TAPAS was designed to provide a performance assessment score for a multiprofessional team’s approach to a simulated life-threatening situation, covering both medical and traumatic emergencies in adults. Its objective is to assess both quality and response time, based on three categories: not performed (0/2), performed but incorrectly or delayed (1/2), and correctly performed on time (2/2). TAPAS yields a score out of 100.
Teamwork performance is assessed using the Clinical Team Scale (CTS), a tool designed for evaluating team effectiveness in clinical scenarios, especially in high-stakes simulations. CTS measures critical aspects of teamwork—such as communication, coordination, leadership, and situational awareness, through a 10-point rating scale for each item, where 1 signifies poor performance and 10 represents exemplary performance. This scoring method provides a nuanced evaluation of team dynamics, with a higher cumulative score indicating a well-coordinated, communicative, and supportive team. The item concerning family information could not be scored in the scenarios, so the maximum score was 130.
Data analysis
Description of participants in simulation training sessions
Descriptive statistics such as number and percentages were calculated for each age group, measured using scale as described above.
Psychometric testing: validation of the RoadRes-Q scale
The validation of the RoadRes-Q scale focused on two key aspects: internal consistency and inter-observer reliability.
-
Internal consistency:
Internal consistency was analysed by calculating Cronbach's alpha coefficient (Cronbach LJ), which measures the degree to which items within the scale are correlated, thus assessing the reliability of the scale. According to Bujang et al. (2018), a Cronbach's alpha value greater than 0.5 is considered acceptable for exploratory research [25].
-
Inter-observer reliability:
Inter-observer reproducibility was assessed using different means:
-
Means and variances of each observer's score were calculated and compared with a T-Test and F-test respectively, which allowed for an assessment of whether the variability in scores was statistically significant. A p-value < 0.05 was considered indicative of statistical significance in all analyses.
-
Comparison of variances of the observers (F-Test)
-
Intra-Class Correlation coefficient (ICC). The ICC was selected to evaluate the agreement between different observers assessing the same medical team using the scale. ICC values were interpreted based on established guidelines: values below 0.5 indicate low reliability, values between 0.5 and 0.75 indicate moderate reliability, values between 0.75 and 0.90 indicate good reliability, and values above 0.90 indicate excellent reliability [26].
-
Linear regression analysis was performed to graphically represent the relationship between observer 1 and 2, as well as the calculation of R2, where a R2 above 0.5 indicates a strong reproducibility between observers.
-
Evaluation of training sessions
-
Learner satisfaction: Descriptive analysis, including means, were performed on learners' satisfaction, measured using a Likert scale as described above.
-
Knowledge improvement: The assessment of knowledge improvement was conducted through a 23-item MCQ administered both before and after the simulation training. A T-test was used to compare the pre-test and post-test scores of the medical teams, assessing whether there was a statistically significant increase in knowledge following the training. The difference between pre- and post-test scores was analysed to quantify the overall impact of the training. A p-value of < 0.05 was considered statistically significant, indicating meaningful knowledge acquisition between the pre- and post-test phases.
-
Clinical skills and teamwork performances: Descriptive analysis, including means and standard deviation, were performed on TAPAS and CTS.
Software used for data analysis
All statistical analyses were performed using R software (Biocore, Boston, Massachusetts, USA), stats, irr and lpSolve packages [27,28,29], ensuring a rigorous approach to data analysis and reliability testing.
Ethics
Since this study did not involve any interventional research or direct involvement of patients, approval from an ethics committee was not required. However, the study received approval from the scientific referents of the Faculty of Medicine, Poitiers, France. Additionally, all participants were informed about the study, and each provided consent for the use of anonymised data.
Results
RoadRes-Q scale development
A total of 22 experts from various countries were contacted in 2021, including professionals from France, Switzerland, Belgium, Canada, the Netherlands, Germany, Spain, Norway, Austria, and Italy. The panel consisted of 14 firefighters, six emergency physicians, and two engineers in road rescue equipment.
In the first round, 11 experts responded to the invitation, though two declined participation, and 11 did not respond to the initial email. The first version of the scale included 73 items. After the first round, 38 items remained unchanged, 33 items required reformulation bases on comments received and 2 items were deleted (71 items). In the second round, seven experts participated. After the second round, 11 items were removed, resulting in a final version of the RoadRes-Q scale with 60 items (Table 1). Experts' participation and development of the RoadRes-Q scale are summarised in Fig. 1.
Psychometric testing: validity and reliability analysis
The RoadRes-Q scale had excellent internal consistency with no redundancy between items, as assessed by a Cronbach’s alpha coefficient of 0.86.
Concerning reliability, several statistical analyses were performed. The T-test comparison of group means showed no significant difference between observers (p = 0.10). Accordingly, the F-test comparison of variances between observers showed no significance (p = 0.21).
The ICC calculation showed a weak concordance between 2 observers, with an ICC of 0.48. The R2 coefficient using linear regression showed a mediocre level of reliability between observers, with a value of 0.47.
Simulation sessions evaluation
Demographic data, satisfaction, knowledge, TAPAS and CTS could only be assessed on the first day of the simulation due to technical issues. A third of participants were under 35 years old, and half had less than 5 years’ experience in the ED. Most participants (89%) had previous roadside rescue training, and half had completed more than 20 interventions. Participant demographics are summarised in Table 2.
Satisfaction was high among participants, with all respondents rating improvements in inter-team relations as 5/5 and recommending the training to their colleagues. Overall satisfaction scores ranged from 8 to 10 out of 10, with an average of 9 across all participants. All the results about the satisfaction survey are described in the Table 3.
Knowledge improved after simulation, as reflected by differences between pre-test and post-test scores (4.2 ± 1.1 versus 12.0 ± 2.0, difference: 7.8 [IC 95%, 4.6–11.0], p = 0.0025).
The average TAPAS score for all observers across all scenarios was 67/100 ± 15. Similarly, the average CTS score was 97/130 ± 14.
Discussion
Main results
In this study, which involved nine international experts in a Delphi method, a 60-item scale was developed to assess the roadside rescue performance of medical teams. Subsequently, this evaluation scale was used during six roadside rescue scenarios, involving 12 healthcare professionals and 32 firefighters. This allowed for the validation process to be conducted, demonstrating very high validity and good reliability. Additionally, the evaluation of the simulation-based training process for roadside patient care showed high learner satisfaction, with improvements in knowledge and the acquisition of both technical and non-technical skills. To our knowledge, this is the first study addressing this topic by creating an evaluation tool and testing its psychometric properties during simulation sessions.
The RoadRes-Q scale development and its psychometric properties
Using a Delphi method involving nine international experts, we created a 60-item skills RoadRes-Q scale that covers all essential techniques and procedures for caring patients during roadside rescues. The scale was developed to comprehensively evaluate the critical competencies required in these high-stakes scenarios, ensuring both clinical and operational efficacy. The internal validity of the RoadRes-Q scale is supported by a Cronbach's alpha coefficient of 0.86, demonstrating excellent internal consistency and confirming that the items are non-redundant. However, the results related to inter-observer reproducibility were mixed. While comparison of mean scores across observers showed no significant differences, the ICC indicated weak agreement between observers. It would be interesting to explore whether certain types of observers, such as physicians versus firefighters, experienced greater difficulty in agreeing on specific criteria. This discrepancy may be attributed to the subjective nature of non-technical skills, such as leadership, teamwork, and communication, which are often more challenging to assess consistently across different observers [7].
The simulation-based training provided a realistic platform to apply the RoadRes-Q scale. Ensuring the quality of training is critical for the validation of the assessment scale, as it guarantees that the scenarios and learning conditions reflect real-world situations as accurately as possible. A well-structured, high-quality simulation provides a reliable platform for observers to apply the scale in a meaningful way. If the training itself is of poor quality, it could introduce bias into the assessment of team performance and obscure the true effectiveness of the scale. Therefore, the thorough assessment of both knowledge improvement and satisfaction ensures that the context in which the scale is tested is rigorous and conducive to the collection of valid and reliable data. The satisfaction levels, significant knowledge improvement, clinical skills and teamwork performances seen among participants reinforce the RoadRes-Q scale effectiveness in contributing to common operational culture among roadside rescue actors.
This present study aligns with previous research in emergency medicine that highlights the importance of robust evaluation scales. While the scientific literature is rich in studies focusing on creation of scores or validation of measurement scales in emergency settings [23, 30,31,32,33,34,35,36], assessment of roadside rescue skills remains relatively unexplored. In 2022, Nutbeam et al. developed a 91-item scale for best practices in handling entrapped victims using a Delphi method [37]. Although comprehensive, this scale lacks the structured, time-sensitive elements needed for use in simulation training, as it does not provide a mechanism for evaluating the real-time performance of rescue teams. The RoadRes-Q scale fills this gap by offering a structured evaluation method designed for use during simulations, allowing for real-time performance evaluation and feedback.
Limitations and strengths
This study has some limitations. Firstly, the response rate from experts involved in the Delphi process was relatively low, with only half of them responding to the initial solicitation. Additionally, the relatively low number of physicians involved in the development of the scale may have impacted its comprehensiveness, particularly regarding medical management. Despite this, the high internal consistency of the scale suggests that items included are non-redundant and effectively cover all the necessary skills. While the participating experts provided valuable input, a higher participation rate would have increased the diversity of perspectives and potentially strengthened the consensus. To improve expert engagement in future studies, it may be beneficial to implement regular reminders and possibly incentives to encourage broader participation. Secondly, although this is not an objective of the present study, the generalisation of results to other contexts could be questioned. The RoadRes-Q scale was assessed for use in the French prehospital care system, which follows a “stay and play” model of patient management. This model contrasts with the “scoop and run” approach used in many Anglo-Saxon countries, where the focus is on rapid transport to the hospital [4]. However, the RoadRes-Q scale was developed with the help of international experts, some of whom use paramedics and the “scoop and run” approach and we believe that this scale could also be suitable for this model. The scale focuses specifically on roadside rescue skills rather than broader medical competencies, making it adaptable to both French pre-hospital teams operating under the “stay and play” model and paramedics practicing the “scoop and run” approach. Finally, inter-observer agreement was weak, which could compromise the objectivity of the assessment. This can be attributed to the limited number of simulation scenarios used for validation, and the lack of dedicated training for observers in applying the scale.
This study has also several strengths. Firstly, the international multidisciplinary approach to developing the reference manual and the RoadRes-Q scale ensured input from a diverse range of experts, fostering a shared culture among roadside rescue stakeholders. This collaborative process is crucial in creating a common framework that aligns with the principle of “training together those who work together” through multi-professional simulation exercises. Secondly, by providing a standardised scale, it allows trainers to objectively assess both technical and non-technical performance, identify areas for improvement, and ultimately enhance the quality of simulation-based training. The high levels of participant satisfaction and the improvement in knowledge following the training suggest that the scale has great potential in fostering a shared operational culture between medical teams and firefighters, thereby improving collaboration and coordination in real-world rescue scenarios. Finally, although the RoadRes-Q scale was developed with input from expert firefighters and healthcare professionals, its applicability extends beyond this group. The simulation-based validation included professionals with varying levels of experience, demonstrating that the scale can be used both as an assessment tool and as a structured training framework to enhance the skills of less experienced health care professionals.
Use of RoadRes-Q scale and future research
By providing a structured method for performance assessment, the RoadRes-Q scale seeks to enhance the quality of training, foster a common operational culture, and ultimately improve patient outcomes during real-life emergencies. The RoadRes-Q scale was developed to address the significant gap in current training frameworks, where no standardised method exists to evaluate skills of emergency teams during high-pressure interventions.
Looking ahead, future research could focus on refining the scale to improve inter-observer reliability, particularly in the assessment of non-technical skills. To address this issue, several strategies could be implemented in future applications of the scale. First, standardised training for observers could significantly improve inter-rater agreement. Structured sessions focusing on the use of the RoadRes-Q scale, alongside calibration exercises involving real or recorded simulations, would help align evaluation criteria and reduce subjective variability. Additionally, video recordings of simulation sessions could be used for retrospective evaluation, allowing observers to review performances collectively and discuss discrepancies in scoring. This method, widely used in simulation research, has proven effective in enhancing inter-observer reliability by enabling shared reflection and calibration of assessments. Implementing these strategies could improve the consistency and objectivity of evaluations in future iterations of the scale [30, 31]. Additionally, further studies could explore the long-term impact of using this scale on actual rescue performance and patient outcome in real-world settings. Expanding the use of this scale across various geographical and cultural contexts would provide valuable insights into its broader applicability and effectiveness. Additional studies are needed to evaluate the applicability of this scale in different prehospital care systems, such as those found in the UK, the US, or other countries with varied emergency response practices.
Conclusion
The creation and validation of the RoadRes-Q scale mark a significant step forward in the training and evaluation of multidisciplinary emergency teams involved in roadside rescue operations. With strong internal consistency and high participant satisfaction, this scale offers a structured, objective method for assessing both technical and non-technical skills under high-pressure conditions. While improvements in inter-observer reliability are needed, particularly through enhanced evaluator training and the use of video-assisted assessments, the RoadRes-Q scale has the potential to become a vital tool in fostering a shared operational culture between medical teams—or paramedics—and firefighters. By standardising performance evaluation, it not only enhances the quality of simulation-based training but also holds the promise of improving real-world patient outcomes during road traffic accidents. Future research should focus on refining the scale reproducibility and exploring its broader applicability across diverse emergency response systems.
Availability of data and materials
The datasets generated and analysed during the current study are available from the corresponding author upon reasonable request. All relevant data have been anonymised to protect participant confidentiality. The assessment scale and related materials used in the study can also be made available for educational or research purposes, subject to approval by the research team.
Change history
01 May 2025
A Correction to this paper has been published: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13049-025-01389-0
Abbreviations
- AAR:
-
After action review
- CIC:
-
Clinical investigation center
- CHU:
-
Centre Hospitalier Universitaire (University Hospital)
- COS:
-
Commandant des Opérations de Secours (rescue operations commander)
- ICC:
-
Intraclass correlation coefficient
- INSERM:
-
Institut National de la Santé et de la Recherche Médicale (National Institute of Health and Medical Research)
- KSA:
-
Knowledge, skills and attitudes
- MCQ:
-
Multiple choice questionnaire
- SAMU:
-
Service d'Aide Médicale Urgente (emergency medical assistance service)
- SDIS:
-
Service Départemental d'Incendie et de Secours (Departmental Fire and Rescue Service)
- SMUR:
-
Service Mobile d'Urgence et de Réanimation (mobile emergency and resuscitation service)
- TAPAS:
-
Team average performance assessment scale
References
Road traffic injuries [Internet]. [cited oct 23th, 2024]. Available on: https://www.who.int/news-room/fact-sheets/detail/road-traffic-injuries
Bilan définitif de l’accidentalité routière en 2023 : baisse de la mortalité routière en France de - 4,3 % par rapport à 2022 | Sécurité Routière [Internet]. [cited oct 21th, 2024]. Available on: http://www.securite-routiere.gouv.fr/actualites-page-1-189/bilan-definitif-de-laccidentalite-routiere-en-2023-baisse-de-la-mortalite
Dick WF. Anglo-American vs. Franco-German emergency medical services system. Prehospital Disaster Med. 2003;18(1):29–35.
Guenezan J, Marjanovic N, Drugeon B, Mimoz O. Caring for severe trauma patients in France. A call for a national strategy? Anaesth Crit Care Pain Med. 2019;38(2):105–6.
Smith RM, Conn AK. Prehospital care—scoop and run or stay and play? Inj Int J Care Injured. 2009;40:23–6.
Jouanne E, Charron C, Chauvin C, Morel G. Correlates of team effectiveness: an exploratory study of firefighter’s operations during emergency situations. Appl Ergon. 2017;61:69–77.
Flin R. Identifying and training non-technical skills for teams in acute medicine. Qual Saf Health Care. 2004;13(1):80–4.
Dojmi Di Delupis F, Pisanelli P, Di Luccio G, Kennedy M, Tellini S, Nenci N, et al. Communication during handover in the pre-hospital/hospital interface in Italy: from evaluation to implementation of multidisciplinary training through high-fidelity simulation. Intern Emerg Med. 2014;9(5):575–82.
Kim J, Lee O. Effects of a simulation-based education program for nursing students responding to mass casualty incidents: a pre-post intervention study. Nurse Educ Today. 2020;85: 104297.
Wayne DB, Didwania A, Feinglass J, Fudala MJ, Barsuk JH, McGaghie WC. Simulation-based education improves quality of care during cardiac arrest team responses at an academic teaching hospital : a case-control study. Chest. 2008;133(1):56–61.
[U-EV]Tesla Model X | U-CAR ([U-EV] Tesla Model X Self-Collision and Fire Incident in the Country Highlights Issues of Emergency Rescue for Electric Vehicles and Public Information Transparency | U-CAR News) [Internet]. [cited nov 4th, 2024]. Available on: https://news.u-car.com.tw/news/article/71479
Pompiers.fr [Internet]. 2016 [cited nov 4th, 2024]. Guide opérationnel sur les interventions d’urgence sur les véhicules. Available on: https://www.pompiers.fr/actualites/guide-operationnel-sur-les-interventions-durgence-sur-les-vehicules
Disabling direct hazards in a vehicle crash – going beyond the high-volt systems | CTIF - International Association of Fire Services for Safer Citizens through Skilled Firefighters [Internet]. 2023 [cited nov 4th, 2024]. Available on: https://ctif.org/news/disabling-direct-hazards-vehicle-crash-what-does-it-mean-emergency-responders
Un C, Aydın K. Thermal runaway and fire suppression applications for different types of lithium ion batteries. Vehicles. 2021;3(3):480–97.
Varndell W, Fry M, Lutze M, Elliott D. Use of the Delphi method to generate guidance in emergency nursing practice: a systematic review. Int Emerg Nurs. 2021;56: 100867.
Keeney S, Hasson F, McKenna H. Consulting the oracle: ten lessons from using the Delphi technique in nursing research. J Adv Nurs. 2006;53(2):205–12.
Chopra V, Gesink BJ, De Jong J, Bovill JG, Spierdijk J, Brand R. Does training on an anaesthesia simulator lead to improvement in performance? Br J Anaesth. 1994;73(3):293–7.
Aebersold M, Tschannen D. Simulation in Nursing Practice: The Impact on Patient Care. Online J Issues Nurs [Internet]. 31 mai 2013 [cited oct 27th, 2024];18(2). Available on: https://ojin.nursingworld.org/table-of-contents/volume-18-2013/number-2-may-2013/simulation-in-nursing-practice/
Sawyer T, Eppich W, Brett-Fleegler M, Grant V, Cheng A. More than one way to debrief: a critical review of healthcare simulation debriefing methods. Sim Healthcare. 2016;11(3):209–17.
Stoto M. Measuring and assessing public health emergency preparedness. J Public Health Manag Pract. 2013;19(Supplement 2):S16-21.
World Health Organization. Guidance for after action review (AAR). 2019
Sauzeau M, Marjanovic N, Guignard L, Charre A, Contal P, Drugeon B, et al. Le débriefing par revue après action pour la simulation en santé : applications à la médecine d’urgence. Annales françaises de médecine d’urgence. 2022;12(3):143–51.
Oriot D, Bridier A. Development and assessment of an evaluation tool for team clinical performance: the team average performance assessment scale (TAPAS). Health Care: Curr Rev. 2016;4(2):1000164.
Everett TC, Ng E, Power D, Marsh C, Tolchard S, Shadrina A, et al. The Managing Emergencies in Paediatric Anaesthesia global rating scale is a reliable tool for simulation‐based assessment in pediatric anesthesia crisis management. Anderson B, éditeur. Pediatr Anesth. 2013;23(12):1117‑23.
Clinical Research Centre, Sarawak General Hospital, Ministry of Health, Sarawak, Malaysia, Bujang MA, Omar ED, Clinical Research Centre, Serdang Hospital, Ministry of Health, Selangor, Malaysia, Baharum NA, National Clinical Research Centre, Ministry of Health, Kuala Lumpur, Malaysia. A Review on Sample Size Determination for Cronbach’s Alpha Test: A Simple Guide for Researchers. MJMS. 2018;25(6):85‑99.
Koo TK, Li MY. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med. 2016;15(2):155–63.
R Core Team. R: A language and environment for statistical computing. [Internet]. Vienna, Austria: R Foundation for Statistical Computing; 2024. Available on: https://www.R-project.org/
Gamer M, Lemon J, Fellows I, Singh P. irr: Various Coefficients of Interrater Reliability and Agreement [Internet]. 2019. Available on: https://CRAN.R-project.org/package=irr
Berkelaar M. lpSolve: Interface to « Lp_solve » v. 5.5 to Solve Linear/Integer Programs [Internet]. 2024. Available on: https://CRAN.R-project.org/package=lpSolve
Donnelly PD, Lester CA, Morgan CL, Assar D. Evaluating CPR performance in basic life support: the VIDRAP protocol. Resuscitation. 1998;36:51–7.
Ghazali DA, Delaire C, Blottiaux E, Lardeur JY, Jost D, Violeau M, et al. Development of a performance assessment scale for simulated dispatcher-assisted cardiopulmonary resuscitation (Telephone-CPR): a multi-center randomized simulation-based clinical trial. Prehosp Disaster med. 2021;36(5):561–9.
Lester CA, Morgan CL, Donnelly PD, Assar D. Assessing with CARE: an innovative method of testing the approach and casualty assessment components of basic life support, using video recording. Resuscitation. 1997;34(1):43–9.
Ghazali DA, Thomas J, Deilhes E, Laland C, Thévenot S, Richer JP, et al. Design and validation of an anatomically based assessment scale for handwashing with alcohol-based hand rub. Infect Control Hosp Epidemiol. 2018;39(8):1000–2.
Ghazali A, Leger A. Development and validation of a performance assessment scale for chest tube insertion in traumatic pneumothorax. J Pulm Respir Med. 2016;6(3):1000346.
Ghazali DA, Ilha-Schuelter P, Barreyre L, Stephan O, Barbosa SS, Oriot D, et al. Development and validation of the first performance assessment scale for interdisciplinary chest tube insertion: a prospective multicenter study. Eur J Trauma Emerg Surg. 2022;48(5):4069–78.
Oriot D, Darrieux E, Boureau-Voultoury A, Ragot S, Scépi M. Validation of a performance assessment scale for simulated intraosseous access. Simul Healthc J Soc Simul Healthc. 2012;7(3):171–5.
Nutbeam T, Fenwick R, Smith JE, Dayson M, Carlin B, Wilson M, et al. A Delphi study of rescue and clinical subject matter experts on the extrication of patients following a motor vehicle collision. Scand J Trauma Resusc Emerg Med. 2022;30(1):41.
Acknowledgements
We would like to thank the Vienne Departmental Fire and Rescue Service for letting us use their equipment and training centre free of charge, as well as ABS lab for lending us simulation equipment. We would also like to thank the experts who took the time to respond to our request. We have called this group of experts the Road Rescue Study Group and it is made up of :—Jean Gabriel Clouet, Service de protection et sauvetage Lausanne, Lausanne, Switzerland—Tom Van Esbroeck, Center of expertise in Belgium, directorate general for civil security, Brandweerzone Centrum, Ghent, Belgium—Sébastien Guay, Institut de protection contre les incendies du Québec, Québec, Canada—Ruediger Knoll, Weber rescue systems, Güglingen, Germany—Julien Manesse, Service Départemental d'Incendie et de Secours 17, La Rochelle, France—Frans Van Maurik, RIFRA instructions, Netherlands—Bart Noyens, Belgian federal working group on technical rescue, Belgium—Raphael Mouth, Falck, Montigny le Bretonneux, France—Cédric Rigollet, Service Départemental d'Incendie et de Secours 51, Fagnieres, France
Funding
This study did not receive any external funding. All participants contributed on a voluntary basis during their free time. The meals provided during the two days of simulation were generously offered by the Vienne Departmental Fire and Rescue Service. No other financial support or resources were provided for the study.
Author information
Authors and Affiliations
Consortia
Contributions
This study was conceptualised and designed by BD, DO and OM. The data collection was carried out by KL and MC. Data analysis and interpretation were performed by NM. The initial draft of the manuscript was written by BD, with revisions contributed by DO, OM en NM. The manuscript was critically reviewed and revised by BD, OM and DO for important intellectual content. The study was supervised and coordinated by BD, PC and NM. Technical or material support was provided by AG, AM and MG. All authors have read and approved the final version of the manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Since this study did not involve any interventional research or direct involvement of patients, approval from an ethics committee was not required. However, the study received approval from the scientific referents of the Faculty of Medicine, Poitiers, France. Additionally, all participants were informed about the study, and each provided consent for the use of anonymised data.
Consent for publication
All the authors, the Roadside Rescue Study Group have given their permission for this article to be published. Participants have given their consent during the simulation sessions.
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The original version of this article has been revised: the authors reported that all given and family names had been mistakenly reversed.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Lavabre, K., Marjanovic, N., Oriot, D. et al. Creation and validation of a roadside rescue skills scale for training pre-hospital medical teams: the RoadRes-Q scale. Scand J Trauma Resusc Emerg Med 33, 56 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13049-025-01370-x
Received:
Accepted:
Published:
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13049-025-01370-x