Assessment of non-technical skills: why aren’t we there yet? | BMJ Quality & Safety

Editorial

Assessment of non-technical skills: why aren’t we there yet?

Adam P Johnson,
Rajesh Aggarwal

Department of Surgery, Thomas Jefferson University Hospital, Philadelphia, Pennsylvania, USA

Correspondence to Dr Adam P Johnson, Department of Surgery, Thomas Jefferson University Hospital, Philadelphia, PA 19107, USA; adam.johnson{at}jefferson.edu

https://doi.org/10.1136/bmjqs-2018-008712

Knowledge and application of non-technical skills (NTS) may represent the greatest challenge facing medical education today. For centuries, medical education focused on developing individual clinical knowledge and technical skills. However, the modern complexities of healthcare delivery and the rapid expansion of medical knowledge necessitate a high-functioning team approach, which requires human factors engineering and NTS to operate effectively.

Other complex high-risk industries—like aviation, oil drilling, nuclear power and the military—have aligned their educational systems to match.1–3 While certain healthcare disciplines have developed frameworks to ensure the acquisition and maintenance of clinical and technical skills, no standard framework for NTS exists. With the increasing computational support for clinical and technical skills—decision aids, predictive algorithms, robotic surgery and image interpretation—the true added value of human clinicians may lie with their mastery of NTS.

Failure of NTS has been linked to poor quality and safety of care.2 A prospective observational study of 28 laparoscopic cholecystectomies found a strong correlation between surgical team situational awareness and fewer technical errors.4 In Japan, a 3-year retrospective review of fatal medical accidents submitted to a third-party safety organisation found roughly half to be due to failures of NTS, most often related to situational awareness, teamwork and decision-making.5 A review of malpractice claims found that more than one person was involved in 83% of errors, but that only 24% were directly attributable to communication breakdown, the only NTS specifically studied.6 A review of trauma and orthopaedic-related adverse events from the National Reporting and Learning System found many to be related to NTS: situational awareness (52%), communication/teamwork (21%), leadership (16%) and decision-making (12%).7 Prospective direct observation of 293 surgical procedures found a strong association between less effective teamwork behaviours (as measured by the Behavioural Marker Risk Index) and a higher risk of death or serious complication, even after controlling for American Society of Anesthesiologists risk category.8 A multi-institutional retrospective review within the Veterans Health Administration found that interventions to increase NTS were associated with reductions in perioperative mortality.9 Although these studies demonstrate a link between NTS and quality and safety, efforts to quantify the impact and compare across studies are limited by a lack of standard definitions of NTS and relevant outcomes. Advancing our understanding of NTS requires a more thoughtful and standardised framework.

Boet et al review assessment tools focused on team performance in crisis situations,10 and Higham et al review assessments of broader NTS domains across multiple healthcare disciplines.11 These rigorous and thorough reviews of the literature provide a snapshot of the contemporary science underpinning the measurement of NTS in healthcare and give researchers guidance for selecting the best tools available to measure NTS. However, the greatest lesson to take from these reviews may be the current gaps in the measurement of NTS and how to advance the field.

The first gap is the lack of a standardised definition of NTS and its subdomains. The framework used by Boet et al for teamwork-related NTS included a total of 14 subdomains, whereas Higham et al include four domains for all NTS.10 11 Each tool included in these reviews measures a unique subset of overlapping domains and often applies only to narrow contexts.2 In addition, each of these tools is based on external observation, which may not capture some domains effectively, such as situational awareness and decision-making. These may be better assessed through direct measurement via a mental task load index, eye-gaze patterns to assess attention and written examinations.2 Like the blind men and the elephant, each tool measures only a portion of NTS and so may mistake the elephant for a wall, spear, snake, tree, fan or rope.12 Generalising a tool beyond the context in which it was developed, combined with Goodhart’s Law (‘when a measure becomes a target, it ceases to be a good measure’), leaves policy changes based on any of the existing measurement tools open to potentially damaging unintended consequences.13

The studies cited by both reviews also exhibit heterogeneity in the psychometric principles used to assess each tool. The Boet et al framework for reliability and validity includes nine domains, whereas Higham and colleagues compare two domains of validity and treat reliability and usability as distinct single domains.10 11 Again, each tool is often assessed against only a subset of these psychometric test domains. Emphasising validity and reliability overlooks feasibility testing, which is critical for effective implementation. Assessing NTS may at first appear straightforward, but doing it properly requires thorough training of assessors.14

Finally, the included studies provide no benchmark for adequate performance in NTS. While NTS have been associated with improved quality and safety, the dose–response relationship remains poorly understood. Providing actionable feedback to clinicians requires comparison with a standard that establishes what counts as ‘good’ or ‘poor’ performance. Currently, each individual institution or researcher is left to interpret the results on their own. Without standard definitions for NTS, psychometric assessment inclusive of feasibility and benchmarks for determining competence, any system-wide intervention to increase assessment and training in NTS will likely fail to reach critical mass and acceptance.

An important next step is to greatly increase the data collected on the current state of NTS and their impact on patient outcomes. Automated real-time data collection of clinician interactions may provide better insight into the association between NTS and outcomes, and decrease the burden of assessment. In aviation, the cockpit black box records all communications between pilot and team, which are then rigorously analysed after adverse events. In healthcare, the medical record and clinician recollection of events are used as a proxy for this black box, which under-represents the nuances of decisions and clinical communications.15 Some surgeons are routinely capturing video to assess technical skill.16 17 A system to automatically collect intraoperative events, dubbed the OR Black Box, has been used to assess intraoperative distractions and adverse events but may also provide insight into NTS.18 Ongoing fears of loss of prestige and litigation stand in the way of routine recording of clinical interactions.2 Legal protection against the subpoena of clinical recordings is necessary.19

Once this robust data set is collected, we must establish a parsimonious set of domains—prioritised according to impact on patient outcomes, transferable across specialties and clinical domains and benchmarked for adequate performance. We must design assessment tools that prioritise domains of NTS most closely associated with poor outcomes. The emphasis should be on a small set of generalisable domains. Opportunities may then arise to automate data analysis through natural language processing and machine learning algorithms.20 21

The final step will be to link incentives to performance in NTS and to the ability to surpass the benchmark. In the UK and the USA, the General Medical Council and Accreditation Council for Graduate Medical Education outline general principles, but no specific standards for certification.22 23 The American College of Surgeons offers a curriculum geared towards surgical residents.24 Guidebooks are available but lack a broader systematic approach with external accountability for organisations and clinicians.25 Some countries have attempted to incorporate human factors and ergonomics into healthcare more systematically.26 Any system for accountability must provide a blame-free framework to support and remediate poor performers (both trainees and active clinicians) and thus prevent a failure to fail.27 Mature team simulation for NTS has already been initiated by malpractice insurers.28 We can look to the Crew Resource Management (CRM) framework for licensing and competence assurance from civil aviation, nuclear power, offshore oil drilling, mining, rail and emergency services.29 30 However, we must not forget the importance of aligning systems design with any training paradigm to foster the application of CRM in the workplace. Training alone, without system redesign with human factors in mind, will be insufficient to ensure appropriate quality and safety.3 31

The authors of both of these systematic reviews10 11 should be commended for their rigorous work to aggregate and compare a wide range of disparate instruments. They provide guidance on how to navigate the current literature on assessing NTS in healthcare. However, the strongest takeaway from their work may be recognising the vast amount of work yet to be done to quantify the impact of NTS in healthcare and standardise assessment. We need more robust data, a parsimonious set of NTS and a set of benchmarks and incentives to guide adoption among clinicians.

References

1. Flin R, O’Conner P. Safety at the sharp end: a guide to non-technical skills. Burlington, VT: CRC Press, 2008.
2. Flin R, Youngson G, Yule S. Enhancing surgical performance: a primer in non-technical skill. Boca Raton, FL: CRC Press, 2016.
3. Catchpole K. Spreading human factors expertise in healthcare: untangling the knots in people and systems. BMJ Qual Saf 2013;22:793–7. doi:10.1136/bmjqs-2013-002036
4. Mishra A, Catchpole K, Dale T, et al. The influence of non-technical performance on technical outcome in laparoscopic cholecystectomy. Surg Endosc 2008;22:68–73. doi:10.1007/s00464-007-9346-1
5. Uramatsu M, Fujisawa Y, Mizuno S, et al. Do failures in non-technical skills contribute to fatal medical accidents in Japan? A review of the 2010-2013 national accident reports. BMJ Open 2017;7:e013678–7. doi:10.1136/bmjopen-2016-013678
6. Rogers SO, Gawande AA, Kwaan M, et al. Analysis of surgical errors in closed malpractice claims at 4 liability insurers. Surgery 2006;140:25–33. doi:10.1016/j.surg.2006.01.008
7. Panesar SS, Carson-Stevens A, Mann BS, et al. Mortality as an indicator of patient safety in orthopaedics: lessons from qualitative analysis of a database of medical errors. BMC Musculoskelet Disord 2012;13. doi:10.1186/1471-2474-13-93
8. Mazzocco K, Petitti DB, Fong KT, et al. Surgical team behaviors and patient outcomes. Am J Surg 2009;197:678–85. doi:10.1016/j.amjsurg.2008.03.002
9. Neily J, Mills PD, Young-Xu Y, et al. Association between implementation of a medical team training program and surgical mortality. JAMA 2010;304:1693–700. doi:10.1001/jama.2010.1506
10. Boet S, Etherington N, Larrigan S, et al. Measuring the teamwork performance of teams in crisis situations: a systematic review of assessment tools and their measurement properties. BMJ Qual Saf 2019;28:327–37. doi:10.1136/bmjqs-2018-008260
11. Higham H. Observer-Based tools for the assessment of non-technical skills in simulated or real clinical environments in healthcare.
12. Saxe JG. The blind men and the elephant. McGraw-Hill, 1963.
13. Strathern M. ‘Improving ratings’: audit in the British University system. Eur Rev 1997;5:305–21. doi:10.1017/S1062798700002660
14. Hull L, Arora S, Symons NRA, et al. Training faculty in nontechnical skill assessment: national guidelines on program requirements. Ann Surg 2013;258:370–5. doi:10.1097/SLA.0b013e318279560b
15. Wong BM, Dyal S, Etchells EE, et al. Application of a trigger tool in near real time to inform quality improvement activities: a prospective study in a general medicine ward. BMJ Qual Saf 2015;24:272–81. doi:10.1136/bmjqs-2014-003432
16. Birkmeyer JD, Finks JF, O'Reilly A, et al. Surgical skill and complication rates after bariatric surgery. N Engl J Med 2013;369:1434–42. doi:10.1056/NEJMsa1300625
17. Bonrath EM, Gordon LE, Grantcharov TP. Characterising 'near miss' events in complex laparoscopic surgery through video analysis. BMJ Qual Saf 2015;24:516–21. doi:10.1136/bmjqs-2014-003816
18. Jung JJ, Jüni P, Lebovic G, et al. First-year analysis of the operating room black box study. Ann Surg 2018. doi:10.1097/SLA.0000000000002863
19. Lloyd A, Dewar A, Edgar S, et al. How to implement live video recording in the clinical environment: a practical guide for clinical services. Int J Clin Pract 2017;71:e12951–7. doi:10.1111/ijcp.12951
20. Deo RC. Machine learning in medicine. Circulation 2015;132:1920–30. doi:10.1161/CIRCULATIONAHA.115.001593
21. Hart Y, Czerniak E, Karnieli-Miller O, et al. Automated video analysis of Non-verbal communication in a medical setting. Front Psychol 2016;7. doi:10.3389/fpsyg.2016.01130
22. General Medical Council. Generic professional capabilities framework guidance on implementation for colleges and faculties 2017.
23. Accreditation Council for Graduate Medical Education, American Board of Surgery. The general surgery milestone project 2015.
24. American College of Surgeons. ACS/APDS Surgery Resident Skills Curriculum – Phase 3, 2018. Available: https://learning.facs.org/content/acsapds-surgery-resident-skills-curriculum-phase-3
25. Carthey J, Clarke J. Implementing human factors in healthcare, 2010. Available: http://www.patientsafetyfirst.nhs.uk/
26. National Quality Board. Human factors in healthcare a Concordat from the National quality board. Natl Qual Board 2013:1–22.
27. Dudek NL, Marks MB, Regehr G. Failure to fail: the perspectives of clinical supervisors. Acad Med 2005;80(10 Suppl):S84–S87. doi:10.1097/00001888-200510001-00023
28. Arriaga AF, Gawande AA, Raemer DB, et al. Pilot testing of a model for insurer-driven, large-scale multicenter simulation training for operating room teams. Ann Surg 2014;259:403–10. doi:10.1097/SLA.0000000000000342
29. Flin R. Safe in their hands? Non-technical skills and competence assessment, 2015. Available: https://ncsbn.org/2015_AM_RFlin.pdf
30. Okray R, Lubnau T. Crew resource management for the fire service, 2004. Available: https://www.fireengineering.com/articles/print/volume-154/issue-8/features/crew-resource-management-for-the-fire-service.html
31. Russ AL, Fairbanks RJ, Karsh B-T, et al. The science of human factors: separating fact from fiction. BMJ Qual Saf 2013;22:802–8. doi:10.1136/bmjqs-2012-001450

Footnotes

Funding: The authors have not declared a specific grant for this research from any funding agency in the public, commercial or not-for-profit sectors.
Competing interests: RA is a consultant for Applied Medical.
Patient consent for publication: Not required.
Provenance and peer review: Commissioned; internally peer reviewed.

Linked Articles

Systematic review
Observer-based tools for non-technical skills assessment in simulated and real clinical environments in healthcare: a systematic review

Helen Higham, Paul R Greig, John Rutherford, Laura Vincent, Duncan Young, Charles Vincent
BMJ Quality & Safety 2019;28:672–686. Published Online First: 25 May 2019. doi:10.1136/bmjqs-2018-008565