Assessing the utility of natural language processing for detecting postoperative complications from free medical text

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

Assessing the utility of natural language processing for detecting postoperative complications from free medical text. / Dencker, Emilie Even; Bonde, Alexander; Troelsen, Anders; Sillesen, Martin.

In: BJS open, Vol. 8, No. 2, zrae020, 2024.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Dencker, EE, Bonde, A, Troelsen, A & Sillesen, M 2024, 'Assessing the utility of natural language processing for detecting postoperative complications from free medical text', BJS open, vol. 8, no. 2, zrae020. https://doi.org/10.1093/bjsopen/zrae020

APA

Dencker, E. E., Bonde, A., Troelsen, A., & Sillesen, M. (2024). Assessing the utility of natural language processing for detecting postoperative complications from free medical text. BJS open, 8(2), [zrae020]. https://doi.org/10.1093/bjsopen/zrae020

Vancouver

Dencker EE, Bonde A, Troelsen A, Sillesen M. Assessing the utility of natural language processing for detecting postoperative complications from free medical text. BJS open. 2024;8(2). zrae020. https://doi.org/10.1093/bjsopen/zrae020

Author

Dencker, Emilie Even ; Bonde, Alexander ; Troelsen, Anders ; Sillesen, Martin. / Assessing the utility of natural language processing for detecting postoperative complications from free medical text. In: BJS open. 2024 ; Vol. 8, No. 2.

Bibtex

@article{72ae541639a84ca8ae3e5721bd19c679,
title = "Assessing the utility of natural language processing for detecting postoperative complications from free medical text",
abstract = "Background: Postoperative complication rates are often assessed through administrative data, although this method has proven to be imprecise. Recently, new developments in natural language processing have shown promise in detecting specific phenotypes from free medical text. Using the clinical challenge of extracting four specific and frequently undercoded postoperative complications (pneumonia, urinary tract infection, sepsis, and septic shock), it was hypothesized that natural language processing would capture postoperative complications on a par with human-level curation from electronic health record free medical text. Methods: Electronic health record data were extracted for surgical cases (across 11 surgical sub-specialties) from 18 hospitals in the Capital and Zealand regions of Denmark that were performed between May 2016 and November 2021. The data set was split into training/validation/test sets (30.0%/48.0%/22.0%). Model performance was compared with administrative data and manual extraction of the test data set. Results: Data were obtained for 17 486 surgical cases. Natural language processing achieved a receiver operating characteristic area under the curve of 0.989 for urinary tract infection, 0.993 for pneumonia, 0.992 for sepsis, and 0.998 for septic shock, whereas administrative data achieved a receiver operating characteristic area under the curve of 0.595 for urinary tract infection, 0.624 for pneumonia, 0.571 for sepsis, and 0.625 for septic shock. Conclusion: The natural language processing approach was able to capture complications with acceptable performance, which was superior to administrative data. In addition, the model performance approached that of manual curation and thereby offers a potential pathway for complete real-time coverage of postoperative complications across surgical procedures based on natural language processing assessment of electronic health record free medical text.",
author = "Dencker, {Emilie Even} and Alexander Bonde and Anders Troelsen and Martin Sillesen",
note = "Publisher Copyright: {\textcopyright} The Author(s) 2024.",
year = "2024",
doi = "10.1093/bjsopen/zrae020",
language = "English",
volume = "8",
journal = "BJS open",
issn = "2474-9842",
publisher = "Wiley",
number = "2",

}

RIS

TY - JOUR

T1 - Assessing the utility of natural language processing for detecting postoperative complications from free medical text

AU - Dencker, Emilie Even

AU - Bonde, Alexander

AU - Troelsen, Anders

AU - Sillesen, Martin

N1 - Publisher Copyright: © The Author(s) 2024.

PY - 2024

Y1 - 2024

N2 - Background: Postoperative complication rates are often assessed through administrative data, although this method has proven to be imprecise. Recently, new developments in natural language processing have shown promise in detecting specific phenotypes from free medical text. Using the clinical challenge of extracting four specific and frequently undercoded postoperative complications (pneumonia, urinary tract infection, sepsis, and septic shock), it was hypothesized that natural language processing would capture postoperative complications on a par with human-level curation from electronic health record free medical text. Methods: Electronic health record data were extracted for surgical cases (across 11 surgical sub-specialties) from 18 hospitals in the Capital and Zealand regions of Denmark that were performed between May 2016 and November 2021. The data set was split into training/validation/test sets (30.0%/48.0%/22.0%). Model performance was compared with administrative data and manual extraction of the test data set. Results: Data were obtained for 17 486 surgical cases. Natural language processing achieved a receiver operating characteristic area under the curve of 0.989 for urinary tract infection, 0.993 for pneumonia, 0.992 for sepsis, and 0.998 for septic shock, whereas administrative data achieved a receiver operating characteristic area under the curve of 0.595 for urinary tract infection, 0.624 for pneumonia, 0.571 for sepsis, and 0.625 for septic shock. Conclusion: The natural language processing approach was able to capture complications with acceptable performance, which was superior to administrative data. In addition, the model performance approached that of manual curation and thereby offers a potential pathway for complete real-time coverage of postoperative complications across surgical procedures based on natural language processing assessment of electronic health record free medical text.

AB - Background: Postoperative complication rates are often assessed through administrative data, although this method has proven to be imprecise. Recently, new developments in natural language processing have shown promise in detecting specific phenotypes from free medical text. Using the clinical challenge of extracting four specific and frequently undercoded postoperative complications (pneumonia, urinary tract infection, sepsis, and septic shock), it was hypothesized that natural language processing would capture postoperative complications on a par with human-level curation from electronic health record free medical text. Methods: Electronic health record data were extracted for surgical cases (across 11 surgical sub-specialties) from 18 hospitals in the Capital and Zealand regions of Denmark that were performed between May 2016 and November 2021. The data set was split into training/validation/test sets (30.0%/48.0%/22.0%). Model performance was compared with administrative data and manual extraction of the test data set. Results: Data were obtained for 17 486 surgical cases. Natural language processing achieved a receiver operating characteristic area under the curve of 0.989 for urinary tract infection, 0.993 for pneumonia, 0.992 for sepsis, and 0.998 for septic shock, whereas administrative data achieved a receiver operating characteristic area under the curve of 0.595 for urinary tract infection, 0.624 for pneumonia, 0.571 for sepsis, and 0.625 for septic shock. Conclusion: The natural language processing approach was able to capture complications with acceptable performance, which was superior to administrative data. In addition, the model performance approached that of manual curation and thereby offers a potential pathway for complete real-time coverage of postoperative complications across surgical procedures based on natural language processing assessment of electronic health record free medical text.

U2 - 10.1093/bjsopen/zrae020

DO - 10.1093/bjsopen/zrae020

M3 - Journal article

C2 - 38593027

AN - SCOPUS:85190249824

VL - 8

JO - BJS open

JF - BJS open

SN - 2474-9842

IS - 2

M1 - zrae020

ER -

ID: 389507905