Article ; Online: Extracting COVID-19 diagnoses and symptoms from clinical text: A new annotated corpus and neural event extraction framework.
Journal of biomedical informatics
2021 Volume 117, Page(s) 103761
Abstract: ... This work presents a new clinical corpus, referred to as the COVID-19 Annotated Clinical Text (CACT) Corpus ... which comprises 1,472 notes with detailed annotations characterizing COVID-19 diagnoses, testing, and clinical ... achieving high performance in identifying COVID-19 and symptom events with associated assertion values (0.83 ...
Abstract | Coronavirus disease 2019 (COVID-19) is a global pandemic. Although much has been learned about the novel coronavirus since its emergence, there are many open questions related to tracking its spread, describing symptomology, predicting the severity of infection, and forecasting healthcare utilization. Free-text clinical notes contain critical information for resolving these questions. Data-driven, automatic information extraction models are needed to use this text-encoded information in large-scale studies. This work presents a new clinical corpus, referred to as the COVID-19 Annotated Clinical Text (CACT) Corpus, which comprises 1,472 notes with detailed annotations characterizing COVID-19 diagnoses, testing, and clinical presentation. We introduce a span-based event extraction model that jointly extracts all annotated phenomena, achieving high performance in identifying COVID-19 and symptom events with associated assertion values (0.83-0.97 F1 for events and 0.73-0.79 F1 for assertions). Our span-based event extraction model outperforms an extractor built on MetaMapLite for the identification of symptoms with assertion values. In a secondary use application, we predicted COVID-19 test results using structured patient data (e.g. vital signs and laboratory results) and automatically extracted symptom information, to explore the clinical presentation of COVID-19. Automatically extracted symptoms improve COVID-19 prediction performance, beyond structured data alone. |
---|---|
MeSH term(s) | COVID-19/diagnosis ; Electronic Health Records ; Humans ; Information Storage and Retrieval ; Natural Language Processing ; Symptom Assessment |
Language | English |
Publishing date | 2021-03-26 |
Publishing country | United States |
Document type | Journal Article ; Research Support, N.I.H., Extramural ; Research Support, Non-U.S. Gov't |
ZDB-ID | 2057141-0 |
ISSN | 1532-0480 ; 1532-0464 |
ISSN (online) | 1532-0480 |
ISSN | 1532-0464 |
DOI | 10.1016/j.jbi.2021.103761 |
Database | MEDical Literature Analysis and Retrieval System OnLINE |
Full text online
More links
Kategorien
Order via subito
This service is chargeable due to the Delivery terms set by subito. Orders including an article and supplementary material will be classified as separate orders. In these cases, fees will be demanded for each order.