LIVIVO - The Search Portal for Life Sciences

zur deutschen Oberfläche wechseln
Advanced search

Search results

Result 1 - 1 of total 1

Search options

Article ; Online: Data pipelines in a public health emergency: The human in the machine.

Gaythorpe, Katy A M / Fitzjohn, Rich G / Hinsley, Wes / Imai, Natsuko / Knock, Edward S / Perez Guzman, Pablo N / Djaafara, Bimandra / Fraser, Keith / Baguelin, Marc / Ferguson, Neil M

Epidemics

2023  Volume 43, Page(s) 100676

Abstract: In an emergency epidemic response, data providers supply data on a best-faith effort to modellers and analysts who are typically the end user of data collected for other primary purposes such as to inform patient care. Thus, modellers who analyse ... ...

Abstract In an emergency epidemic response, data providers supply data on a best-faith effort to modellers and analysts who are typically the end user of data collected for other primary purposes such as to inform patient care. Thus, modellers who analyse secondary data have limited ability to influence what is captured. During an emergency response, models themselves are often under constant development and require both stability in their data inputs and flexibility to incorporate new inputs as novel data sources become available. This dynamic landscape is challenging to work with. Here we outline a data pipeline used in the ongoing COVID-19 response in the UK that aims to address these issues. A data pipeline is a sequence of steps to carry the raw data through to a processed and useable model input, along with the appropriate metadata and context. In ours, each data type had an individual processing report, designed to produce outputs that could be easily combined and used downstream. Automated checks were in-built and added as new pathologies emerged. These cleaned outputs were collated at different geographic levels to provide standardised datasets. Finally, a human validation step was an essential component of the analysis pathway and permitted more nuanced issues to be captured. This framework allowed the pipeline to grow in complexity and volume and facilitated the diverse range of modelling approaches employed by researchers. Additionally, every report or modelling output could be traced back to the specific data version that informed it ensuring reproducibility of results. Our approach has been used to facilitate fast-paced analysis and has evolved over time. Our framework and its aspirations are applicable to many settings beyond COVID-19 data, for example for other outbreaks such as Ebola, or where routine and regular analyses are required.
MeSH term(s) Humans ; COVID-19/epidemiology ; Public Health ; Reproducibility of Results ; Disease Outbreaks
Language English
Publishing date 2023-03-08
Publishing country Netherlands
Document type Journal Article ; Research Support, Non-U.S. Gov't
ZDB-ID 2467993-8
ISSN 1878-0067 ; 1755-4365
ISSN (online) 1878-0067
ISSN 1755-4365
DOI 10.1016/j.epidem.2023.100676
Database MEDical Literature Analysis and Retrieval System OnLINE

More links

Kategorien

To top