LIVIVO - The Search Portal for Life Sciences

zur deutschen Oberfläche wechseln
Advanced search

Search results

Result 1 - 10 of total 16

Search options

  1. Article ; Online: PredictPTB

    Rawan AlSaad / Qutaibah Malluhi / Sabri Boughorbel

    BioData Mining, Vol 15, Iss 1, Pp 1-

    an interpretable preterm birth prediction model using attention-based recurrent neural networks

    2022  Volume 23

    Abstract: Abstract Background Early identification of pregnant women at risk for preterm birth (PTB), a major cause of infant mortality and morbidity, has a significant potential to improve prenatal care. However, we lack effective predictive models which can ... ...

    Abstract Abstract Background Early identification of pregnant women at risk for preterm birth (PTB), a major cause of infant mortality and morbidity, has a significant potential to improve prenatal care. However, we lack effective predictive models which can accurately forecast PTB and complement these predictions with appropriate interpretations for clinicians. In this work, we introduce a clinical prediction model (PredictPTB) which combines variables (medical codes) readily accessible through electronic health record (EHR) to accurately predict the risk of preterm birth at 1, 3, 6, and 9 months prior to delivery. Methods The architecture of PredictPTB employs recurrent neural networks (RNNs) to model the longitudinal patient’s EHR visits and exploits a single code-level attention mechanism to improve the predictive performance, while providing temporal code-level and visit-level explanations for the prediction results. We compare the performance of different combinations of prediction time-points, data modalities, and data windows. We also present a case-study of our model’s interpretability illustrating how clinicians can gain some transparency into the predictions. Results Leveraging a large cohort of 222,436 deliveries, comprising a total of 27,100 unique clinical concepts, our model was able to predict preterm birth with an ROC-AUC of 0.82, 0.79, 0.78, and PR-AUC of 0.40, 0.31, 0.24, at 1, 3, and 6 months prior to delivery, respectively. Results also confirm that observational data modalities (such as diagnoses) are more predictive for preterm birth than interventional data modalities (e.g., medications and procedures). Conclusions Our results demonstrate that PredictPTB can be utilized to achieve accurate and scalable predictions for preterm birth, complemented by explanations that directly highlight evidence in the patient’s EHR timeline.
    Keywords Deep learning ; Predictive models ; Attention mechanism ; Electronic health record ; Preterm birth ; Pregnancy ; Computer applications to medicine. Medical informatics ; R858-859.7 ; Analysis ; QA299.6-433
    Subject code 610
    Language English
    Publishing date 2022-02-01T00:00:00Z
    Publisher BMC
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  2. Article ; Online: Predicting emergency department utilization among children with asthma using deep learning models

    Rawan AlSaad / Qutaibah Malluhi / Ibrahim Janahi / Sabri Boughorbel

    Healthcare Analytics, Vol 2, Iss , Pp 100050- (2022)

    2022  

    Abstract: Pediatric asthma is a leading cause of emergency department (ED) utilization, which is expensive and often preventable. Therefore, development of ED utilization predictive models that can accurately predict patients at high-risk of frequent ED use and ... ...

    Abstract Pediatric asthma is a leading cause of emergency department (ED) utilization, which is expensive and often preventable. Therefore, development of ED utilization predictive models that can accurately predict patients at high-risk of frequent ED use and subsequently steering their treatment pathway towards more personalized interventions, has high clinical utility. In this paper, we investigate the extent to which deep learning models, specifically recurrent neural networks (RNNs), coupled with routinely collected electronic health record (EHR) clinical data can predict the frequency of emergency department utilization among children with asthma.We use retrospective longitudinal EHR data of 87,413 children with asthma aged 0–18 years, who were attributed to one or more healthcare facility for at least 2 consecutive years between 2000–2013. The models were trained for the task of predicting the frequency of emergency department visits in the next 12 months. We compared prediction results of three recurrent neural network (RNN) models: bidirectional long short-term memory (BiLSTM), bidirectional gated recurrent unit (BiGRU), and reverse time attention model (RETAIN), to a baseline multinomial logistic regression model. We assessed the predictive accuracy of the models using receiver operating characteristic curve (AUC–ROC), precision–recall curve (AUC-PR), and F1-score.The results indicated that all RNN models have similar performances reaching AUC–ROC: 0.85, AUC-PR: 0.74, and F1-score: 0.61, compared to AUC–ROC: 0.81, AUC-PR: 0.69, and F1-score: 0.56 for a baseline multinomial logistic regression.Predictive models created from large routinely available EHR data using RNN models can accurately identify children with asthma at high-risk of repeated ED visits, without interacting with the patient or collecting information beyond the patient’s EHR.
    Keywords Deep learning ; Predictive models ; Electronic health record ; Emergency medicine ; Asthma ; Computer applications to medicine. Medical informatics ; R858-859.7
    Subject code 310
    Language English
    Publishing date 2022-11-01T00:00:00Z
    Publisher Elsevier
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  3. Article ; Online: DASSI

    Shabir Moosa / Prof. Abbes Amira / Dr. Sabri Boughorbel

    BioData Mining, Vol 14, Iss 1, Pp 1-

    differential architecture search for splice identification from DNA sequences

    2021  Volume 17

    Abstract: Abstract Background The data explosion caused by unprecedented advancements in the field of genomics is constantly challenging the conventional methods used in the interpretation of the human genome. The demand for robust algorithms over the recent years ...

    Abstract Abstract Background The data explosion caused by unprecedented advancements in the field of genomics is constantly challenging the conventional methods used in the interpretation of the human genome. The demand for robust algorithms over the recent years has brought huge success in the field of Deep Learning (DL) in solving many difficult tasks in image, speech and natural language processing by automating the manual process of architecture design. This has been fueled through the development of new DL architectures. Yet genomics possesses unique challenges that requires customization and development of new DL models. Methods We proposed a new model, DASSI, by adapting a differential architecture search method and applying it to the Splice Site (SS) recognition task on DNA sequences to discover new high-performance convolutional architectures in an automated manner. We evaluated the discovered model against state-of-the-art tools to classify true and false SS in Homo sapiens (Human), Arabidopsis thaliana (Plant), Caenorhabditis elegans (Worm) and Drosophila melanogaster (Fly). Results Our experimental evaluation demonstrated that the discovered architecture outperformed baseline models and fixed architectures and showed competitive results against state-of-the-art models used in classification of splice sites. The proposed model - DASSI has a compact architecture and showed very good results on a transfer learning task. The benchmarking experiments of execution time and precision on architecture search and evaluation process showed better performance on recently available GPUs making it feasible to adopt architecture search based methods on large datasets. Conclusions We proposed the use of differential architecture search method (DASSI) to perform SS classification on raw DNA sequences, and discovered new neural network models with low number of tunable parameters and competitive performance compared with manually engineered architectures. We have extensively benchmarked DASSI model with other state-of-the-art ...
    Keywords Deep learning ; Splice site ; Genomics ; Neural architecture search ; Convolutional neural networks ; Computer applications to medicine. Medical informatics ; R858-859.7 ; Analysis ; QA299.6-433
    Subject code 006
    Language English
    Publishing date 2021-02-01T00:00:00Z
    Publisher BMC
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  4. Article ; Online: A curated transcriptome dataset collection to investigate the blood transcriptional response to viral respiratory tract infection and vaccination. [version 1; peer review

    Salim Bougarn / Sabri Boughorbel / Damien Chaussabel / Nico Marr

    F1000Research, Vol

    2 approved]

    2019  Volume 8

    Abstract: The human immune defense mechanisms and factors associated with good versus poor health outcomes following viral respiratory tract infections (VRTI), as well as correlates of protection following vaccination against respiratory viruses, remain ... ...

    Abstract The human immune defense mechanisms and factors associated with good versus poor health outcomes following viral respiratory tract infections (VRTI), as well as correlates of protection following vaccination against respiratory viruses, remain incompletely understood. To shed further light into these mechanisms, a number of systems-scale studies have been conducted to measure transcriptional changes in blood leukocytes of either naturally or experimentally infected individuals, or in individual’s post-vaccination. Here we are making available a public repository, for research investigators for interpretation, a collection of transcriptome datasets obtained from human whole blood and peripheral blood mononuclear cells (PBMC) to investigate the transcriptional responses following viral respiratory tract infection or vaccination against respiratory viruses. In total, Thirty one31 datasets, associated to viral respiratory tract infections and their related vaccination studies, were identified and retrieved from the NCBI Gene Expression Omnibus (GEO) and loaded in a custom web application designed for interactive query and visualization of integrated large-scale data. Quality control checks, using relevant biological markers, were performed. Multiple sample groupings and rank lists were created to facilitate dataset query and interpretation. Via this interface, users can generate web links to customized graphical views, which may be subsequently inserted into manuscripts to report novel findings. The GXB tool enables browsing of a single gene across projects, providing new perspectives on the role of a given molecule across biological systems in the diagnostic and prognostic following VRTI but also in identifying new correlates of protection. This dataset collection is available at: http://vri1.gxbsidra.org/dm3/geneBrowser/list.
    Keywords Medicine ; R ; Science ; Q
    Language English
    Publishing date 2019-03-01T00:00:00Z
    Publisher F1000 Research Ltd
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  5. Article ; Online: A curated transcriptome dataset collection to investigate inborn errors of immunity [version 2; peer review

    Salim Bougarn / Sabri Boughorbel / Damien Chaussabel / Nico Marr

    F1000Research, Vol

    2 approved]

    2019  Volume 8

    Abstract: Primary immunodeficiencies (PIDs) are a heterogeneous group of inherited disorders, frequently caused by loss-of-function and less commonly by gain-of-function mutations, which can result in susceptibility to a broad or a very narrow range of infections ... ...

    Abstract Primary immunodeficiencies (PIDs) are a heterogeneous group of inherited disorders, frequently caused by loss-of-function and less commonly by gain-of-function mutations, which can result in susceptibility to a broad or a very narrow range of infections but also in inflammatory, allergic or malignant diseases. Owing to the wide range in clinical manifestations and variability in penetrance and expressivity, there is an urgent need to better understand the underlying molecular, cellular and immunological phenotypes in PID patients in order to improve clinical diagnosis and management. Here we have compiled a manually curated collection of public transcriptome datasets mainly obtained from human whole blood, peripheral blood mononuclear cells (PBMCs) or fibroblasts of patients with PIDs and of control subjects for subsequent meta-analysis, query and interpretation. A total of eighteen (18) datasets derived from studies of PID patients were identified and retrieved from the NCBI Gene Expression Omnibus (GEO) database and loaded in GXB, a custom web application designed for interactive query and visualization of integrated large-scale data. The dataset collection includes samples from well characterized PID patients that were stimulated ex vivo under a variety of conditions to assess the molecular consequences of the underlying, naturally occurring gene defects on a genome-wide scale. Multiple sample groupings and rank lists were generated to facilitate comparisons of the transcriptional responses between different PID patients and control subjects. The GXB tool enables browsing of a single transcript across studies, thereby providing new perspectives on the role of a given molecule across biological systems and PID patients. This dataset collection is available at http://pid.gxbsidra.org/dm3/geneBrowser/list.
    Keywords Medicine ; R ; Science ; Q
    Subject code 610
    Language English
    Publishing date 2019-08-01T00:00:00Z
    Publisher F1000 Research Ltd
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  6. Article ; Online: Interpreting patient-Specific risk prediction using contextual decomposition of BiLSTMs

    Rawan AlSaad / Qutaibah Malluhi / Ibrahim Janahi / Sabri Boughorbel

    BMC Medical Informatics and Decision Making, Vol 19, Iss 1, Pp 1-

    application to children with asthma

    2019  Volume 11

    Abstract: Abstract Background Predictive modeling with longitudinal electronic health record (EHR) data offers great promise for accelerating personalized medicine and better informs clinical decision-making. Recently, deep learning models have achieved state-of- ... ...

    Abstract Abstract Background Predictive modeling with longitudinal electronic health record (EHR) data offers great promise for accelerating personalized medicine and better informs clinical decision-making. Recently, deep learning models have achieved state-of-the-art performance for many healthcare prediction tasks. However, deep models lack interpretability, which is integral to successful decision-making and can lead to better patient care. In this paper, we build upon the contextual decomposition (CD) method, an algorithm for producing importance scores from long short-term memory networks (LSTMs). We extend the method to bidirectional LSTMs (BiLSTMs) and use it in the context of predicting future clinical outcomes using patients’ EHR historical visits. Methods We use a real EHR dataset comprising 11071 patients, to evaluate and compare CD interpretations from LSTM and BiLSTM models. First, we train LSTM and BiLSTM models for the task of predicting which pre-school children with respiratory system-related complications will have asthma at school-age. After that, we conduct quantitative and qualitative analysis to evaluate the CD interpretations produced by the contextual decomposition of the trained models. In addition, we develop an interactive visualization to demonstrate the utility of CD scores in explaining predicted outcomes. Results Our experimental evaluation demonstrate that whenever a clear visit-level pattern exists, the models learn that pattern and the contextual decomposition can appropriately attribute the prediction to the correct pattern. In addition, the results confirm that the CD scores agree to a large extent with the importance scores generated using logistic regression coefficients. Our main insight was that rather than interpreting the attribution of individual visits to the predicted outcome, we could instead attribute a model’s prediction to a group of visits. Conclusion We presented a quantitative and qualitative evidence that CD interpretations can explain patient-specific predictions ...
    Keywords Interpretability ; Deep learning ; Predictive models ; Electronic health record ; Computer applications to medicine. Medical informatics ; R858-859.7
    Subject code 006
    Language English
    Publishing date 2019-11-01T00:00:00Z
    Publisher BMC
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  7. Article ; Online: ­A curated transcriptomic dataset collection relevant to embryonic development associated with in vitro fertilization in healthy individuals and patients with polycystic ovary syndrome [version 1; referees

    Rafah Mackeh / Sabri Boughorbel / Damien Chaussabel / Tomoshige Kino

    F1000Research, Vol

    1 approved, 2 approved with reservations]

    2017  Volume 6

    Abstract: The collection of large-scale datasets available in public repositories is rapidly growing and providing opportunities to identify and fill gaps in different fields of biomedical research. However, users of these datasets should be able to selectively ... ...

    Abstract The collection of large-scale datasets available in public repositories is rapidly growing and providing opportunities to identify and fill gaps in different fields of biomedical research. However, users of these datasets should be able to selectively browse datasets related to their field of interest. Here we made available a collection of transcriptome datasets related to human follicular cells from normal individuals or patients with polycystic ovary syndrome, in the process of their development, during in vitro fertilization. After RNA-seq dataset exclusion and careful selection based on study description and sample information, 12 datasets, encompassing a total of 85 unique transcriptome profiles, were identified in NCBI Gene Expression Omnibus and uploaded to the Gene Expression Browser (GXB), a web application specifically designed for interactive query and visualization of integrated large-scale data. Once annotated in GXB, multiple sample grouping has been made in order to create rank lists to allow easy data interpretation and comparison. The GXB tool also allows the users to browse a single gene across multiple projects to evaluate its expression profiles in multiple biological systems/conditions in a web-based customized graphical views. The curated dataset is accessible at the following link: http://ivf.gxbsidra.org/dm3/landing.gsp.
    Keywords Pregnancy ; Labor ; Delivery & Postpartum Care ; Medicine ; R ; Science ; Q
    Language English
    Publishing date 2017-02-01T00:00:00Z
    Publisher F1000 Research Ltd
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  8. Article ; Online: Optimal classifier for imbalanced data using Matthews Correlation Coefficient metric.

    Sabri Boughorbel / Fethi Jarray / Mohammed El-Anbari

    PLoS ONE, Vol 12, Iss 6, p e

    2017  Volume 0177678

    Abstract: Data imbalance is frequently encountered in biomedical applications. Resampling techniques can be used in binary classification to tackle this issue. However such solutions are not desired when the number of samples in the small class is limited. ... ...

    Abstract Data imbalance is frequently encountered in biomedical applications. Resampling techniques can be used in binary classification to tackle this issue. However such solutions are not desired when the number of samples in the small class is limited. Moreover the use of inadequate performance metrics, such as accuracy, lead to poor generalization results because the classifiers tend to predict the largest size class. One of the good approaches to deal with this issue is to optimize performance metrics that are designed to handle data imbalance. Matthews Correlation Coefficient (MCC) is widely used in Bioinformatics as a performance metric. We are interested in developing a new classifier based on the MCC metric to handle imbalanced data. We derive an optimal Bayes classifier for the MCC metric using an approach based on Frechet derivative. We show that the proposed algorithm has the nice theoretical property of consistency. Using simulated data, we verify the correctness of our optimality result by searching in the space of all possible binary classifiers. The proposed classifier is evaluated on 64 datasets from a wide range data imbalance. We compare both classification performance and CPU efficiency for three classifiers: 1) the proposed algorithm (MCC-classifier), the Bayes classifier with a default threshold (MCC-base) and imbalanced SVM (SVM-imba). The experimental evaluation shows that MCC-classifier has a close performance to SVM-imba while being simpler and more efficient.
    Keywords Medicine ; R ; Science ; Q
    Subject code 006
    Language English
    Publishing date 2017-01-01T00:00:00Z
    Publisher Public Library of Science (PLoS)
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  9. Article ; Online: Model Comparison for Breast Cancer Prognosis Based on Clinical Data.

    Sabri Boughorbel / Rashid Al-Ali / Naser Elkum

    PLoS ONE, Vol 11, Iss 1, p e

    2016  Volume 0146413

    Abstract: We compared the performance of several prediction techniques for breast cancer prognosis, based on AU-ROC performance (Area Under ROC) for different prognosis periods. The analyzed dataset contained 1,981 patients and from an initial 25 variables, the 11 ...

    Abstract We compared the performance of several prediction techniques for breast cancer prognosis, based on AU-ROC performance (Area Under ROC) for different prognosis periods. The analyzed dataset contained 1,981 patients and from an initial 25 variables, the 11 most common clinical predictors were retained. We compared eight models from a wide spectrum of predictive models, namely; Generalized Linear Model (GLM), GLM-Net, Partial Least Square (PLS), Support Vector Machines (SVM), Random Forests (RF), Neural Networks, k-Nearest Neighbors (k-NN) and Boosted Trees. In order to compare these models, paired t-test was applied on the model performance differences obtained from data resampling. Random Forests, Boosted Trees, Partial Least Square and GLMNet have superior overall performance, however they are only slightly higher than the other models. The comparative analysis also allowed us to define a relative variable importance as the average of variable importance from the different models. Two sets of variables are identified from this analysis. The first includes number of positive lymph nodes, tumor size, cancer grade and estrogen receptor, all has an important influence on model predictability. The second set incudes variables related to histological parameters and treatment types. The short term vs long term contribution of the clinical variables are also analyzed from the comparative models. From the various cancer treatment plans, the combination of Chemo/Radio therapy leads to the largest impact on cancer prognosis.
    Keywords Medicine ; R ; Science ; Q
    Subject code 310
    Language English
    Publisher Public Library of Science (PLoS)
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  10. Article ; Online: Precision medicine in the era of artificial intelligence

    Murugan Subramanian / Anne Wojtusciszyn / Lucie Favre / Sabri Boughorbel / Jingxuan Shan / Khaled B. Letaief / Nelly Pitteloud / Lotfi Chouchane

    Journal of Translational Medicine, Vol 18, Iss 1, Pp 1-

    implications in chronic disease management

    2020  Volume 12

    Abstract: Abstract Aberrant metabolism is the root cause of several serious health issues, creating a huge burden to health and leading to diminished life expectancy. A dysregulated metabolism induces the secretion of several molecules which in turn trigger the ... ...

    Abstract Abstract Aberrant metabolism is the root cause of several serious health issues, creating a huge burden to health and leading to diminished life expectancy. A dysregulated metabolism induces the secretion of several molecules which in turn trigger the inflammatory pathway. Inflammation is the natural reaction of the immune system to a variety of stimuli, such as pathogens, damaged cells, and harmful substances. Metabolically triggered inflammation, also called metaflammation or low-grade chronic inflammation, is the consequence of a synergic interaction between the host and the exposome—a combination of environmental drivers, including diet, lifestyle, pollutants and other factors throughout the life span of an individual. Various levels of chronic inflammation are associated with several lifestyle-related diseases such as diabetes, obesity, metabolic associated fatty liver disease (MAFLD), cancers, cardiovascular disorders (CVDs), autoimmune diseases, and chronic lung diseases. Chronic diseases are a growing concern worldwide, placing a heavy burden on individuals, families, governments, and health-care systems. New strategies are needed to empower communities worldwide to prevent and treat these diseases. Precision medicine provides a model for the next generation of lifestyle modification. This will capitalize on the dynamic interaction between an individual’s biology, lifestyle, behavior, and environment. The aim of precision medicine is to design and improve diagnosis, therapeutics and prognostication through the use of large complex datasets that incorporate individual gene, function, and environmental variations. The implementation of high-performance computing (HPC) and artificial intelligence (AI) can predict risks with greater accuracy based on available multidimensional clinical and biological datasets. AI-powered precision medicine provides clinicians with an opportunity to specifically tailor early interventions to each individual. In this article, we discuss the strengths and limitations of ...
    Keywords Exposome ; Chronic inflammation ; Chronic diseases ; Precision medicine ; Personalized treatment ; Deep phenotyping ; Medicine ; R
    Subject code 610
    Language English
    Publishing date 2020-12-01T00:00:00Z
    Publisher BMC
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

To top