LIVIVO - The Search Portal for Life Sciences

zur deutschen Oberfläche wechseln
Advanced search

Search results

Result 1 - 10 of total 23

Search options

  1. Article ; Online: Use of a deep learning and random forest approach to track changes in the predictive nature of socioeconomic drivers of under-5 mortality rates in sub-Saharan Africa

    Justine B Nasejje / Rendani Mbuvha / Henry Mwambi

    BMJ Open, Vol 12, Iss

    2022  Volume 2

    Keywords Medicine ; R
    Language English
    Publishing date 2022-02-01T00:00:00Z
    Publisher BMJ Publishing Group
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  2. Article ; Online: Alleviating Class Imbalance in Actuarial Applications Using Generative Adversarial Networks

    Kwanda Sydwell Ngwenduna / Rendani Mbuvha

    Risks, Vol 9, Iss 49, p

    2021  Volume 49

    Abstract: To build adequate predictive models, a substantial amount of data is desirable. However, when expanding to new or unexplored territories, this required level of information is rarely always available. To build such models, actuaries often have to: ... ...

    Abstract To build adequate predictive models, a substantial amount of data is desirable. However, when expanding to new or unexplored territories, this required level of information is rarely always available. To build such models, actuaries often have to: procure data from local providers, use limited unsuitable industry and public research, or rely on extrapolations from other better-known markets. Another common pathology when applying machine learning techniques in actuarial domains is the prevalence of imbalanced classes where risk events of interest, such as mortality and fraud, are under-represented in data. In this work, we show how an implicit model using the Generative Adversarial Network (GAN) can alleviate these problems through the generation of adequate quality data from very limited or highly imbalanced samples. We provide an introduction to GANs and how they are used to synthesize data that accurately enhance the data resolution of very infrequent events and improve model robustness. Overall, we show a significant superiority of GANs for boosting predictive models when compared to competing approaches on benchmark data sets. This work offers numerous of contributions to actuaries with applications to inter alia new sample creation, data augmentation, boosting predictive models, anomaly detection, and missing data imputation.
    Keywords actuarial science ; class imbalance ; data augmentation ; generative models ; generative adversarial network ; synthetic sampling ; Insurance ; HG8011-9999
    Subject code 310 ; 006
    Language English
    Publishing date 2021-03-01T00:00:00Z
    Publisher MDPI AG
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  3. Article ; Online: Use of a deep learning and random forest approach to track changes in the predictive nature of socioeconomic drivers of under-5 mortality rates in sub-Saharan Africa.

    Nasejje, Justine B / Mbuvha, Rendani / Mwambi, Henry

    BMJ open

    2022  Volume 12, Issue 2, Page(s) e049786

    Abstract: Objectives: We used machine learning algorithms to track how the ranks of importance and the survival outcome of four socioeconomic determinants (place of residence, mother's level of education, wealth index and sex of the child) of under-5 mortality ... ...

    Abstract Objectives: We used machine learning algorithms to track how the ranks of importance and the survival outcome of four socioeconomic determinants (place of residence, mother's level of education, wealth index and sex of the child) of under-5 mortality rate (U5MR) in sub-Saharan Africa have evolved.
    Settings: This work consists of multiple cross-sectional studies. We analysed data from the Demographic Health Surveys (DHS) collected from four countries; Uganda, Zimbabwe, Chad and Ghana, each randomly selected from the four subregions of sub-Saharan Africa.
    Participants: Each country has multiple DHS datasets and a total of 11 datasets were selected for analysis. A total of n=85 688 children were drawn from the eleven datasets.
    Primary and secondary outcomes: The primary outcome variable is U5MR; the secondary outcomes were to obtain the ranks of importance of the four socioeconomic factors over time and to compare the two machine learning models, the random survival forest (RSF) and the deep survival neural network (DeepSurv) in predicting U5MR.
    Results: Mother's education level ranked first in five datasets. Wealth index ranked first in three, place of residence ranked first in two and sex of the child ranked last in most of the datasets. The four factors showed a favourable survival outcome over time, confirming that past interventions targeting these factors are yielding positive results. The DeepSurv model has a higher predictive performance with mean concordance indexes (between 67% and 80%), above 50% compared with the RSF model.
    Conclusions: The study reveals that children under the age of 5 in sub-Saharan Africa have favourable survival outcomes associated with the four socioeconomic factors over time. It also shows that deep survival neural network models are efficient in predicting U5MR and should, therefore, be used in the big data era to draft evidence-based policies to achieve the third sustainable development goal.
    MeSH term(s) Child ; Child Mortality ; Cross-Sectional Studies ; Deep Learning ; Ghana/epidemiology ; Humans ; Socioeconomic Factors
    Language English
    Publishing date 2022-02-17
    Publishing country England
    Document type Journal Article ; Research Support, Non-U.S. Gov't
    ZDB-ID 2599832-8
    ISSN 2044-6055 ; 2044-6055
    ISSN (online) 2044-6055
    ISSN 2044-6055
    DOI 10.1136/bmjopen-2021-049786
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  4. Article ; Online: Bayesian inference of COVID-19 spreading rates in South Africa.

    Mbuvha, Rendani / Marwala, Tshilidzi

    PloS one

    2020  Volume 15, Issue 8, Page(s) e0237126

    Abstract: The Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic has highlighted the need for performing accurate inference with limited data. Fundamental to the design of rapid state responses is the ability to perform epidemiological model ... ...

    Abstract The Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic has highlighted the need for performing accurate inference with limited data. Fundamental to the design of rapid state responses is the ability to perform epidemiological model parameter inference for localised trajectory predictions. In this work, we perform Bayesian parameter inference using Markov Chain Monte Carlo (MCMC) methods on the Susceptible-Infected-Recovered (SIR) and Susceptible-Exposed-Infected-Recovered (SEIR) epidemiological models with time-varying spreading rates for South Africa. The results find two change points in the spreading rate of COVID-19 in South Africa as inferred from the confirmed cases. The first change point coincides with state enactment of a travel ban and the resultant containment of imported infections. The second change point coincides with the start of a state-led mass screening and testing programme which has highlighted community-level disease spread that was not well represented in the initial largely traveller based and private laboratory dominated testing data. The results further suggest that due to the likely effect of the national lockdown, community level transmissions are slower than the original imported case driven spread of the disease.
    MeSH term(s) Algorithms ; Bayes Theorem ; COVID-19 ; Coronavirus Infections/diagnosis ; Coronavirus Infections/epidemiology ; Coronavirus Infections/transmission ; Coronavirus Infections/virology ; Humans ; Markov Chains ; Monte Carlo Method ; Pandemics ; Pneumonia, Viral/diagnosis ; Pneumonia, Viral/epidemiology ; Pneumonia, Viral/transmission ; Pneumonia, Viral/virology ; South Africa/epidemiology
    Keywords covid19
    Language English
    Publishing date 2020-08-05
    Publishing country United States
    Document type Journal Article ; Research Support, Non-U.S. Gov't
    ZDB-ID 2267670-3
    ISSN 1932-6203 ; 1932-6203
    ISSN (online) 1932-6203
    ISSN 1932-6203
    DOI 10.1371/journal.pone.0237126
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  5. Article ; Online: Bayesian inference of COVID-19 spreading rates in South Africa.

    Rendani Mbuvha / Tshilidzi Marwala

    PLoS ONE, Vol 15, Iss 8, p e

    2020  Volume 0237126

    Abstract: The Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic has highlighted the need for performing accurate inference with limited data. Fundamental to the design of rapid state responses is the ability to perform epidemiological model ... ...

    Abstract The Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic has highlighted the need for performing accurate inference with limited data. Fundamental to the design of rapid state responses is the ability to perform epidemiological model parameter inference for localised trajectory predictions. In this work, we perform Bayesian parameter inference using Markov Chain Monte Carlo (MCMC) methods on the Susceptible-Infected-Recovered (SIR) and Susceptible-Exposed-Infected-Recovered (SEIR) epidemiological models with time-varying spreading rates for South Africa. The results find two change points in the spreading rate of COVID-19 in South Africa as inferred from the confirmed cases. The first change point coincides with state enactment of a travel ban and the resultant containment of imported infections. The second change point coincides with the start of a state-led mass screening and testing programme which has highlighted community-level disease spread that was not well represented in the initial largely traveller based and private laboratory dominated testing data. The results further suggest that due to the likely effect of the national lockdown, community level transmissions are slower than the original imported case driven spread of the disease.
    Keywords Medicine ; R ; Science ; Q ; covid19
    Subject code 310
    Language English
    Publishing date 2020-01-01T00:00:00Z
    Publisher Public Library of Science (PLoS)
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  6. Article ; Online: Bayesian inference of local government audit outcomes.

    Mongwe, Wilson Tsakane / Mbuvha, Rendani / Marwala, Tshilidzi

    PloS one

    2021  Volume 16, Issue 12, Page(s) e0261245

    Abstract: The scandals in publicly listed companies have highlighted the large losses that can result from financial statement fraud and weak corporate governance. Machine learning techniques have been applied to automatically detect financial statement fraud with ...

    Abstract The scandals in publicly listed companies have highlighted the large losses that can result from financial statement fraud and weak corporate governance. Machine learning techniques have been applied to automatically detect financial statement fraud with great success. This work presents the first application of a Bayesian inference approach to the problem of predicting the audit outcomes of financial statements of local government entities using financial ratios. Bayesian logistic regression (BLR) with automatic relevance determination (BLR-ARD) is applied to predict audit outcomes. The benefit of using BLR-ARD, instead of BLR without ARD, is that it allows one to automatically determine which input features are the most relevant for the task at hand, which is a critical aspect to consider when designing decision support systems. This work presents the first implementation of BLR-ARD trained with Separable Shadow Hamiltonian Hybrid Monte Carlo, No-U-Turn sampler, Metropolis Adjusted Langevin Algorithm and Metropolis-Hasting algorithms. Unlike the Gibbs sampling procedure that is typically employed in sampling from ARD models, in this work we jointly sample the parameters and the hyperparameters by putting a log normal prior on the hyperparameters. The analysis also shows that the repairs and maintenance as a percentage of total assets ratio, current ratio, debt to total operating revenue, net operating surplus margin and capital cost to total operating expenditure ratio are the important features when predicting local government audit outcomes using financial ratios. These results could be of use for auditors as focusing on these ratios could potentially speed up the detection of fraudulent behaviour in municipal entities, and improve the speed and quality of the overall audit.
    MeSH term(s) Algorithms ; Bayes Theorem ; Financial Audit/methods ; Financial Audit/standards ; Financial Audit/statistics & numerical data ; Fraud/economics ; Fraud/prevention & control ; Fraud/statistics & numerical data ; Humans ; Local Government ; Models, Statistical ; Monte Carlo Method
    Language English
    Publishing date 2021-12-14
    Publishing country United States
    Document type Journal Article ; Research Support, Non-U.S. Gov't
    ZDB-ID 2267670-3
    ISSN 1932-6203 ; 1932-6203
    ISSN (online) 1932-6203
    ISSN 1932-6203
    DOI 10.1371/journal.pone.0261245
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  7. Article ; Online: Quantum-Inspired Magnetic Hamiltonian Monte Carlo.

    Mongwe, Wilson Tsakane / Mbuvha, Rendani / Marwala, Tshilidzi

    PloS one

    2021  Volume 16, Issue 10, Page(s) e0258277

    Abstract: Hamiltonian Monte Carlo (HMC) is a Markov Chain Monte Carlo algorithm that is able to generate distant proposals via the use of Hamiltonian dynamics, which are able to incorporate first-order gradient information about the target posterior. This has ... ...

    Abstract Hamiltonian Monte Carlo (HMC) is a Markov Chain Monte Carlo algorithm that is able to generate distant proposals via the use of Hamiltonian dynamics, which are able to incorporate first-order gradient information about the target posterior. This has driven its rise in popularity in the machine learning community in recent times. It has been shown that making use of the energy-time uncertainty relation from quantum mechanics, one can devise an extension to HMC by allowing the mass matrix to be random with a probability distribution instead of a fixed mass. Furthermore, Magnetic Hamiltonian Monte Carlo (MHMC) has been recently proposed as an extension to HMC and adds a magnetic field to HMC which results in non-canonical dynamics associated with the movement of a particle under a magnetic field. In this work, we utilise the non-canonical dynamics of MHMC while allowing the mass matrix to be random to create the Quantum-Inspired Magnetic Hamiltonian Monte Carlo (QIMHMC) algorithm, which is shown to converge to the correct steady state distribution. Empirical results on a broad class of target posterior distributions show that the proposed method produces better sampling performance than HMC, MHMC and HMC with a random mass matrix.
    MeSH term(s) Bayes Theorem ; Databases as Topic ; Magnetic Phenomena ; Monte Carlo Method ; Multivariate Analysis ; Quantum Theory ; Regression Analysis
    Language English
    Publishing date 2021-10-05
    Publishing country United States
    Document type Journal Article ; Research Support, Non-U.S. Gov't
    ZDB-ID 2267670-3
    ISSN 1932-6203 ; 1932-6203
    ISSN (online) 1932-6203
    ISSN 1932-6203
    DOI 10.1371/journal.pone.0258277
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  8. Article ; Online: Alleviating class imbalance in actuarial applications using generative adversarial networks

    Ngwenduna, Kwanda Sydwell / Mbuvha, Rendani

    2021  

    Abstract: To build adequate predictive models, a substantial amount of data is desirable. However, when expanding to new or unexplored territories, this required level of information is rarely always available. To build such models, actuaries often have to: ... ...

    Abstract To build adequate predictive models, a substantial amount of data is desirable. However, when expanding to new or unexplored territories, this required level of information is rarely always available. To build such models, actuaries often have to: procure data from local providers, use limited unsuitable industry and public research, or rely on extrapolations from other better-known markets. Another common pathology when applying machine learning techniques in actuarial domains is the prevalence of imbalanced classes where risk events of interest, such as mortality and fraud, are under-represented in data. In this work, we show how an implicit model using the Generative Adversarial Network (GAN) can alleviate these problems through the generation of adequate quality data from very limited or highly imbalanced samples. We provide an introduction to GANs and how they are used to synthesize data that accurately enhance the data resolution of very infrequent events and improve model robustness. Overall, we show a significant superiority of GANs for boosting predictive models when compared to competing approaches on benchmark data sets. This work offers numerous of contributions to actuaries with applications to inter alia new sample creation, data augmentation, boosting predictive models, anomaly detection, and missing data imputation.
    Keywords ddc:330 ; actuarial science ; class imbalance ; data augmentation ; generative adversarial network ; generative models ; SMOTE ; synthetic sampling
    Subject code 310 ; 006
    Language English
    Publisher Basel: MDPI
    Publishing country de
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  9. Article ; Online: Bayesian inference of COVID-19 spreading rates in South Africa

    Mbuvha, Rendani / Marwala, Tshilidzi

    PLOS ONE

    2020  Volume 15, Issue 8, Page(s) e0237126

    Keywords General Biochemistry, Genetics and Molecular Biology ; General Agricultural and Biological Sciences ; General Medicine ; covid19
    Language English
    Publisher Public Library of Science (PLoS)
    Publishing country us
    Document type Article ; Online
    ISSN 1932-6203
    DOI 10.1371/journal.pone.0237126
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  10. Article ; Online: Quantum-Inspired Magnetic Hamiltonian Monte Carlo.

    Wilson Tsakane Mongwe / Rendani Mbuvha / Tshilidzi Marwala

    PLoS ONE, Vol 16, Iss 10, p e

    2021  Volume 0258277

    Abstract: Hamiltonian Monte Carlo (HMC) is a Markov Chain Monte Carlo algorithm that is able to generate distant proposals via the use of Hamiltonian dynamics, which are able to incorporate first-order gradient information about the target posterior. This has ... ...

    Abstract Hamiltonian Monte Carlo (HMC) is a Markov Chain Monte Carlo algorithm that is able to generate distant proposals via the use of Hamiltonian dynamics, which are able to incorporate first-order gradient information about the target posterior. This has driven its rise in popularity in the machine learning community in recent times. It has been shown that making use of the energy-time uncertainty relation from quantum mechanics, one can devise an extension to HMC by allowing the mass matrix to be random with a probability distribution instead of a fixed mass. Furthermore, Magnetic Hamiltonian Monte Carlo (MHMC) has been recently proposed as an extension to HMC and adds a magnetic field to HMC which results in non-canonical dynamics associated with the movement of a particle under a magnetic field. In this work, we utilise the non-canonical dynamics of MHMC while allowing the mass matrix to be random to create the Quantum-Inspired Magnetic Hamiltonian Monte Carlo (QIMHMC) algorithm, which is shown to converge to the correct steady state distribution. Empirical results on a broad class of target posterior distributions show that the proposed method produces better sampling performance than HMC, MHMC and HMC with a random mass matrix.
    Keywords Medicine ; R ; Science ; Q
    Subject code 541
    Language English
    Publishing date 2021-01-01T00:00:00Z
    Publisher Public Library of Science (PLoS)
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

To top