LIVIVO - The Search Portal for Life Sciences

zur deutschen Oberfläche wechseln
Advanced search

Search results

Result 1 - 10 of total 10

Search options

  1. Article ; Online: PAC-Bayes Unleashed

    Maxime Haddouche / Benjamin Guedj / Omar Rivasplata / John Shawe-Taylor

    Entropy, Vol 23, Iss 1330, p

    Generalisation Bounds with Unbounded Losses

    2021  Volume 1330

    Abstract: We present new PAC-Bayesian generalisation bounds for learning problems with unbounded loss functions. This extends the relevance and applicability of the PAC-Bayes learning framework, where most of the existing literature focuses on supervised learning ... ...

    Abstract We present new PAC-Bayesian generalisation bounds for learning problems with unbounded loss functions. This extends the relevance and applicability of the PAC-Bayes learning framework, where most of the existing literature focuses on supervised learning problems with a bounded loss function (typically assumed to take values in the interval [0;1]). In order to relax this classical assumption, we propose to allow the range of the loss to depend on each predictor. This relaxation is captured by our new notion of HYPothesis-dependent rangE (HYPE). Based on this, we derive a novel PAC-Bayesian generalisation bound for unbounded loss functions, and we instantiate it on a linear regression problem. To make our theory usable by the largest audience possible, we include discussions on actual computation, practicality and limitations of our assumptions.
    Keywords statistical learning theory ; PAC-Bayes ; generalisation bounds ; Science ; Q ; Astrophysics ; QB460-466 ; Physics ; QC1-999
    Subject code 006
    Language English
    Publishing date 2021-10-01T00:00:00Z
    Publisher MDPI AG
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  2. Article ; Online: Network topological determinants of pathogen spread

    María Pérez-Ortiz / Petru Manescu / Fabio Caccioli / Delmiro Fernández-Reyes / Parashkev Nachev / John Shawe-Taylor

    Scientific Reports, Vol 12, Iss 1, Pp 1-

    2022  Volume 13

    Abstract: Abstract How do we best constrain social interactions to decrease transmission of communicable diseases? Indiscriminate suppression is unsustainable long term and presupposes that all interactions carry equal importance. Instead, transmission within a ... ...

    Abstract Abstract How do we best constrain social interactions to decrease transmission of communicable diseases? Indiscriminate suppression is unsustainable long term and presupposes that all interactions carry equal importance. Instead, transmission within a social network has been shown to be determined by its topology. In this paper, we deploy simulations to understand and quantify the impact on disease transmission of a set of topological network features, building a dataset of 9000 interaction graphs using generators of different types of synthetic social networks. Independently of the topology of the network, we maintain constant the total volume of social interactions in our simulations, to show how even with the same social contact some network structures are more or less resilient to the spread. We find a suitable intervention to be specific suppression of unfamiliar and casual interactions that contribute to the network’s global efficiency. This is, pathogen spread is significantly reduced by limiting specific kinds of contact rather than their global number. Our numerical studies might inspire further investigation in connection to public health, as an integrative framework to craft and evaluate social interventions in communicable diseases with different social graphs or as a highlight of network metrics that should be captured in social studies.
    Keywords Medicine ; R ; Science ; Q
    Language English
    Publishing date 2022-05-01T00:00:00Z
    Publisher Nature Portfolio
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  3. Article ; Online: From fear to action

    Kevin Baum / Joanna Bryson / Frank Dignum / Virginia Dignum / Marko Grobelnik / Holger Hoos / Morten Irgens / Paul Lukowicz / Catelijne Muller / Francesca Rossi / John Shawe-Taylor / Andreas Theodorou / Ricardo Vinuesa

    Frontiers in Computer Science, Vol

    AI governance and opportunities for all

    2023  Volume 5

    Keywords Artificial Intelligence ; governance ; responsible AI ; Trustworthy AI ; large language models ; generative AI ; Electronic computers. Computer science ; QA75.5-76.95
    Language English
    Publishing date 2023-05-01T00:00:00Z
    Publisher Frontiers Media S.A.
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  4. Article ; Online: Gene Function Prediction from Functional Association Networks Using Kernel Partial Least Squares Regression.

    Sonja Lehtinen / Jon Lees / Jürg Bähler / John Shawe-Taylor / Christine Orengo

    PLoS ONE, Vol 10, Iss 8, p e

    2015  Volume 0134668

    Abstract: With the growing availability of large-scale biological datasets, automated methods of extracting functionally meaningful information from this data are becoming increasingly important. Data relating to functional association between genes or proteins, ... ...

    Abstract With the growing availability of large-scale biological datasets, automated methods of extracting functionally meaningful information from this data are becoming increasingly important. Data relating to functional association between genes or proteins, such as co-expression or functional association, is often represented in terms of gene or protein networks. Several methods of predicting gene function from these networks have been proposed. However, evaluating the relative performance of these algorithms may not be trivial: concerns have been raised over biases in different benchmarking methods and datasets, particularly relating to non-independence of functional association data and test data. In this paper we propose a new network-based gene function prediction algorithm using a commute-time kernel and partial least squares regression (Compass). We compare Compass to GeneMANIA, a leading network-based prediction algorithm, using a number of different benchmarks, and find that Compass outperforms GeneMANIA on these benchmarks. We also explicitly explore problems associated with the non-independence of functional association data and test data. We find that a benchmark based on the Gene Ontology database, which, directly or indirectly, incorporates information from other databases, may considerably overestimate the performance of algorithms exploiting functional association data for prediction.
    Keywords Medicine ; R ; Science ; Q
    Subject code 006
    Language English
    Publishing date 2015-01-01T00:00:00Z
    Publisher Public Library of Science (PLoS)
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  5. Article ; Online: Biomarker discovery by sparse canonical correlation analysis of complex clinical phenotypes of tuberculosis and malaria.

    Juho Rousu / Daniel D Agranoff / Olugbemiro Sodeinde / John Shawe-Taylor / Delmiro Fernandez-Reyes

    PLoS Computational Biology, Vol 9, Iss 4, p e

    2013  Volume 1003018

    Abstract: Biomarker discovery aims to find small subsets of relevant variables in 'omics data that correlate with the clinical syndromes of interest. Despite the fact that clinical phenotypes are usually characterized by a complex set of clinical parameters, ... ...

    Abstract Biomarker discovery aims to find small subsets of relevant variables in 'omics data that correlate with the clinical syndromes of interest. Despite the fact that clinical phenotypes are usually characterized by a complex set of clinical parameters, current computational approaches assume univariate targets, e.g. diagnostic classes, against which associations are sought for. We propose an approach based on asymmetrical sparse canonical correlation analysis (SCCA) that finds multivariate correlations between the 'omics measurements and the complex clinical phenotypes. We correlated plasma proteomics data to multivariate overlapping complex clinical phenotypes from tuberculosis and malaria datasets. We discovered relevant 'omic biomarkers that have a high correlation to profiles of clinical measurements and are remarkably sparse, containing 1.5-3% of all 'omic variables. We show that using clinical view projections we obtain remarkable improvements in diagnostic class prediction, up to 11% in tuberculosis and up to 5% in malaria. Our approach finds proteomic-biomarkers that correlate with complex combinations of clinical-biomarkers. Using the clinical-biomarkers improves the accuracy of diagnostic class prediction while not requiring the measurement plasma proteomic profiles of each subject. Our approach makes it feasible to use omics' data to build accurate diagnostic algorithms that can be deployed to community health centres lacking the expensive 'omics measurement capabilities.
    Keywords Biology (General) ; QH301-705.5
    Subject code 310
    Language English
    Publishing date 2013-04-01T00:00:00Z
    Publisher Public Library of Science (PLoS)
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  6. Article ; Online: Directional migration of recirculating lymphocytes through lymph nodes via random walks.

    Niclas Thomas / Lenka Matejovicova / Wichat Srikusalanukul / John Shawe-Taylor / Benny Chain

    PLoS ONE, Vol 7, Iss 9, p e

    2012  Volume 45262

    Abstract: Naive T lymphocytes exhibit extensive antigen-independent recirculation between blood and lymph nodes, where they may encounter dendritic cells carrying cognate antigen. We examine how long different T cells may spend in an individual lymph node by ... ...

    Abstract Naive T lymphocytes exhibit extensive antigen-independent recirculation between blood and lymph nodes, where they may encounter dendritic cells carrying cognate antigen. We examine how long different T cells may spend in an individual lymph node by examining data from long term cannulation of blood and efferent lymphatics of a single lymph node in the sheep. We determine empirically the distribution of transit times of migrating T cells by applying the Least Absolute Shrinkage & Selection Operator (LASSO) or regularised S-LASSO to fit experimental data describing the proportion of labelled infused cells in blood and efferent lymphatics over time. The optimal inferred solution reveals a distribution with high variance and strong skew. The mode transit time is typically between 10 and 20 hours, but a significant number of cells spend more than 70 hours before exiting. We complement the empirical machine learning based approach by modelling lymphocyte passage through the lymph node insilico. On the basis of previous two photon analysis of lymphocyte movement, we optimised distributions which describe the transit times (first passage times) of discrete one dimensional and continuous (Brownian) three dimensional random walks with drift. The optimal fit is obtained when drift is small, i.e. the ratio of probabilities of migrating forward and backward within the node is close to one. These distributions are qualitatively similar to the inferred empirical distribution, with high variance and strong skew. In contrast, an optimised normal distribution of transit times (symmetrical around mean) fitted the data poorly. The results demonstrate that the rapid recirculation of lymphocytes observed at a macro level is compatible with predominantly randomised movement within lymph nodes, and significant probabilities of long transit times. We discuss how this pattern of migration may contribute to facilitating interactions between low frequency T cells and antigen presenting cells carrying cognate antigen.
    Keywords Medicine ; R ; Science ; Q
    Subject code 519
    Language English
    Publishing date 2012-01-01T00:00:00Z
    Publisher Public Library of Science (PLoS)
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  7. Article ; Online: The Human Behaviour-Change Project

    Susan Michie / James Thomas / Marie Johnston / Pol Mac Aonghusa / John Shawe-Taylor / Michael P. Kelly / Léa A. Deleris / Ailbhe N. Finnerty / Marta M. Marques / Emma Norris / Alison O’Mara-Eves / Robert West

    Implementation Science, Vol 12, Iss 1, Pp 1-

    harnessing the power of artificial intelligence and machine learning for evidence synthesis and interpretation

    2017  Volume 12

    Abstract: Abstract Background Behaviour change is key to addressing both the challenges facing human health and wellbeing and to promoting the uptake of research findings in health policy and practice. We need to make better use of the vast amount of accumulating ... ...

    Abstract Abstract Background Behaviour change is key to addressing both the challenges facing human health and wellbeing and to promoting the uptake of research findings in health policy and practice. We need to make better use of the vast amount of accumulating evidence from behaviour change intervention (BCI) evaluations and promote the uptake of that evidence into a wide range of contexts. The scale and complexity of the task of synthesising and interpreting this evidence, and increasing evidence timeliness and accessibility, will require increased computer support. The Human Behaviour-Change Project (HBCP) will use Artificial Intelligence and Machine Learning to (i) develop and evaluate a ‘Knowledge System’ that automatically extracts, synthesises and interprets findings from BCI evaluation reports to generate new insights about behaviour change and improve prediction of intervention effectiveness and (ii) allow users, such as practitioners, policy makers and researchers, to easily and efficiently query the system to get answers to variants of the question ‘What works, compared with what, how well, with what exposure, with what behaviours (for how long), for whom, in what settings and why?’. Methods The HBCP will: a) develop an ontology of BCI evaluations and their reports linking effect sizes for given target behaviours with intervention content and delivery and mechanisms of action, as moderated by exposure, populations and settings; b) develop and train an automated feature extraction system to annotate BCI evaluation reports using this ontology; c) develop and train machine learning and reasoning algorithms to use the annotated BCI evaluation reports to predict effect sizes for particular combinations of behaviours, interventions, populations and settings; d) build user and machine interfaces for interrogating and updating the knowledge base; and e) evaluate all the above in terms of performance and utility. Discussion The HBCP aims to revolutionise our ability to synthesise, interpret and deliver evidence on ...
    Keywords Behaviour change interventions ; Implementation ; Ontology ; Machine learning ; Natural language processing ; Evidence synthesis ; Medicine (General) ; R5-920
    Subject code 670
    Language English
    Publishing date 2017-10-01T00:00:00Z
    Publisher BMC
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  8. Article ; Online: Movement activity based classification of animal behaviour with an application to data from cheetah (Acinonyx jubatus).

    Steffen Grünewälder / Femke Broekhuis / David Whyte Macdonald / Alan Martin Wilson / John Weldon McNutt / John Shawe-Taylor / Stephen Hailes

    PLoS ONE, Vol 7, Iss 11, p e

    2012  Volume 49120

    Abstract: We propose a new method, based on machine learning techniques, for the analysis of a combination of continuous data from dataloggers and a sampling of contemporaneous behaviour observations. This data combination provides an opportunity for biologists to ...

    Abstract We propose a new method, based on machine learning techniques, for the analysis of a combination of continuous data from dataloggers and a sampling of contemporaneous behaviour observations. This data combination provides an opportunity for biologists to study behaviour at a previously unknown level of detail and accuracy; however, continuously recorded data are of little use unless the resulting large volumes of raw data can be reliably translated into actual behaviour. We address this problem by applying a Support Vector Machine and a Hidden-Markov Model that allows us to classify an animal's behaviour using a small set of field observations to calibrate continuously recorded activity data. Such classified data can be applied quantitatively to the behaviour of animals over extended periods and at times during which observation is difficult or impossible. We demonstrate the usefulness of the method by applying it to data from six cheetah (Acinonyx jubatus) in the Okavango Delta, Botswana. Cumulative activity data scores were recorded every five minutes by accelerometers embedded in GPS radio-collars for around one year on average. Direct behaviour sampling of each of the six cheetah were collected in the field for comparatively short periods. Using this approach we are able to classify each five minute activity score into a set of three key behaviour (feeding, mobile and stationary), creating a continuous behavioural sequence for the entire period for which the collars were deployed. Evaluation of our classifier with cross-validation shows the accuracy to be 83%-94%, but that the accuracy for individual classes is reduced with decreasing sample size of direct observations. We demonstrate how these processed data can be used to study behaviour identifying seasonal and gender differences in daily activity and feeding times. Results given here are unlike any that could be obtained using traditional approaches in both accuracy and detail.
    Keywords Medicine ; R ; Science ; Q
    Subject code 006
    Language English
    Publishing date 2012-01-01T00:00:00Z
    Publisher Public Library of Science (PLoS)
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  9. Article ; Online: Extracting diagnoses and investigation results from unstructured text in electronic health records by semi-supervised machine learning.

    Zhuoran Wang / Anoop D Shah / A Rosemary Tate / Spiros Denaxas / John Shawe-Taylor / Harry Hemingway

    PLoS ONE, Vol 7, Iss 1, p e

    2012  Volume 30412

    Abstract: Background Electronic health records are invaluable for medical research, but much of the information is recorded as unstructured free text which is time-consuming to review manually. Aim To develop an algorithm to identify relevant free texts ... ...

    Abstract Background Electronic health records are invaluable for medical research, but much of the information is recorded as unstructured free text which is time-consuming to review manually. Aim To develop an algorithm to identify relevant free texts automatically based on labelled examples. Methods We developed a novel machine learning algorithm, the 'Semi-supervised Set Covering Machine' (S3CM), and tested its ability to detect the presence of coronary angiogram results and ovarian cancer diagnoses in free text in the General Practice Research Database. For training the algorithm, we used texts classified as positive and negative according to their associated Read diagnostic codes, rather than by manual annotation. We evaluated the precision (positive predictive value) and recall (sensitivity) of S3CM in classifying unlabelled texts against the gold standard of manual review. We compared the performance of S3CM with the Transductive Vector Support Machine (TVSM), the original fully-supervised Set Covering Machine (SCM) and our 'Freetext Matching Algorithm' natural language processor. Results Only 60% of texts with Read codes for angiogram actually contained angiogram results. However, the S3CM algorithm achieved 87% recall with 64% precision on detecting coronary angiogram results, outperforming the fully-supervised SCM (recall 78%, precision 60%) and TSVM (recall 2%, precision 3%). For ovarian cancer diagnoses, S3CM had higher recall than the other algorithms tested (86%). The Freetext Matching Algorithm had better precision than S3CM (85% versus 74%) but lower recall (62%). Conclusions Our novel S3CM machine learning algorithm effectively detected free texts in primary care records associated with angiogram results and ovarian cancer diagnoses, after training on pre-classified test sets. It should be easy to adapt to other disease areas as it does not rely on linguistic rules, but needs further testing in other electronic health record datasets.
    Keywords Medicine ; R ; Science ; Q
    Subject code 006
    Language English
    Publishing date 2012-01-01T00:00:00Z
    Publisher Public Library of Science (PLoS)
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  10. Article ; Online: Data-driven malaria prevalence prediction in large densely populated urban holoendemic sub-Saharan West Africa

    Biobele J. Brown / Petru Manescu / Alexander A. Przybylski / Fabio Caccioli / Gbeminiyi Oyinloye / Muna Elmi / Michael J. Shaw / Vijay Pawar / Remy Claveau / John Shawe-Taylor / Mandayam A. Srinivasan / Nathaniel K. Afolabi / Geraint Rees / Adebola E. Orimadegun / Wasiu A. Ajetunmobi / Francis Akinkunmi / Olayinka Kowobari / Kikelomo Osinusi / Felix O. Akinbami /
    Samuel Omokhodion / Wuraola A. Shokunbi / Ikeoluwa Lagunju / Olugbemiro Sodeinde / Delmiro Fernandez-Reyes

    Scientific Reports, Vol 10, Iss 1, Pp 1-

    2020  Volume 17

    Abstract: Abstract Over 200 million malaria cases globally lead to half-million deaths annually. The development of malaria prevalence prediction systems to support malaria care pathways has been hindered by lack of data, a tendency towards universal “monolithic” ... ...

    Abstract Abstract Over 200 million malaria cases globally lead to half-million deaths annually. The development of malaria prevalence prediction systems to support malaria care pathways has been hindered by lack of data, a tendency towards universal “monolithic” models (one-size-fits-all-regions) and a focus on long lead time predictions. Current systems do not provide short-term local predictions at an accuracy suitable for deployment in clinical practice. Here we show a data-driven approach that reliably produces one-month-ahead prevalence prediction within a densely populated all-year-round malaria metropolis of over 3.5 million inhabitants situated in Nigeria which has one of the largest global burdens of P. falciparum malaria. We estimate one-month-ahead prevalence in a unique 22-years prospective regional dataset of > 9 × 104 participants attending our healthcare services. Our system agrees with both magnitude and direction of the prediction on validation data achieving MAE ≤ 6 × 10–2, MSE ≤ 7 × 10–3, PCC (median 0.63, IQR 0.3) and with more than 80% of estimates within a (+ 0.1 to − 0.05) error-tolerance range which is clinically relevant for decision-support in our holoendemic setting. Our data-driven approach could facilitate healthcare systems to harness their own data to support local malaria care pathways.
    Keywords Medicine ; R ; Science ; Q
    Subject code 006
    Language English
    Publishing date 2020-09-01T00:00:00Z
    Publisher Nature Publishing Group
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

To top