LIVIVO - The Search Portal for Life Sciences

zur deutschen Oberfläche wechseln
Advanced search

Search results

Result 1 - 10 of total 10

Search options

  1. Article ; Online: Improvement of Malicious Software Detection Accuracy through Genetic Programming Symbolic Classifier with Application of Dataset Oversampling Techniques

    Nikola Anđelić / Sandi Baressi Šegota / Zlatan Car

    Computers, Vol 12, Iss 12, p

    2023  Volume 242

    Abstract: Malware detection using hybrid features, combining binary and hexadecimal analysis with DLL calls, is crucial for leveraging the strengths of both static and dynamic analysis methods. Artificial intelligence (AI) enhances this process by enabling ... ...

    Abstract Malware detection using hybrid features, combining binary and hexadecimal analysis with DLL calls, is crucial for leveraging the strengths of both static and dynamic analysis methods. Artificial intelligence (AI) enhances this process by enabling automated pattern recognition, anomaly detection, and continuous learning, allowing security systems to adapt to evolving threats and identify complex, polymorphic malware that may exhibit varied behaviors. This synergy of hybrid features with AI empowers malware detection systems to efficiently and proactively identify and respond to sophisticated cyber threats in real time. In this paper, the genetic programming symbolic classifier (GPSC) algorithm was applied to the publicly available dataset to obtain symbolic expressions (SEs) that could detect the malware software with high classification performance. The initial problem with the dataset was a high imbalance between class samples, so various oversampling techniques were utilized to obtain balanced dataset variations on which GPSC was applied. To find the optimal combination of GPSC hyperparameter values, the random hyperparameter value search method (RHVS) was developed and applied to obtain SEs with high classification accuracy. The GPSC was trained with five-fold cross-validation (5FCV) to obtain a robust set of SEs on each dataset variation. To choose the best SEs, several evaluation metrics were used, i.e., the length and depth of SEs, accuracy score (ACC), area under receiver operating characteristic curve (AUC), precision, recall, f1-score, and confusion matrix. The best-obtained SEs are applied on the original imbalanced dataset to see if the classification performance is the same as it was on balanced dataset variations. The results of the investigation showed that the proposed method generated SEs with high classification accuracy (0.9962) in malware software detection.
    Keywords genetic programming symbolic classifier ; 5-fold cross-validation ; malware software detection ; oversampling techniques ; random hyperparameter value search method ; Electronic computers. Computer science ; QA75.5-76.95
    Subject code 006
    Language English
    Publishing date 2023-11-01T00:00:00Z
    Publisher MDPI AG
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  2. Article ; Online: Automated Detection and Classification of Returnable Packaging Based on YOLOV4 Algorithm

    Matko Glučina / Sandi Baressi Šegota / Nikola Anđelić / Zlatan Car

    Applied Sciences, Vol 12, Iss 11131, p

    2022  Volume 11131

    Abstract: This article describes the implementation of the You Only Look Once (YOLO) detection algorithm for the detection of returnable packaging. The method of creating an original dataset and creating an augmented dataset is shown. The model was evaluated using ...

    Abstract This article describes the implementation of the You Only Look Once (YOLO) detection algorithm for the detection of returnable packaging. The method of creating an original dataset and creating an augmented dataset is shown. The model was evaluated using mean Average Precision (mAP), F1 <math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msub><mrow></mrow><mi>score</mi></msub></semantics></math> , Precision, Recall, Average Intersection over Union (Average IoU) score, and Average Loss. The training was conducted in four cycles, i.e., 6000, 8000, 10,000, and 20,000 max batches with three different activation functions Mish, ReLU, and Linear (used in 6000 and 8000 max batches). The influence train/test dataset ratio was also investigated. The conducted investigation showed that variation of hyperparameters (activation function and max batch sizes) have a significant influence on detection and classification accuracy with the best results obtained in the case of YOLO version 4 (YOLOV4) with the Mish activation function and max batch size of 20,000 that achieved the highest mAP of 99.96% and lowest average error of 0.3643.
    Keywords artificial intelligence algorithms ; automated system ; convolutional neural network ; computer vision ; YOLOV4 ; Technology ; T ; Engineering (General). Civil engineering (General) ; TA1-2040 ; Biology (General) ; QH301-705.5 ; Physics ; QC1-999 ; Chemistry ; QD1-999
    Subject code 006
    Language English
    Publishing date 2022-11-01T00:00:00Z
    Publisher MDPI AG
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  3. Article ; Online: The Development of Symbolic Expressions for the Detection of Hepatitis C Patients and the Disease Progression from Blood Parameters Using Genetic Programming-Symbolic Classification Algorithm

    Nikola Anđelić / Ivan Lorencin / Sandi Baressi Šegota / Zlatan Car

    Applied Sciences, Vol 13, Iss 1, p

    2022  Volume 574

    Abstract: Hepatitis C is an infectious disease which is caused by the Hepatitis C virus (HCV) and the virus primarily affects the liver. Based on the publicly available dataset used in this paper the idea is to develop a mathematical equation that could be used to ...

    Abstract Hepatitis C is an infectious disease which is caused by the Hepatitis C virus (HCV) and the virus primarily affects the liver. Based on the publicly available dataset used in this paper the idea is to develop a mathematical equation that could be used to detect HCV patients with high accuracy based on the enzymes, proteins, and biomarker values contained in a patient’s blood sample using genetic programming symbolic classification (GPSC) algorithm. Not only that, but the idea was also to obtain a mathematical equation that could detect the progress of the disease i.e., Hepatitis C, Fibrosis, and Cirrhosis using the GPSC algorithm. Since the original dataset was imbalanced (a large number of healthy patients versus a small number of Hepatitis C/Fibrosis/Cirrhosis patients) the dataset was balanced using random oversampling, SMOTE, ADSYN, and Borderline SMOTE methods. The symbolic expressions (mathematical equations) were obtained using the GPSC algorithm using a rigorous process of 5-fold cross-validation with a random hyperparameter search method which had to be developed for this problem. To evaluate each symbolic expression generated with GPSC the mean and standard deviation values of accuracy (ACC), the area under the receiver operating characteristic curve ( <semantics> A U C </semantics> ), precision, recall, and F1-score were obtained. In a simple binary case (healthy vs. Hepatitis C patients) the best case was achieved with a dataset balanced with the Borderline SMOTE method. The results are <semantics> A C C ¯ ± S D ( A C C ) </semantics> , <semantics> A U C ¯ ± S D ( A U C ) </semantics> , <semantics> P r e c i s i o n ¯ ± S D ( P r e c i s i o n ) </semantics> , <semantics> R e c a l l ¯ ± S D ( R e c a l l ) </semantics> , and <semantics> F 1 − s c o r e ¯ ± S D ( F 1 − s c o r e ) </semantics> equal to <semantics> 0.99 ± 5.8 × 10 − 3 </semantics> , <semantics> 0.99 ± 5.4 × 10 − 3 </semantics> , ...
    Keywords ADASYN ; borderline SMOTE ; genetic programming-symbolic classifier ; Hepatitis C ; fibrosis ; cirrhosis ; Technology ; T ; Engineering (General). Civil engineering (General) ; TA1-2040 ; Biology (General) ; QH301-705.5 ; Physics ; QC1-999 ; Chemistry ; QD1-999
    Subject code 511
    Language English
    Publishing date 2022-12-01T00:00:00Z
    Publisher MDPI AG
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  4. Article ; Online: Utilization of multilayer perceptron for determining the inverse kinematics of an industrial robotic manipulator

    Sandi Baressi Šegota / Nikola Anđelić / Vedran Mrzljak / Ivan Lorencin / Ivan Kuric / Zlatan Car

    International Journal of Advanced Robotic Systems, Vol

    2021  Volume 18

    Abstract: Inverse kinematic equations allow the determination of the joint angles necessary for the robotic manipulator to place a tool into a predefined position. Determining this equation is vital but a complex work. In this article, an artificial neural network, ...

    Abstract Inverse kinematic equations allow the determination of the joint angles necessary for the robotic manipulator to place a tool into a predefined position. Determining this equation is vital but a complex work. In this article, an artificial neural network, more specifically, a feed-forward type, multilayer perceptron (MLP), is trained, so that it could be used to calculate the inverse kinematics for a robotic manipulator. First, direct kinematics of a robotic manipulator are determined using Denavit–Hartenberg method and a dataset of 15,000 points is generated using the calculated homogenous transformation matrices. Following that, multiple MLPs are trained with 10,240 different hyperparameter combinations to find the best. Each trained MLP is evaluated using the R 2 and mean absolute error metrics and the architectures of the MLPs that achieved the best results are presented. Results show a successful regression for the first five joints (percentage error being less than 0.1%) but a comparatively poor regression for the final joint due to the configuration of the robotic manipulator.
    Keywords Electronics ; TK7800-8360 ; Electronic computers. Computer science ; QA75.5-76.95
    Subject code 629
    Language English
    Publishing date 2021-08-01T00:00:00Z
    Publisher SAGE Publishing
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  5. Article ; Online: Modeling the Spread of COVID-19 Infection Using a Multilayer Perceptron

    Zlatan Car / Sandi Baressi Šegota / Nikola Anđelić / Ivan Lorencin / Vedran Mrzljak

    Computational and Mathematical Methods in Medicine, Vol

    2020  Volume 2020

    Abstract: Coronavirus (COVID-19) is a highly infectious disease that has captured the attention of the worldwide public. Modeling of such diseases can be extremely important in the prediction of their impact. While classic, statistical, modeling can provide ... ...

    Abstract Coronavirus (COVID-19) is a highly infectious disease that has captured the attention of the worldwide public. Modeling of such diseases can be extremely important in the prediction of their impact. While classic, statistical, modeling can provide satisfactory models, it can also fail to comprehend the intricacies contained within the data. In this paper, authors use a publicly available dataset, containing information on infected, recovered, and deceased patients in 406 locations over 51 days (22nd January 2020 to 12th March 2020). This dataset, intended to be a time-series dataset, is transformed into a regression dataset and used in training a multilayer perceptron (MLP) artificial neural network (ANN). The aim of training is to achieve a worldwide model of the maximal number of patients across all locations in each time unit. Hyperparameters of the MLP are varied using a grid search algorithm, with a total of 5376 hyperparameter combinations. Using those combinations, a total of 48384 ANNs are trained (16128 for each patient group—deceased, recovered, and infected), and each model is evaluated using the coefficient of determination (R2). Cross-validation is performed using K-fold algorithm with 5-folds. Best models achieved consists of 4 hidden layers with 4 neurons in each of those layers, and use a ReLU activation function, with R2 scores of 0.98599 for confirmed, 0.99429 for deceased, and 0.97941 for recovered patient models. When cross-validation is performed, these scores drop to 0.94 for confirmed, 0.781 for recovered, and 0.986 for deceased patient models, showing high robustness of the deceased patient model, good robustness for confirmed, and low robustness for recovered patient model.
    Keywords Computer applications to medicine. Medical informatics ; R858-859.7 ; covid19
    Subject code 006
    Language English
    Publishing date 2020-01-01T00:00:00Z
    Publisher Hindawi Limited
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  6. Article ; Online: Impact of COVID-19 on Forecasting Stock Prices

    Daniel Štifanić / Jelena Musulin / Adrijana Miočević / Sandi Baressi Šegota / Roman Šubić / Zlatan Car

    Complexity, Vol

    An Integration of Stationary Wavelet Transform and Bidirectional Long Short-Term Memory

    2020  Volume 2020

    Abstract: COVID-19 is an infectious disease that mostly affects the respiratory system. At the time of this research being performed, there were more than 1.4 million cases of COVID-19, and one of the biggest anxieties is not just our health, but our livelihoods, ... ...

    Abstract COVID-19 is an infectious disease that mostly affects the respiratory system. At the time of this research being performed, there were more than 1.4 million cases of COVID-19, and one of the biggest anxieties is not just our health, but our livelihoods, too. In this research, authors investigate the impact of COVID-19 on the global economy, more specifically, the impact of COVID-19 on the financial movement of Crude Oil price and three US stock indexes: DJI, S&P 500, and NASDAQ Composite. The proposed system for predicting commodity and stock prices integrates the stationary wavelet transform (SWT) and bidirectional long short-term memory (BDLSTM) networks. Firstly, SWT is used to decompose the data into approximation and detail coefficients. After decomposition, data of Crude Oil price and stock market indexes along with COVID-19 confirmed cases were used as input variables for future price movement forecasting. As a result, the proposed system BDLSTM + WT-ADA achieved satisfactory results in terms of five-day Crude Oil price forecast.
    Keywords Electronic computers. Computer science ; QA75.5-76.95 ; covid19
    Subject code 330
    Language English
    Publishing date 2020-01-01T00:00:00Z
    Publisher Hindawi-Wiley
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  7. Article ; Online: Path planning optimization of six-degree-of-freedom robotic manipulators using evolutionary algorithms

    Sandi Baressi Šegota / Nikola Anđelić / Ivan Lorencin / Milan Saga / Zlatan Car

    International Journal of Advanced Robotic Systems, Vol

    2020  Volume 17

    Abstract: Lowering joint torques of a robotic manipulator enables lowering the energy it uses as well as increase in the longevity of the robotic manipulator. This article proposes the use of evolutionary computation algorithms for optimizing the paths of the ... ...

    Abstract Lowering joint torques of a robotic manipulator enables lowering the energy it uses as well as increase in the longevity of the robotic manipulator. This article proposes the use of evolutionary computation algorithms for optimizing the paths of the robotic manipulator with the goal of lowering the joint torques. The robotic manipulator used for optimization is modelled after a realistic six-degree-of-freedom robotic manipulator. Two cases are observed and these are a single robotic manipulator carrying a weight in a point-to-point trajectory and two robotic manipulators cooperating and moving the same weight along a calculated point-to-point trajectory. The article describes the process used for determining the kinematic properties using Denavit–Hartenberg method and the dynamic equations of the robotic manipulator using Lagrange–Euler and Newton–Euler algorithms. Then, the description of used artificial intelligence optimization algorithms is given – genetic algorithm using random and average recombination, simulated annealing using linear and geometric cooling strategy and differential evolution. The methods are compared and the results show that the genetic algorithm provides best results in regard to torque minimization, with differential evolution also providing comparatively good results and simulated annealing giving the comparatively weakest results while providing smoother torque curves.
    Keywords Electronics ; TK7800-8360 ; Electronic computers. Computer science ; QA75.5-76.95
    Subject code 629
    Language English
    Publishing date 2020-03-01T00:00:00Z
    Publisher SAGE Publishing
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  8. Article ; Online: On Urinary Bladder Cancer Diagnosis

    Ivan Lorencin / Sandi Baressi Šegota / Nikola Anđelić / Vedran Mrzljak / Tomislav Ćabov / Josip Španjol / Zlatan Car

    Biology, Vol 10, Iss 3, p

    Utilization of Deep Convolutional Generative Adversarial Networks for Data Augmentation

    2021  Volume 175

    Abstract: Urinary bladder cancer is one of the most common urinary tract cancers. Standard diagnosis procedure can be invasive and time-consuming. For these reasons, procedure called optical biopsy is introduced. This procedure allows in-vivo evaluation of bladder ...

    Abstract Urinary bladder cancer is one of the most common urinary tract cancers. Standard diagnosis procedure can be invasive and time-consuming. For these reasons, procedure called optical biopsy is introduced. This procedure allows in-vivo evaluation of bladder mucosa without the need for biopsy. Although less invasive and faster, accuracy is often lower. For this reason, machine learning (ML) algorithms are used to increase its accuracy. The issue with ML algorithms is their sensitivity to the amount of input data. In medicine, collection can be time-consuming due to a potentially low number of patients. For these reasons, data augmentation is performed, usually through a series of geometric variations of original images. While such images improve classification performance, the number of new data points and the insight they provide is limited. These issues are a motivation for the application of novel augmentation methods. Authors demonstrate the use of Deep Convolutional Generative Adversarial Networks (DCGAN) for the generation of images. Augmented datasets used for training of commonly used Convolutional Neural Network-based (CNN) architectures (AlexNet and VGG-16) show a significcan performance increase for AlexNet, where AUCmicro reaches values up to 0.99. Average and median results of networks used in grid-search increases. These results point towards the conclusion that GAN-based augmentation has decreased the networks sensitivity to hyperparemeter change.
    Keywords AlexNet ; data augmentation ; deep convolutional generative adversarial networks ; urinary bladder cancer ; VGG16 ; Biology (General) ; QH301-705.5
    Subject code 006
    Language English
    Publishing date 2021-02-01T00:00:00Z
    Publisher MDPI AG
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  9. Article ; Online: Epidemiological Predictive Modeling of COVID-19 Infection

    Tijana Šušteršič / Andjela Blagojević / Danijela Cvetković / Aleksandar Cvetković / Ivan Lorencin / Sandi Baressi Šegota / Dragan Milovanović / Dejan Baskić / Zlatan Car / Nenad Filipović

    Frontiers in Public Health, Vol

    Development, Testing, and Implementation on the Population of the Benelux Union

    2021  Volume 9

    Abstract: Since the outbreak of coronavirus disease-2019 (COVID-19), the whole world has taken interest in the mechanisms of its spread and development. Mathematical models have been valuable instruments for the study of the spread and control of infectious ... ...

    Abstract Since the outbreak of coronavirus disease-2019 (COVID-19), the whole world has taken interest in the mechanisms of its spread and development. Mathematical models have been valuable instruments for the study of the spread and control of infectious diseases. For that purpose, we propose a two-way approach in modeling COVID-19 spread: a susceptible, exposed, infected, recovered, deceased (SEIRD) model based on differential equations and a long short-term memory (LSTM) deep learning model. The SEIRD model is a compartmental epidemiological model with included components: susceptible, exposed, infected, recovered, deceased. In the case of the SEIRD model, official statistical data available online for countries of Belgium, Netherlands, and Luxembourg (Benelux) in the period of March 15 2020 to March 15 2021 were used. Based on them, we have calculated key parameters and forward them to the epidemiological model, which will predict the number of infected, deceased, and recovered people. Results show that the SEIRD model is able to accurately predict several peaks for all the three countries of interest, with very small root mean square error (RMSE), except for the mild cases (maximum RMSE was 240.79 ± 90.556), which can be explained by the fact that no official data were available for mild cases, but this number was derived from other statistics. On the other hand, LSTM represents a special kind of recurrent neural network structure that can comparatively learn long-term temporal dependencies. Results show that LSTM is capable of predicting several peaks based on the position of previous peaks with low values of RMSE. Higher values of RMSE are observed in the number of infected cases in Belgium (RMSE was 535.93) and Netherlands (RMSE was 434.28), and are expected because of thousands of people getting infected per day in those countries. In future studies, we will extend the models to include mobility information, variants of concern, as well as a medical intervention, etc. A prognostic model could help us predict ...
    Keywords COVID-19 ; disease spread modeling ; SEIRD model ; LSTM model ; epidemiological model ; Public aspects of medicine ; RA1-1270
    Language English
    Publishing date 2021-10-01T00:00:00Z
    Publisher Frontiers Media S.A.
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  10. Article ; Online: Automatic Evaluation of the Lung Condition of COVID-19 Patients Using X-ray Images and Convolutional Neural Networks

    Ivan Lorencin / Sandi Baressi Šegota / Nikola Anđelić / Anđela Blagojević / Tijana Šušteršić / Alen Protić / Miloš Arsenijević / Tomislav Ćabov / Nenad Filipović / Zlatan Car

    Journal of Personalized Medicine, Vol 11, Iss 28, p

    2021  Volume 28

    Abstract: COVID-19 represents one of the greatest challenges in modern history. Its impact is most noticeable in the health care system, mostly due to the accelerated and increased influx of patients with a more severe clinical picture. These facts are increasing ... ...

    Abstract COVID-19 represents one of the greatest challenges in modern history. Its impact is most noticeable in the health care system, mostly due to the accelerated and increased influx of patients with a more severe clinical picture. These facts are increasing the pressure on health systems. For this reason, the aim is to automate the process of diagnosis and treatment. The research presented in this article conducted an examination of the possibility of classifying the clinical picture of a patient using X-ray images and convolutional neural networks. The research was conducted on the dataset of 185 images that consists of four classes. Due to a lower amount of images, a data augmentation procedure was performed. In order to define the CNN architecture with highest classification performances, multiple CNNs were designed. Results show that the best classification performances can be achieved if ResNet152 is used. This CNN has achieved <math display="inline"><semantics><mover><mrow><mi>A</mi><mi>U</mi><msub><mi>C</mi><mrow><mi>m</mi><mi>a</mi><mi>c</mi><mi>r</mi><mi>o</mi></mrow></msub></mrow><mo>¯</mo></mover></semantics></math> and <math display="inline"><semantics><mover><mrow><mi>A</mi><mi>U</mi><msub><mi>C</mi><mrow><mi>m</mi><mi>i</mi><mi>c</mi><mi>r</mi><mi>o</mi></mrow></msub></mrow><mo>¯</mo></mover></semantics></math> up to 0.94, suggesting the possibility of applying CNN to the classification of the clinical picture of COVID-19 patients using an X-ray image of the lungs. When higher layers are frozen during the training procedure, higher <math ...<br />
    Keywords AlexNet ; convolutional neural network ; COVID-19 ; ResNet ; VGG-16 ; Medicine ; R
    Subject code 511
    Language English
    Publishing date 2021-01-01T00:00:00Z
    Publisher MDPI AG
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

To top