LIVIVO - The Search Portal for Life Sciences

zur deutschen Oberfläche wechseln
Advanced search

Search results

Result 1 - 10 of total 16

Search options

  1. Article ; Online: A Multi-Scale Cross-Fusion Medical Image Segmentation Network Based on Dual-Attention Mechanism Transformer

    Jianguo Cui / Liejun Wang / Shaochen Jiang

    Applied Sciences, Vol 13, Iss 10881, p

    2023  Volume 10881

    Abstract: The U-net network, with its simple and powerful encoder–decoder structure, dominates the field of medical image segmentation. However, convolution operations are limited by receptive fields. They do not have the ability to model long-range dependencies, ... ...

    Abstract The U-net network, with its simple and powerful encoder–decoder structure, dominates the field of medical image segmentation. However, convolution operations are limited by receptive fields. They do not have the ability to model long-range dependencies, but Transformer has the capability of long-term modeling thanks to its core self-attention mechanism, which has been widely applied in the field of medical image segmentation. However, both CNNs and Transformer can only perform correlation calculations for a single sample, ignoring the correlation between different samples. To address these problems, we propose a new Transformer, which we call the Dual-Attention Transformer (DAT). This module captures correlations within a single sample while also learning correlations between different samples. The current U-net and some of its variant models have the problem of inadequate feature fusion, so we also improve the skip connection to strengthen the association between feature maps at different scales, reduce the semantic gap between the encoder and decoder, and further improve the segmentation performance. We refer to this structure as DATUnet. We conducted extensive experiments on the Synapse and ACDC datasets to validate the superior performance of our network, and we achieved an average DSC (%) of 83.6 and 90.9 and an average HD95 of 13.99 and 1.466 for the Synapse and ACDC datasets, respectively.
    Keywords medical image segmentation ; Transformer ; convolutional neural network ; Technology ; T ; Engineering (General). Civil engineering (General) ; TA1-2040 ; Biology (General) ; QH301-705.5 ; Physics ; QC1-999 ; Chemistry ; QD1-999
    Subject code 006
    Language English
    Publishing date 2023-09-01T00:00:00Z
    Publisher MDPI AG
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  2. Article ; Online: TC-Net

    Yuying Dong / Liejun Wang / Yongming Li

    PLoS ONE, Vol 17, Iss

    Dual coding network of Transformer and CNN for skin lesion segmentation

    2022  Volume 11

    Abstract: Skin lesion segmentation has become an essential recent direction in machine learning for medical applications. In a deep learning segmentation network, the convolutional neural network (CNN) uses convolution to capture local information for modeling. ... ...

    Abstract Skin lesion segmentation has become an essential recent direction in machine learning for medical applications. In a deep learning segmentation network, the convolutional neural network (CNN) uses convolution to capture local information for modeling. However, it ignores the relationship between pixels and still can not meet the precise segmentation requirements of some complex low contrast datasets. Transformer performs well in modeling global feature information, but their ability to extract fine-grained local feature patterns is weak. In this work, The dual coding fusion network architecture Transformer and CNN (TC-Net), as an architecture that can more accurately combine local feature information and global feature information, can improve the segmentation performance of skin images. The results of this work demonstrate that the combination of CNN and Transformer brings very significant improvement in global segmentation performance and allows outperformance as compared to the pure single network model. The experimental results and visual analysis of these three datasets quantitatively and qualitatively illustrate the robustness of TC-Net. Compared with Swin UNet, on the ISIC2018 dataset, it has increased by 2.46% in the dice index and about 4% in the JA index. On the ISBI2017 dataset, the dice and JA indices rose by about 4%.
    Keywords Medicine ; R ; Science ; Q
    Subject code 006
    Language English
    Publishing date 2022-01-01T00:00:00Z
    Publisher Public Library of Science (PLoS)
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  3. Article ; Online: Composed query image retrieval based on triangle area triple loss function and combining CNN with transformer

    Zhiwei Zhang / Liejun Wang / Shuli Cheng

    Scientific Reports, Vol 12, Iss 1, Pp 1-

    2022  Volume 12

    Abstract: Abstract The existing typical combined query image retrieval methods adopt Euclidean distance as sample distance measurement method, and the model trained by triple loss function blindly pursues absolute distance between samples, resulting in ... ...

    Abstract Abstract The existing typical combined query image retrieval methods adopt Euclidean distance as sample distance measurement method, and the model trained by triple loss function blindly pursues absolute distance between samples, resulting in unsatisfactory image retrieval performance. Meanwhile, these methods singularly adopt Convolutional Neural Network (CNN) to extract reference image features. However, receptive field of convolution operation has the characteristics of locality, which is easy to cause the loss of edge feature information of reference images. In view of shortcomings of these methods, the following improvements are proposed in this paper: (1) We propose Triangle Area Triple Loss Function (TATLF), which adopts Triangle Area (TA) as measurement of sample distance. TA comprehensively considers the absolute distance and included angle between samples, so that the trained model has better retrieval performance; (2) We combine CNN with Transformer to simultaneously extract local and edge features of reference images, which can effectively reduce the loss of reference images information. Specifically, CNN is adopted to extract local feature information of reference images. Transformer is used to pay attention to the edge feature information of reference images. Extensive experiments on two public datasets, Fashion200k and MIT-States, confirm the excellent performance of our proposed method.
    Keywords Medicine ; R ; Science ; Q
    Subject code 006
    Language English
    Publishing date 2022-12-01T00:00:00Z
    Publisher Nature Portfolio
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  4. Article ; Online: Deep parameter-free attention hashing for image retrieval

    Wenjing Yang / Liejun Wang / Shuli Cheng

    Scientific Reports, Vol 12, Iss 1, Pp 1-

    2022  Volume 20

    Abstract: Abstract Deep hashing method is widely applied in the field of image retrieval because of its advantages of low storage consumption and fast retrieval speed. There is a defect of insufficiency feature extraction when existing deep hashing method uses the ...

    Abstract Abstract Deep hashing method is widely applied in the field of image retrieval because of its advantages of low storage consumption and fast retrieval speed. There is a defect of insufficiency feature extraction when existing deep hashing method uses the convolutional neural network (CNN) to extract images semantic features. Some studies propose to add channel-based or spatial-based attention modules. However, embedding these modules into the network can increase the complexity of model and lead to over fitting in the training process. In this study, a novel deep parameter-free attention hashing (DPFAH) is proposed to solve these problems, that designs a parameter-free attention (PFA) module in ResNet18 network. PFA is a lightweight module that defines an energy function to measure the importance of each neuron and infers 3-D attention weights for feature map in a layer. A fast closed-form solution for this energy function proves that the PFA module does not add any parameters to the network. Otherwise, this paper designs a novel hashing framework that includes the hash codes learning branch and the classification branch to explore more label information. The like-binary codes are constrained by a regulation term to reduce the quantization error in the continuous relaxation. Experiments on CIFAR-10, NUS-WIDE and Imagenet-100 show that DPFAH method achieves better performance.
    Keywords Medicine ; R ; Science ; Q
    Subject code 005
    Language English
    Publishing date 2022-04-01T00:00:00Z
    Publisher Nature Portfolio
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  5. Article ; Online: Optical Flow-Aware-Based Multi-Modal Fusion Network for Violence Detection

    Yang Xiao / Guxue Gao / Liejun Wang / Huicheng Lai

    Entropy, Vol 24, Iss 7, p

    2022  Volume 939

    Abstract: Violence detection aims to locate violent content in video frames. Improving the accuracy of violence detection is of great importance for security. However, the current methods do not make full use of the multi-modal vision and audio information, which ... ...

    Abstract Violence detection aims to locate violent content in video frames. Improving the accuracy of violence detection is of great importance for security. However, the current methods do not make full use of the multi-modal vision and audio information, which affects the accuracy of violence detection. We found that the violence detection accuracy of different kinds of videos is related to the change of optical flow. With this in mind, we propose an optical flow-aware-based multi-modal fusion network (OAMFN) for violence detection. Specifically, we use three different fusion strategies to fully integrate multi-modal features. First, the main branch concatenates RGB features and audio features and the optical flow branch concatenates optical flow features with RGB features and audio features, respectively. Then, the cross-modal information fusion module integrates the features of different combinations and applies weights to them to capture cross-modal information in audio and video. After that, the channel attention module extracts valuable information by weighting the integration features. Furthermore, an optical flow-aware-based score fusion strategy is introduced to fuse features of different modalities from two branches. Compared with methods on the XD-Violence dataset, our multi-modal fusion network yields APs that are 83.09% and 1.4% higher than those of the state-of-the-art methods in offline detection, and 78.09% and 4.42% higher than those of the state-of-the-art methods in online detection.
    Keywords violence detection ; multi-modal fusion ; adaptive fusion ; optical flow-aware ; Science ; Q ; Astrophysics ; QB460-466 ; Physics ; QC1-999
    Subject code 621 ; 004
    Language English
    Publishing date 2022-07-01T00:00:00Z
    Publisher MDPI AG
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  6. Article ; Online: MHANet

    Lianghui Xu / Liejun Wang / Shuli Cheng / Yongming Li

    PLoS ONE, Vol 16, Iss 12, p e

    A hybrid attention mechanism for retinal diseases classification.

    2021  Volume 0261285

    Abstract: With the increase of patients with retinopathy, retinopathy recognition has become a research hotspot. In this article, we describe the etiology and symptoms of three kinds of retinal diseases, including drusen(DRUSEN), choroidal neovascularization(CNV) ... ...

    Abstract With the increase of patients with retinopathy, retinopathy recognition has become a research hotspot. In this article, we describe the etiology and symptoms of three kinds of retinal diseases, including drusen(DRUSEN), choroidal neovascularization(CNV) and diabetic macular edema(DME). In addition, we also propose a hybrid attention mechanism to classify and recognize different types of retinopathy images. In particular, the hybrid attention mechanism proposed in this paper includes parallel spatial attention mechanism and channel attention mechanism. It can extract the key features in the channel dimension and spatial dimension of retinopathy images, and reduce the negative impact of background information on classification results. The experimental results show that the hybrid attention mechanism proposed in this paper can better assist the network to focus on extracting thr fetures of the retinopathy area and enhance the adaptability to the differences of different data sets. Finally, the hybrid attention mechanism achieved 96.5% and 99.76% classification accuracy on two public OCT data sets of retinopathy, respectively.
    Keywords Medicine ; R ; Science ; Q
    Language English
    Publishing date 2021-01-01T00:00:00Z
    Publisher Public Library of Science (PLoS)
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  7. Article ; Online: HDC-Net

    Xiaolong Hu / Liejun Wang / Shuli Cheng / Yongming Li

    PLoS ONE, Vol 16, Iss 9, p e

    A hierarchical dilation convolutional network for retinal vessel segmentation.

    2021  Volume 0257013

    Abstract: The cardinal symptoms of some ophthalmic diseases observed through exceptional retinal blood vessels, such as retinal vein occlusion, diabetic retinopathy, etc. The advanced deep learning models used to obtain morphological and structural information of ... ...

    Abstract The cardinal symptoms of some ophthalmic diseases observed through exceptional retinal blood vessels, such as retinal vein occlusion, diabetic retinopathy, etc. The advanced deep learning models used to obtain morphological and structural information of blood vessels automatically are conducive to the early treatment and initiative prevention of ophthalmic diseases. In our work, we propose a hierarchical dilation convolutional network (HDC-Net) to extract retinal vessels in a pixel-to-pixel manner. It utilizes the hierarchical dilation convolution (HDC) module to capture the fragile retinal blood vessels usually neglected by other methods. An improved residual dual efficient channel attention (RDECA) module can infer more delicate channel information to reinforce the discriminative capability of the model. The structured Dropblock can help our HDC-Net model to solve the network overfitting effectively. From a holistic perspective, the segmentation results obtained by HDC-Net are superior to other deep learning methods on three acknowledged datasets (DRIVE, CHASE-DB1, STARE), the sensitivity, specificity, accuracy, f1-score and AUC score are {0.8252, 0.9829, 0.9692, 0.8239, 0.9871}, {0.8227, 0.9853, 0.9745, 0.8113, 0.9884}, and {0.8369, 0.9866, 0.9751, 0.8385, 0.9913}, respectively. It surpasses most other advanced retinal vessel segmentation models. Qualitative and quantitative analysis demonstrates that HDC-Net can fulfill the task of retinal vessel segmentation efficiently and accurately.
    Keywords Medicine ; R ; Science ; Q
    Subject code 006
    Language English
    Publishing date 2021-01-01T00:00:00Z
    Publisher Public Library of Science (PLoS)
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  8. Article ; Online: Image Reconstruction Based on Progressive Multistage Distillation Convolution Neural Network

    Yuxi Cai / Guxue Gao / Zhenhong Jia / Liejun Wang / Huicheng Lai

    Computational Intelligence and Neuroscience, Vol

    2022  Volume 2022

    Abstract: To address the problem that some current algorithms suffer from the loss of some important features due to rough feature distillation and the loss of key information in some channels due to compressed channel attention in the network, we propose a ... ...

    Abstract To address the problem that some current algorithms suffer from the loss of some important features due to rough feature distillation and the loss of key information in some channels due to compressed channel attention in the network, we propose a progressive multistage distillation network that gradually refines the features in stages to obtain the maximum amount of key feature information in them. In addition, to maximize the network performance, we propose a weight-sharing information lossless attention block to enhance the channel characteristics through a weight-sharing auxiliary path and, at the same time, use convolution layers to model the interchannel dependencies without compression, effectively avoiding the previous problem of information loss in channel attention. Extensive experiments on several benchmark data sets show that the algorithm in this paper achieves a good balance between network performance, the number of parameters, and computational complexity and achieves highly competitive performance in both objective metrics and subjective vision, which indicates the advantages of this paper’s algorithm for image reconstruction. It can be seen that this gradual feature distillation from coarse to fine is effective in improving network performance. Our code is available at the following link: https://github.com/Cai631/PMDN.
    Keywords Computer applications to medicine. Medical informatics ; R858-859.7 ; Neurosciences. Biological psychiatry. Neuropsychiatry ; RC321-571
    Subject code 006
    Language English
    Publishing date 2022-01-01T00:00:00Z
    Publisher Hindawi Limited
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  9. Article ; Online: Asymmetric coordinate attention spectral-spatial feature fusion network for hyperspectral image classification

    Shuli Cheng / Liejun Wang / Anyu Du

    Scientific Reports, Vol 11, Iss 1, Pp 1-

    2021  Volume 17

    Abstract: Abstract In recent years, the hyperspectral classification algorithm based on deep learning has received widespread attention, but the existing network models have higher model complexity and require more time consumption. In order to further improve the ...

    Abstract Abstract In recent years, the hyperspectral classification algorithm based on deep learning has received widespread attention, but the existing network models have higher model complexity and require more time consumption. In order to further improve the accuracy of hyperspectral image classification and reduce model complexity, this paper proposes an asymmetric coordinate attention spectral-spatial feature fusion network (ACAS2F2N) to capture distinguishing hyperspectral features. Specifically, adaptive asymmetric iterative attention was proposed to obtain the discriminative spectral-spatial features. Different from the common feature fusion method, this feature fusion method can adapt to most skip connection tasks. In addition, there is no manual parameter setting. Coordinate attention is used to obtain accurate coordinate information and channel relationship. The strip pooling module was introduced to increase the network’s receptive field and avoid irrelevant information brought by conventional convolution kernels. The proposed algorithm is tested on the mainstream hyperspectral datasets (IP, KSC, and Botswana), experimental results show that the proposed ACAS2F2N can achieve state-of-the-art performance with lower time complexity.
    Keywords Medicine ; R ; Science ; Q
    Subject code 006
    Language English
    Publishing date 2021-08-01T00:00:00Z
    Publisher Nature Portfolio
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  10. Article ; Online: Extracting user influence from ratings and trust for rating prediction in recommendations

    Wenchuan Shi / Liejun Wang / Jiwei Qin

    Scientific Reports, Vol 10, Iss 1, Pp 1-

    2020  Volume 13

    Abstract: Abstract The Collaborative Filtering (CF) algorithm based on trust has been the main method used to solve the cold start problem in Recommendation Systems (RSs) for the past few years. Nevertheless, the current trust-based CF algorithm ignores the ... ...

    Abstract Abstract The Collaborative Filtering (CF) algorithm based on trust has been the main method used to solve the cold start problem in Recommendation Systems (RSs) for the past few years. Nevertheless, the current trust-based CF algorithm ignores the implicit influence contained in the ratings and trust data. In this paper, we propose a new rating prediction model named the Rating-Trust-based Recommendation Model (RTRM) to explore the influence of internal factors among the users. The proposed user internal factors include the user reliability and popularity. The internal factors derived from the explicit behavior data (ratings and trust), which can help us understand the user better and model the user more accurately. In addition, we incorporate the proposed internal factors into the Singular Value Decomposition Plus Plus (SVD + +) model to perform the rating prediction task. Experimental studies on two common datasets show that utilizing ratings and trust data simultaneously to mine the factors that influence the relationships among different users can improve the accuracy of rating prediction and effectively relieve the cold start problem.
    Keywords Medicine ; R ; Science ; Q
    Language English
    Publishing date 2020-08-01T00:00:00Z
    Publisher Nature Publishing Group
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

To top