LIVIVO - Search results

Search result

Results 1 - 10 of 13 in total

Article ; Online: Ultra Fast Deep Lane Detection With Hybrid Anchor Driven Ordinal Classification.

Qin, Zequn / Zhang, Pengyi / Li, Xi

IEEE transactions on pattern analysis and machine intelligence

2024 Volume 46, Issue 5, Pages 2555–2568

Abstract	Modern methods mainly regard lane detection as a problem of pixel-wise segmentation, which is struggling to address the problems of efficiency and challenging scenarios like severe occlusions and extreme lighting conditions. Inspired by human perception, the recognition of lanes under severe occlusions and extreme lighting conditions is mainly based on contextual and global information. Motivated by this observation, we propose a novel, simple, yet effective formulation aiming at ultra fast speed and the problem of challenging scenarios. Specifically, we treat the process of lane detection as an anchor-driven ordinal classification problem using global features. First, we represent lanes with sparse coordinates on a series of hybrid (row and column) anchors. With the help of the anchor-driven representation, we then reformulate the lane detection task as an ordinal classification problem to get the coordinates of lanes. Our method could significantly reduce the computational cost with the anchor-driven representation. Using the large receptive field property of the ordinal classification formulation, we could also handle challenging scenarios. Extensive experiments on four lane detection datasets show that our method could achieve state-of-the-art performance in terms of both speed and accuracy. A lightweight version could even achieve 300+ frames per second (FPS). Our code is at https://github.com/cfzd/Ultra-Fast-Lane-Detection-v2.
Language	English
Publication date	2024-04-03
Country of publication	United States
Document type	Journal Article
ISSN	1939-3539
ISSN (online)	1939-3539
DOI	10.1109/TPAMI.2022.3182097
Data source	MEDical Literature Analysis and Retrieval System OnLINE

Article ; Online: GaitMPL: Gait Recognition With Memory-Augmented Progressive Learning.

Dou, Huanzhang / Zhang, Pengyi / Zhao, Yuhan / Dong, Lin / Qin, Zequn / Li, Xi

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

2024 Volume 33, Pages 1464–1475

Abstract	Gait recognition aims at identifying the pedestrians at a long distance by their biometric gait patterns. It is inherently challenging due to the various covariates and the properties of silhouettes (textureless and colorless), which result in two kinds of pair-wise hard samples: the same pedestrian could have distinct silhouettes (intra-class diversity) and different pedestrians could have similar silhouettes (inter-class similarity). In this work, we propose to solve the hard sample issue with a Memory-augmented Progressive Learning network (GaitMPL), including Dynamic Reweighting Progressive Learning module (DRPL) and Global Structure-Aligned Memory bank (GSAM). Specifically, DRPL reduces the learning difficulty of hard samples by easy-to-hard progressive learning. GSAM further augments DRPL with a structure-aligned memory mechanism, which maintains and models the feature distribution of each ID. Experiments on two commonly used datasets, CASIA-B and OU-MVLP, demonstrate the effectiveness of GaitMPL. On CASIA-B, we achieve the state-of-the-art performance, i.e., 88.0% on the most challenging condition (Clothing) and 93.3% on the average condition, which outperforms the other methods by at least 3.8% and 1.4%, respectively. Code will be available at https://github.com/WhiteDOU/GaitMPL.
MeSH term(s)	Humans ; Gait ; Learning ; Pedestrians
Language	English
Publication date	2024-02-23
Country of publication	United States
Document type	Journal Article
ISSN	1941-0042
ISSN (online)	1941-0042
DOI	10.1109/TIP.2022.3164543
Data source	MEDical Literature Analysis and Retrieval System OnLINE

Book ; Online: MonoGround

Qin, Zequn / Li, Xi

Detecting Monocular 3D Objects from the Ground

2022

Abstract	Monocular 3D object detection has attracted great attention for its advantages in simplicity and cost. Due to the ill-posed 2D to 3D mapping essence from the monocular imaging process, monocular 3D object detection suffers from inaccurate depth estimation and thus has poor 3D detection results. To alleviate this problem, we propose to introduce the ground plane as a prior in the monocular 3D object detection. The ground plane prior serves as an additional geometric condition to the ill-posed mapping and an extra source in depth estimation. In this way, we can get a more accurate depth estimation from the ground. Meanwhile, to take full advantage of the ground plane prior, we propose a depth-align training strategy and a precise two-stage depth inference method tailored for the ground plane prior. It is worth noting that the introduced ground plane prior requires no extra data sources like LiDAR, stereo images, and depth information. Extensive experiments on the KITTI benchmark show that our method could achieve state-of-the-art results compared with other methods while maintaining a very fast speed. Our code and models are available at https://github.com/cfzd/MonoGround. Comment: CVPR22
Keywords	Computer Science - Computer Vision and Pattern Recognition
Subject/category (code)	004
Publication date	2022-06-15
Country of publication	us
Document type	Book ; Online
Data source	BASE - Bielefeld Academic Search Engine (life sciences selection)

Full text online

Article ; Online: CoDiNet: Path Distribution Modeling with Consistency and Diversity for Dynamic Routing.

Wang, Huanyu / Qin, Zequn / Li, Songyuan / Li, Xi

IEEE transactions on pattern analysis and machine intelligence

2021 Volume PP

Abstract	Dynamic routing networks, aimed at finding the best routing paths in the networks, have achieved significant improvements to neural networks in terms of accuracy and efficiency. In this paper, we see dynamic routing networks in a fresh light, formulating a routing method as a mapping from a sample space to a routing space. From the perspective of space mapping, prevalent methods of dynamic routing did not take into account how inference paths would be distributed in the routing space. Thus, we propose a novel method, termed CoDiNet, to model the relationship between a sample space and a routing space by regularizing the distribution of routing paths with the properties of consistency and diversity. Specifically, samples with similar semantics should be mapped into the same area in routing space, while those with dissimilar semantics should be mapped into different areas. Moreover, we design a customizable dynamic routing module, which can strike a balance between accuracy and efficiency. When deployed upon ResNet models, our method achieves higher performance and effectively reduces average computational cost on four widely used datasets.
Language	English
Publication date	2021-05-28
Country of publication	United States
Document type	Journal Article
ISSN	1939-3539
ISSN (online)	1939-3539
DOI	10.1109/TPAMI.2021.3084680
Data source	MEDical Literature Analysis and Retrieval System OnLINE

Article ; Online: Multitask Attention Network for Lane Detection and Fitting.

Wang, Qi / Han, Tao / Qin, Zequn / Gao, Junyu / Li, Xuelong

IEEE transactions on neural networks and learning systems

2022 Volume 33, Issue 3, Pages 1066–1078

Abstract	Many CNN-based segmentation methods have been applied in lane marking detection recently and gain excellent success for a strong ability in modeling semantic information. Although the accuracy of lane line prediction is getting better and better, lane markings' localization ability is relatively weak, especially when the lane marking point is remote. Traditional lane detection methods usually utilize highly specialized handcrafted features and carefully designed postprocessing to detect the lanes. However, these methods are based on strong assumptions and, thus, are prone to scalability issues. In this work, we propose a novel multitask method that: 1) integrates the ability to model semantic information of CNN and the strong localization ability provided by handcrafted features and 2) predicts the position of vanishing line. A novel lane fitting method based on vanishing line prediction is also proposed for sharp curves and nonflat road in this article. By integrating segmentation, specialized handcrafted features, and fitting, the accuracy of location and the convergence speed of networks are improved. Extensive experimental results on four lane marking detection data sets show that our method achieves state-of-the-art performance.
Language	English
Publication date	2022-02-28
Country of publication	United States
Document type	Journal Article
ISSN	2162-2388
ISSN (online)	2162-2388
DOI	10.1109/TNNLS.2020.3039675
Data source	MEDical Literature Analysis and Retrieval System OnLINE

Book ; Online: Ultra Fast Deep Lane Detection with Hybrid Anchor Driven Ordinal Classification

Qin, Zequn / Zhang, Pengyi / Li, Xi

2022

Abstract	Modern methods mainly regard lane detection as a problem of pixel-wise segmentation, which is struggling to address the problems of efficiency and challenging scenarios like severe occlusions and extreme lighting conditions. Inspired by human perception, the recognition of lanes under severe occlusions and extreme lighting conditions is mainly based on contextual and global information. Motivated by this observation, we propose a novel, simple, yet effective formulation aiming at ultra fast speed and the problem of challenging scenarios. Specifically, we treat the process of lane detection as an anchor-driven ordinal classification problem using global features. First, we represent lanes with sparse coordinates on a series of hybrid (row and column) anchors. With the help of the anchor-driven representation, we then reformulate the lane detection task as an ordinal classification problem to get the coordinates of lanes. Our method could significantly reduce the computational cost with the anchor-driven representation. Using the large receptive field property of the ordinal classification formulation, we could also handle challenging scenarios. Extensive experiments on four lane detection datasets show that our method could achieve state-of-the-art performance in terms of both speed and accuracy. A lightweight version could even achieve 300+ frames per second (FPS). Our code is at https://github.com/cfzd/Ultra-Fast-Lane-Detection-v2. Comment: TPAMI 2022
Keywords	Computer Science - Computer Vision and Pattern Recognition
Subject/category (code)	006 ; 004
Publication date	2022-06-15
Country of publication	us
Document type	Book ; Online
Data source	BASE - Bielefeld Academic Search Engine (life sciences selection)

Full text online

Book ; Online: UniFormer

Qin, Zequn / Chen, Jingyu / Chen, Chao / Chen, Xiaozhi / Li, Xi

Unified Multi-view Fusion Transformer for Spatial-Temporal Representation in Bird's-Eye-View

2022

Abstract	Bird's eye view (BEV) representation is a new perception formulation for autonomous driving, which is based on spatial fusion. Further, temporal fusion is also introduced in BEV representation and gains great success. In this work, we propose a new method that unifies both spatial and temporal fusion and merges them into a unified mathematical formulation. The unified fusion not only provides a new perspective on BEV fusion but also brings new capabilities. With the proposed unified spatial-temporal fusion, our method could support long-range fusion, which is hard to achieve in conventional BEV methods. Moreover, the BEV fusion in our work is temporal-adaptive, and the weights of temporal fusion are learnable. In contrast, conventional methods mainly use fixed and equal weights for temporal fusion. Besides, the proposed unified fusion could avoid the information loss of conventional BEV fusion methods and make full use of features. Extensive experiments and ablation studies on the NuScenes dataset show the effectiveness of the proposed method and our method gains the state-of-the-art performance in the map segmentation task.
Keywords	Computer Science - Computer Vision and Pattern Recognition
Subject/category (code)	006
Publication date	2022-07-18
Country of publication	us
Document type	Book ; Online
Data source	BASE - Bielefeld Academic Search Engine (life sciences selection)

Full text online

Book ; Online: GaitMPL

Dou, Huanzhang / Zhang, Pengyi / Zhao, Yuhan / Dong, Lin / Qin, Zequn / Li, Xi

Gait Recognition with Memory-Augmented Progressive Learning

2023

Abstract	Gait recognition aims at identifying the pedestrians at a long distance by their biometric gait patterns. It is inherently challenging due to the various covariates and the properties of silhouettes (textureless and colorless), which result in two kinds of pair-wise hard samples: the same pedestrian could have distinct silhouettes (intra-class diversity) and different pedestrians could have similar silhouettes (inter-class similarity). In this work, we propose to solve the hard sample issue with a Memory-augmented Progressive Learning network (GaitMPL), including Dynamic Reweighting Progressive Learning module (DRPL) and Global Structure-Aligned Memory bank (GSAM). Specifically, DRPL reduces the learning difficulty of hard samples by easy-to-hard progressive learning. GSAM further augments DRPL with a structure-aligned memory mechanism, which maintains and models the feature distribution of each ID. Experiments on two commonly used datasets, CASIA-B and OU-MVLP, demonstrate the effectiveness of GaitMPL. On CASIA-B, we achieve the state-of-the-art performance, i.e., 88.0% on the most challenging condition (Clothing) and 93.3% on the average condition, which outperforms the other methods by at least 3.8% and 1.4%, respectively. Comment: Accepted by TIP2022
Keywords	Computer Science - Computer Vision and Pattern Recognition
Subject/category (code)	006
Publication date	2023-06-06
Country of publication	us
Document type	Book ; Online
Data source	BASE - Bielefeld Academic Search Engine (life sciences selection)

Full text online

Book ; Online: Ultra Fast Structure-aware Deep Lane Detection

Qin, Zequn / Wang, Huanyu / Li, Xi

2020

Abstract	Modern methods mainly regard lane detection as a problem of pixel-wise segmentation, which is struggling to address the problem of challenging scenarios and speed. Inspired by human perception, the recognition of lanes under severe occlusion and extreme lighting conditions is mainly based on contextual and global information. Motivated by this observation, we propose a novel, simple, yet effective formulation aiming at extremely fast speed and challenging scenarios. Specifically, we treat the process of lane detection as a row-based selecting problem using global features. With the help of row-based selecting, our formulation could significantly reduce the computational cost. Using a large receptive field on global features, we could also handle the challenging scenarios. Moreover, based on the formulation, we also propose a structural loss to explicitly model the structure of lanes. Extensive experiments on two lane detection benchmark datasets show that our method could achieve the state-of-the-art performance in terms of both speed and accuracy. A light-weight version could even achieve 300+ frames per second with the same resolution, which is at least 4x faster than previous state-of-the-art methods. Our code will be made publicly available. Comment: ECCV 2020
Keywords	Computer Science - Computer Vision and Pattern Recognition
Subject/category (code)	006
Publication date	2020-04-24
Country of publication	us
Document type	Book ; Online
Data source	BASE - Bielefeld Academic Search Engine (life sciences selection)

Full text online

Book ; Online: CoDiNet

Wang, Huanyu / Qin, Zequn / Li, Songyuan / Li, Xi

Path Distribution Modeling with Consistency and Diversity for Dynamic Routing

2020

Abstract	Dynamic routing networks, aimed at finding the best routing paths in the networks, have achieved significant improvements to neural networks in terms of accuracy and efficiency. In this paper, we see dynamic routing networks in a fresh light, formulating a routing method as a mapping from a sample space to a routing space. From the perspective of space mapping, prevalent methods of dynamic routing didn't consider how inference paths would be distributed in the routing space. Thus, we propose a novel method, termed CoDiNet, to model the relationship between a sample space and a routing space by regularizing the distribution of routing paths with the properties of consistency and diversity. Specifically, samples with similar semantics should be mapped into the same area in routing space, while those with dissimilar semantics should be mapped into different areas. Moreover, we design a customizable dynamic routing module, which can strike a balance between accuracy and efficiency. When deployed upon ResNet models, our method achieves higher performance and effectively reduces average computational cost on four widely used datasets.
Keywords	Computer Science - Computer Vision and Pattern Recognition
Subject/category (code)	629
Publication date	2020-05-29
Country of publication	us
Document type	Book ; Online
Data source	BASE - Bielefeld Academic Search Engine (life sciences selection)

Full text online
