Mitigating Object Dependencies: Improving Point Cloud Self-Supervised Learning through Object Exchange

Wu, Yanhao; Zhang, Tong; Ke, Wei; Qiu, Congpei; Süsstrunk, Sabine; Salzmann, Mathieu

2024

Abstract

In point cloud scene understanding, particularly for indoor scenes, objects are arranged according to human habits, so objects of certain semantic classes sit close together and exhibit strong inter-object correlations. Neural networks can exploit these strong dependencies and bypass the patterns of individual objects. To address this challenge, we introduce a novel self-supervised learning (SSL) strategy that leverages both object patterns and contextual cues to produce robust features. We first formulate an object-exchanging strategy, in which pairs of objects of comparable size are exchanged across different scenes, effectively disentangling the strong contextual dependencies. We then introduce a context-aware feature learning strategy, which encodes object patterns without relying on their specific context by aggregating object features across various scenes. Extensive experiments demonstrate the superiority of our method over existing SSL techniques and show that it is more robust to environmental changes. Moreover, we showcase the applicability of our approach by transferring pre-trained models to diverse point cloud datasets.
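
The object-exchanging step described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not say how "comparable size" is measured or where an exchanged object is placed, so the bounding-box criterion, the `tol` threshold, the bottom-center anchoring, and all function names below are assumptions made for illustration. Each scene is assumed to be an (N, 3) point array with one instance id per point.

```python
import numpy as np

def bbox_extent(points):
    """Side lengths of the axis-aligned bounding box of an (N, 3) array."""
    return points.max(axis=0) - points.min(axis=0)

def comparable_size(obj_a, obj_b, tol=0.25):
    """Hypothetical criterion: each box side differs by at most `tol` (relative)."""
    ext_a, ext_b = bbox_extent(obj_a), bbox_extent(obj_b)
    return bool(np.all(np.abs(ext_a - ext_b) <= tol * np.maximum(ext_a, ext_b)))

def anchor(points):
    """Bottom-center of the bounding box: x/y midpoint and minimum z."""
    lo, hi = points.min(axis=0), points.max(axis=0)
    return np.array([(lo[0] + hi[0]) / 2, (lo[1] + hi[1]) / 2, lo[2]])

def exchange_objects(scene_a, ids_a, obj_id_a, scene_b, ids_b, obj_id_b):
    """Swap one object between two scenes (assumed placement rule).

    Each object is translated so that its anchor lands where the other
    object stood. Returns the two new point arrays, with the remaining
    points of each scene first and the inserted object last.
    """
    mask_a, mask_b = ids_a == obj_id_a, ids_b == obj_id_b
    obj_a, obj_b = scene_a[mask_a], scene_b[mask_b]
    moved_a = obj_a - anchor(obj_a) + anchor(obj_b)  # object A into scene B
    moved_b = obj_b - anchor(obj_b) + anchor(obj_a)  # object B into scene A
    new_a = np.concatenate([scene_a[~mask_a], moved_b])
    new_b = np.concatenate([scene_b[~mask_b], moved_a])
    return new_a, new_b
```

In a pre-training pipeline one would presumably call `comparable_size` on candidate object pairs first and only exchange the pairs that pass; per-point features such as color, and collision handling with the surrounding scene, are left out of this sketch.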

Details

Title Mitigating Object Dependencies: Improving Point Cloud Self-Supervised Learning through Object Exchange

Author(s) Wu, Yanhao ; Zhang, Tong ; Ke, Wei ; Qiu, Congpei ; Süsstrunk, Sabine ; Salzmann, Mathieu

Conference Computer Vision and Pattern Recognition (CVPR), Seattle, USA, June 17-21, 2024

Date 2024-04-11

Keywords

point clouds; self-supervised learning

Additional link https://cvpr.thecvf.com/

Laboratories IVRL
CVLAB

Record Appears in Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > IVRL - Image and Visual Representation Laboratory
Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > CVLAB - Computer Vision Laboratory
Scientific production and competences > Euler Center for Signal Processing
Peer-reviewed publications
Conference Papers
Work produced at EPFL

Record creation date 2024-04-11
