Hierarchical Shape Pruning for 3D Sparse Convolution Networks

Bibliographic Details
Published in: Computers, Materials & Continua, Vol. 84, No. 2, pp. 2975-2988
Main Authors: Long, Haiyan; Zhang, Chonghao; Qiu, Xudong; Chen, Hai; Chen, Gang
Format: Journal Article
Language: English
Published: Henderson: Tech Science Press, 2025
ISSN: 1546-2226, 1546-2218
DOI: 10.32604/cmc.2025.065047

Summary: 3D sparse convolution has emerged as a pivotal technique for efficient voxel-based perception in autonomous systems, enabling selective feature extraction from non-empty voxels while avoiding wasted computation. Despite its theoretical efficiency advantages, practical implementations face an under-explored limitation: the fixed geometric patterns of conventional sparse convolutional kernels inevitably process non-contributory positions during sliding-window operations, particularly in regions with uneven point cloud density. To address this, we propose Hierarchical Shape Pruning for 3D Sparse Convolution (HSP-S), which dynamically eliminates redundant kernel stripes through layer-adaptive thresholding. Unlike static soft-pruning methods, HSP-S maintains trainable sparsity patterns by progressively adjusting pruning thresholds during optimization, enlarging the original parameter search space while removing redundant operations. Extensive experiments validate the effectiveness of HSP-S across major autonomous driving benchmarks. On KITTI's 3D object detection task, our method removes 93.47% of redundant kernel computations while maintaining comparable accuracy (a 1.56% drop in mAP, mean Average Precision). Remarkably, on the more complex nuScenes benchmark, HSP-S achieves simultaneous computation reduction (21.94% sparsity) and accuracy gains (improvements of 1.02% mAP and 0.47% NDS, nuScenes detection score), demonstrating its scalability to diverse perception scenarios. This work establishes the first learnable shape pruning framework that simultaneously enhances computational efficiency and preserves detection accuracy in 3D perception systems.
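The core mechanism described in the summary, pruning kernel "stripes" whose contribution falls below a threshold that is progressively tightened during optimization, can be illustrated with a minimal NumPy sketch. Everything here is an illustrative assumption rather than the paper's actual implementation: the stripe definition (one spatial kernel position, with its norm summed over channels and depth), the magnitude criterion, the quantile-based threshold schedule, and the function names.

```python
import numpy as np

def stripe_l1_norms(weight):
    """Per-stripe L1 norms for a 3D conv weight of shape
    (out_c, in_c, kD, kH, kW).

    Here a 'stripe' is taken to be one (kH, kW) spatial position,
    aggregated over output channels, input channels, and depth --
    an illustrative choice, not necessarily the paper's definition.
    Returns an array of shape (kH, kW)."""
    return np.abs(weight).sum(axis=(0, 1, 2))

def prune_stripes(weight, step, total_steps, final_ratio=0.9):
    """Progressively tightened magnitude pruning of kernel stripes.

    The pruning ratio ramps linearly from 0 to `final_ratio` over
    training, so early in optimization (almost) nothing is pruned and
    the sparsity pattern stays trainable; later, stripes whose L1 norm
    falls below the current quantile threshold are zeroed out.
    Returns the masked weight and the (kH, kW) binary stripe mask."""
    ratio = final_ratio * min(1.0, step / total_steps)
    norms = stripe_l1_norms(weight)
    thresh = np.quantile(norms, ratio)
    mask = (norms >= thresh).astype(weight.dtype)
    # Broadcast the spatial mask over (out_c, in_c, kD).
    return weight * mask[None, None, None, :, :], mask
```

In an actual training loop the mask would be recomputed per layer each epoch (hence "layer-adaptive") and applied before the sparse convolution, so pruned stripes contribute neither multiply-accumulates nor gradients; the `final_ratio` here is just a stand-in for whatever sparsity target a given layer reaches.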