Deformable Transformer-Based Object Detection for Robust Perception in Autonomous Driving

No Thumbnail Available

Date

2025

Journal Title

Journal ISSN

Volume Title

Publisher

IEEE

Abstract

Autonomous driving demands robust and real-time object detection to safely navigate in complex environments. While Convolutional neural network (CNN)-based detectors have been widely adopted, they face challenges such as limited receptive fields and inefficiencies in handling small or occluded objects. This paper presents a deformable Transformer based object detection framework designed to address these limitations. By leveraging deformable attention mechanisms, the model dynamically focuses on relevant spatial regions, significantly enhancing detection accuracy. Evaluated on the benchmark KITTI dataset, our proposed approach achieves an interesting mAP@50 of 96.6%, surpassing many state-of-the-art methods, at the cost of slower inference speed (7.0 FPS). The experimental results also demonstrate the framework’s superior precision and adaptability in autonomous driving scenarios. This work underscores the potential of deformable transformers to advance perception systems, balancing high accuracy with the demands of real-world applications.

Description

Keywords

Object Detection, Convolutional Neural Network, Image edge detection

Citation

Endorsement

Review

Supplemented By

Referenced By