Open Access
TransCalib: Automated Extrinsic Calibration of LiDAR–Camera Fusion using Convolutional Transformer for Targetless Self-Alignment
Author(s) -
Miftahul Khoir Shilahul Umam,
Jaejun Yoo,
Ida Bagus Krishna Yoga Utama,
Muhammad Rangga Aziz Nasution,
Muhammad Fairuz Mummtaz,
Muhammad Alfi Aldolio,
Su Mon Ko,
Yeong Min Jang
Publication year - 2025
Publication title -
IEEE Access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3615993
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
In autonomous systems and robotic applications, accurate extrinsic calibration between light detection and ranging (LiDAR) sensors and cameras is crucial for reliable sensor fusion. Several techniques have been developed, including target-based and targetless calibration, but they are either impractical for real-world applications or limited in extracting complex and diverse features. This study presents TransCalib, an innovative deep-learning method for targetless, automatic extrinsic calibration. TransCalib predicts the misalignment between the camera and LiDAR by leveraging EfficientNetV2, chosen for its performance and parameter efficiency, to extract features from the RGB camera image and the LiDAR point cloud projection image (depth image). We also developed a feature-matching module that comprises a calibration convolutional feature aggregation block (Calib-CFAB) and a convolutional self-attention (CSA) transformer: Calib-CFAB enriches the combined feature map of the RGB and depth images, while the CSA transformer captures the correlations within the feature maps. Trained and tested on the KITTI odometry dataset, TransCalib achieved a mean absolute rotation error of 0.14° and a mean translation error of 1.8 cm, outperforming existing methods. The proposed method enables robust fusion of LiDAR and camera data, improving the perception capabilities of autonomous systems.
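The depth image mentioned in the abstract is produced by projecting the LiDAR point cloud into the camera image plane using the extrinsic transform and the camera intrinsics. The paper does not publish its projection code, so the following is only a minimal pinhole-model sketch; the function name `project_points` and all parameters are hypothetical, not taken from TransCalib.

```python
def project_points(points, R, t, K, width, height):
    """Project LiDAR points into a sparse depth image (pinhole model).

    points : list of (x, y, z) in the LiDAR frame
    R, t   : hypothetical extrinsics (3x3 rotation list, 3-vector translation)
    K      : 3x3 camera intrinsics [[fx, 0, cx], [0, fy, cy], [0, 0, 1]]
    Returns a dict {(u, v): depth} holding the nearest return per pixel.
    """
    depth = {}
    for p in points:
        # Transform into the camera frame: Xc = R @ p + t
        xc = [sum(R[i][j] * p[j] for j in range(3)) + t[i] for i in range(3)]
        z = xc[2]
        if z <= 0:  # point lies behind the camera; cannot be imaged
            continue
        # Perspective division followed by the intrinsic mapping
        u = int(K[0][0] * xc[0] / z + K[0][2])
        v = int(K[1][1] * xc[1] / z + K[1][2])
        if 0 <= u < width and 0 <= v < height:
            # Keep only the closest point that lands on each pixel
            if (u, v) not in depth or z < depth[(u, v)]:
                depth[(u, v)] = z
    return depth
```

A miscalibrated R and t shifts every projected pixel, which is exactly the misalignment between the depth image and the RGB image that a network like TransCalib is trained to regress.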
