AD-L-JEPA-Release

Source code repo for AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data, the first joint-embedding predictive architecture (JEPA) based method for self-supervised representation learning of autonomous driving scenarios with LiDAR data.

We'll release the source code, pretrained models (both self-supervised and supervised, except for models with the Waymo dataset due to Waymo's licensing constraints), and training logs by the end of January 2025. Please stay tuned for more updates!

Timelines:

Initial commit
Source code release by the end of January 2025
Make code more organized and release pretrained models.

If this paper is helpful for you, you may consider cite it via:

@article{zhu2025ad,
  title={AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data},
  author={Zhu, Haoran and Dong, Zhenyuan and Topollai, Kristi and Choromanska, Anna},
  journal={arXiv preprint arXiv:2501.04969},
  year={2025}
}

Installation

This repo is developed by Python 3.8.

Installing pytorch:

pip install torch==1.10.1+cu111 torchvision==0.11.2+cu111 torchaudio==0.10.1 -f https://download.pytorch.org/whl/cu111/torch_stable.html

Installing other packages

pip install -r requirements.txt

For other installation requirements: Please refer to INSTALL.md for the installation of OpenPCDet(v0.5).

Setting up dataset

Please refer to GETTING_STARTED.md .

Set up KITTI dataset with different label efficiency, e.g, :

python -m pcdet.datasets.kitti.kitti_dataset_label_efficiency create_kitti_infos_label_efficiency tools/cfgs/dataset_configs/kitti_dataset_20_percent.yaml

Usage

Pre-training

Train with multiple GPUs:
bash ./scripts/dist_pretrain.sh ${NUM_GPUS}  --cfg_file ${CFG} --extra_tag ${EXP_TAG}

Train with a single GPU:
python3 ssl_pretrain.py  --cfg_file ${CFG} --extra_tag ${EXP_TAG}

e.g.:

bash ./scripts/dist_pretrain.sh 4 --cfg_file cfgs/kitti_models/ad_l_jepa_kitti.yaml --extra_tag ${EXP_TAG}

Then fine-tuning model

Same as OpenPCDet, e.g.:

bash ./scripts/dist_train.sh ${NUM_GPUS}  --cfg_file cfgs/kitti_models/second.yaml  --pretrained_model ../output/kitti/voxel_mae/ckpt/check_point_10.pth --extra_tag ${EXP_TAG}$

Acknowledgement

This repository is based on OpenPCDet, Occupancy-MAE, BEV-MAE, DINO

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
docker		docker
docs		docs
pcdet		pcdet
tools		tools
AD_L_JEPA_architecture.pdf		AD_L_JEPA_architecture.pdf
AD_L_JEPA_architecture.png		AD_L_JEPA_architecture.png
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AD-L-JEPA-Release

Installation

Setting up dataset

Usage

Pre-training

Then fine-tuning model

Acknowledgement

About

Releases

Packages

Languages

HaoranZhuExplorer/AD-L-JEPA-Release

Folders and files

Latest commit

History

Repository files navigation

AD-L-JEPA-Release

Installation

Setting up dataset

Usage

Pre-training

Then fine-tuning model

Acknowledgement

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages