Marius Aasan
This repo contains code for Differentiable Hierarchical Visual Tokenization, accepted as a spotlight paper for NeurIPS 2025.
For an introduction to our work, visit the project webpage.
The repo can currently be installed as a package via:
# HTTPS
pip install git+https://github.com/dsb-ifi/dHT.git
# SSH
pip install git+ssh://git@github.com/dsb-ifi/dHT.gitYou can load the Superpixel Transformer models easily via torch.hub:
# Example with raster-to-vector model
model = torch.hub.load(
'dsb-ifi/dht',
'dht_ras2vec',
pretrained=True,
source='github',
)This will load the model and downloaded the pretrained weights, stored in your local torch.hub directory.
If you find our work useful, please consider citing our paper.
@inproceedings{aasan2025dht,
title={Differentiable Hierarchical Visual Tokenization},
author={Aasan, Marius and Hjelkrem-Tan, Martine and Catalano, Nico and Choi, Changkyu and Ram\'irez Rivera, Ad\'in},
booktitle={The Thirty-ninth Annual Conference on Neural Information Processing Systems},
year={2025},
url={https://openreview.net/forum?id=y8VWYf5cVI}
}
🚧 NOTE: The hubconf.py is still under construction, and will be updated with classification models soon.

