Jan Eric Lenssen (Senior Researcher)

Dr. Jan Eric Lenssen

Address: Max-Planck-Institut für Informatik
Saarland Informatics Campus
Campus E1 4
66123 Saarbrücken
Location: E1 4 - 618
Phone: +49 681 9325 2141
Fax: +49 681 9325 2099
E-mail: Get email via email

Personal Information

Please visit my group website for further information!

Selected Publications

For a full publication list, visit my group website!

2025

Conference paper

K. Raj, C. Wewer, R. Yunus, E. Ilg, and J. E. Lenssen

“Spurfies: Sparse-view Surface Reconstruction using Local Geometry Priors,” in 3DV 2025, International Conference on 3D Vision, Singapore.

@inproceedings{Raj3DV25,
TITLE = {Spurfies: {S}parse-view Surface Reconstruction using Local Geometry Priors},
AUTHOR = {Raj, Kevin and Wewer, Christopher and Yunus, Raza and Ilg, Eddy and Lenssen, Jan Eric},
LANGUAGE = {eng},
PUBLISHER = {IEEE},
YEAR = {2025},
PUBLREMARK = {Accepted},
MARGINALMARK = {$\bullet$},
BOOKTITLE = {3DV 2025, International Conference on 3D Vision},
ADDRESS = {Singapore},
}

Endnote

%0 Conference Proceedings
%A Raj, Kevin
%A Wewer, Christopher
%A Yunus, Raza
%A Ilg, Eddy
%A Lenssen, Jan Eric
%+ Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
%T Spurfies: Sparse-view Surface Reconstruction using Local Geometry Priors : 
%G eng
%U http://hdl.handle.net/21.11116/0000-0010-8FAB-D
%D 2025
%B International Conference on 3D Vision
%Z date of event: 2025-03-25 - 2025-03-28
%C Singapore
%B 3DV 2025
%I IEEE

Conference paper

X. Xie, J. E. Lenssen, and G. Pons-Moll

“InterTrack: Tracking Human Object Interaction without Object Templates,” in 3DV 2025, International Conference on 3D Vision, Singapore.

@inproceedings{Xie3DV25,
TITLE = {{InterTrack}: {Tr}acking Human Object Interaction without Object Templates},
AUTHOR = {Xie, Xianghui and Lenssen, Jan Eric and Pons-Moll, Gerard},
LANGUAGE = {eng},
PUBLISHER = {IEEE},
YEAR = {2025},
PUBLREMARK = {Accepted},
MARGINALMARK = {$\bullet$},
BOOKTITLE = {3DV 2025, International Conference on 3D Vision},
ADDRESS = {Singapore},
}

Endnote

%0 Conference Proceedings
%A Xie, Xianghui
%A Lenssen, Jan Eric
%A Pons-Moll, Gerard
%+ Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
%T InterTrack: Tracking Human Object Interaction without Object Templates : 
%G eng
%U http://hdl.handle.net/21.11116/0000-0010-8C0C-4
%D 2025
%B International Conference on 3D Vision
%Z date of event: 2025-03-25 - 2025-03-28
%C Singapore
%B 3DV 2025
%I IEEE

Conference paper

M. Asim, C. Wewer, T. Wimmer, B. Schiele, and J. E. Lenssen

“MEt3R: Measuring Multi-View Consistency in Generated Images,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025), Nashville, TN, USA.

@inproceedings{Asim_CVPR25,
TITLE = {{MEt3R}: {M}easuring Multi-View Consistency in Generated Images},
AUTHOR = {Asim, Mohammad and Wewer, Christopher and Wimmer, Thomas and Schiele, Bernt and Lenssen, Jan Eric},
LANGUAGE = {eng},
PUBLISHER = {IEEE},
YEAR = {2025},
PUBLREMARK = {Accepted},
MARGINALMARK = {$\bullet$},
BOOKTITLE = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025)},
ADDRESS = {Nashville, TN, USA},
}

Endnote

%0 Conference Proceedings
%A Asim, Mohammad
%A Wewer, Christopher
%A Wimmer, Thomas
%A Schiele, Bernt
%A Lenssen, Jan Eric
%+ Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
%T MEt3R: Measuring Multi-View Consistency in Generated Images : 
%G eng
%U http://hdl.handle.net/21.11116/0000-0010-7934-C
%D 2025
%B IEEE/CVF Conference on Computer Vision and Pattern Recognition
%Z date of event: 2025-06-11 - 2025-06-15
%C Nashville, TN, USA
%B IEEE/CVF Conference on Computer Vision and Pattern Recognition
%I IEEE

Conference paper

X. Hu, H. Wang, J. E. Lenssen, and B. Schiele

“PersonaHOI: Effortlessly Improving Personalized Face with Human-Object Interaction Generation,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025), Nashville, TN, USA.

Abstract

We introduce PersonaHOI, a training- and tuning-free framework that fuses a
general StableDiffusion model with a personalized face diffusion (PFD) model to
generate identity-consistent human-object interaction (HOI) images. While
existing PFD models have advanced significantly, they often overemphasize
facial features at the expense of full-body coherence, PersonaHOI introduces an
additional StableDiffusion (SD) branch guided by HOI-oriented text inputs. By
incorporating cross-attention constraints in the PFD branch and spatial merging
at both latent and residual levels, PersonaHOI preserves personalized facial
details while ensuring interactive non-facial regions. Experiments, validated
by a novel interaction alignment metric, demonstrate the superior realism and
scalability of PersonaHOI, establishing a new standard for practical
personalized face with HOI generation. Our code will be available at
github.com/JoyHuYY1412/PersonaHOI

BibTeX

@inproceedings{Hu_CVPR25,
TITLE = {{PersonaHOI}: {E}ffortlessly Improving Personalized Face with Human-Object Interaction Generation},
AUTHOR = {Hu, Xinting and Wang, Haoran and Lenssen, Jan Eric and Schiele, Bernt},
LANGUAGE = {eng},
PUBLISHER = {IEEE},
YEAR = {2025},
PUBLREMARK = {Accepted},
MARGINALMARK = {$\bullet$},
ABSTRACT = {We introduce PersonaHOI, a training- and tuning-free framework that fuses a<br>general StableDiffusion model with a personalized face diffusion (PFD) model to<br>generate identity-consistent human-object interaction (HOI) images. While<br>existing PFD models have advanced significantly, they often overemphasize<br>facial features at the expense of full-body coherence, PersonaHOI introduces an<br>additional StableDiffusion (SD) branch guided by HOI-oriented text inputs. By<br>incorporating cross-attention constraints in the PFD branch and spatial merging<br>at both latent and residual levels, PersonaHOI preserves personalized facial<br>details while ensuring interactive non-facial regions. Experiments, validated<br>by a novel interaction alignment metric, demonstrate the superior realism and<br>scalability of PersonaHOI, establishing a new standard for practical<br>personalized face with HOI generation. Our code will be available at<br>https://github.com/JoyHuYY1412/PersonaHOI<br>},
BOOKTITLE = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025)},
ADDRESS = {Nashville, TN, USA},
}

Endnote

%0 Conference Proceedings
%A Hu, Xinting
%A Wang, Haoran
%A Lenssen, Jan Eric
%A Schiele, Bernt
%+ Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
%T PersonaHOI: Effortlessly Improving Personalized Face with Human-Object
  Interaction Generation : 
%G eng
%U http://hdl.handle.net/21.11116/0000-0010-793D-3
%D 2025
%B IEEE/CVF Conference on Computer Vision and Pattern Recognition
%Z date of event: 2025-06-11 - 2025-06-15
%C Nashville, TN, USA
%X   We introduce PersonaHOI, a training- and tuning-free framework that fuses a<br>general StableDiffusion model with a personalized face diffusion (PFD) model to<br>generate identity-consistent human-object interaction (HOI) images. While<br>existing PFD models have advanced significantly, they often overemphasize<br>facial features at the expense of full-body coherence, PersonaHOI introduces an<br>additional StableDiffusion (SD) branch guided by HOI-oriented text inputs. By<br>incorporating cross-attention constraints in the PFD branch and spatial merging<br>at both latent and residual levels, PersonaHOI preserves personalized facial<br>details while ensuring interactive non-facial regions. Experiments, validated<br>by a novel interaction alignment metric, demonstrate the superior realism and<br>scalability of PersonaHOI, establishing a new standard for practical<br>personalized face with HOI generation. Our code will be available at<br>https://github.com/JoyHuYY1412/PersonaHOI<br>
%K Computer Science, Computer Vision and Pattern Recognition, cs.CV
%B IEEE/CVF Conference on Computer Vision and Pattern Recognition
%I IEEE

Conference paper

H. Wang, Y. Fan, M. F. Naeem, Y. Xian, J. E. Lenssen, L. Wang, F. Tombari, and B. Schiele

“TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters,” in Thirteenth International Conference on Learning Representations (ICLR 2025), Singapore.

@inproceedings{Wang_ICLR25,
TITLE = {{TokenFormer}: Rethinking Transformer Scaling with Tokenized Model Parameters},
AUTHOR = {Wang, Haiyang and Fan, Yue and Naeem, Muhammad Ferjad and Xian, Yongqin and Lenssen, Jan Eric and Wang, Liwei and Tombari, Federico and Schiele, Bernt},
LANGUAGE = {eng},
YEAR = {2025},
PUBLREMARK = {Accepted},
MARGINALMARK = {$\bullet$},
BOOKTITLE = {Thirteenth International Conference on Learning Representations (ICLR 2025)},
ADDRESS = {Singapore},
}

Endnote

%0 Conference Proceedings
%A Wang, Haiyang
%A Fan, Yue
%A Naeem, Muhammad Ferjad
%A Xian, Yongqin
%A Lenssen, Jan Eric
%A Wang, Liwei
%A Tombari, Federico
%A Schiele, Bernt
%+ Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
%T TokenFormer: Rethinking Transformer Scaling with Tokenized Model
  Parameters : 
%G eng
%U http://hdl.handle.net/21.11116/0000-0010-0FC1-4
%D 2025
%B Thirteenth International Conference on Learning Representations
%Z date of event: 2025-04-24 - 2025-04-28
%C Singapore
%B Thirteenth International Conference on Learning Representations
%U https://iclr.cc/Conferences/2025

Conference paper

Y. Yuan, Z. Zhang, X. He, A. Nitta, W. Hu, D. Wang, M. Shah, S. Huang, B. Stojanovič, A. Krumholz, J. E. Lenssen, J. Leskovec, and M. Fey

“ContextGNN: Beyond Two-Tower Recommendation Systems,” in Thirteenth International Conference on Learning Representations (ICLR 2025), Singapore.

@inproceedings{Yuan_ICLR25,
TITLE = {{ContextGNN}: {B}eyond Two-Tower Recommendation Systems},
AUTHOR = {Yuan, Yiwen and Zhang, Zecheng and He, Xinwei and Nitta, Akihiro and Hu, Weihua and Wang, Dong and Shah, Manan and Huang, Shenyang and Stojanovi{\v c}, Bla{\v z} and Krumholz, Alan and Lenssen, Jan Eric and Leskovec, Jure and Fey, Matthias},
LANGUAGE = {eng},
YEAR = {2025},
PUBLREMARK = {Accepted},
MARGINALMARK = {$\bullet$},
BOOKTITLE = {Thirteenth International Conference on Learning Representations (ICLR 2025)},
ADDRESS = {Singapore},
}

Endnote

%0 Conference Proceedings
%A Yuan, Yiwen
%A Zhang, Zecheng
%A He, Xinwei
%A Nitta, Akihiro
%A Hu, Weihua
%A Wang, Dong
%A Shah, Manan
%A Huang, Shenyang
%A Stojanovi&#269;, Bla&#382;
%A Krumholz, Alan
%A Lenssen, Jan Eric
%A Leskovec, Jure
%A Fey, Matthias
%+ External Organizations
External Organizations
External Organizations
External Organizations
External Organizations
External Organizations
External Organizations
External Organizations
External Organizations
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
External Organizations
%T ContextGNN: Beyond Two-Tower Recommendation Systems : 
%G eng
%U http://hdl.handle.net/21.11116/0000-0010-905A-6
%D 2025
%B Thirteenth International Conference on Learning Representations
%Z date of event: 2025-04-24 - 2025-04-28
%C Singapore
%B Thirteenth International Conference on Learning Representations
%U https://iclr.cc/Conferences/2025

Paper

J. Erbach, D. Narnhofer, A. Dombos, B. Schiele, J. E. Lenssen, and K. Schindler

“Solving Inverse Problems with FLAIR,” 2025. [Online]. Available: https://arxiv.org/abs/2506.02680.

Abstract

Flow-based latent generative models such as Stable Diffusion 3 are able to
generate images with remarkable quality, even enabling photorealistic
text-to-image generation. Their impressive performance suggests that these
models should also constitute powerful priors for inverse imaging problems, but
that approach has not yet led to comparable fidelity. There are several key
obstacles: (i) the encoding into a lower-dimensional latent space makes the
underlying (forward) mapping non-linear; (ii) the data likelihood term is
usually intractable; and (iii) learned generative models struggle to recover
rare, atypical data modes during inference. We present FLAIR, a novel training
free variational framework that leverages flow-based generative models as a
prior for inverse problems. To that end, we introduce a variational objective
for flow matching that is agnostic to the type of degradation, and combine it
with deterministic trajectory adjustments to recover atypical modes. To enforce
exact consistency with the observed data, we decouple the optimization of the
data fidelity and regularization terms. Moreover, we introduce a time-dependent
calibration scheme in which the strength of the regularization is modulated
according to off-line accuracy estimates. Results on standard imaging
benchmarks demonstrate that FLAIR consistently outperforms existing diffusion-
and flow-based methods in terms of reconstruction quality and sample diversity.

BibTeX

@online{Erbach_2506.02680,
TITLE = {Solving Inverse Problems with {FLAIR}},
AUTHOR = {Erbach, Julius and Narnhofer, Dominik and Dombos, Andreas and Schiele, Bernt and Lenssen, Jan Eric and Schindler, Konrad},
LANGUAGE = {eng},
URL = {https://arxiv.org/abs/2506.02680},
EPRINT = {2506.02680},
EPRINTTYPE = {arXiv},
YEAR = {2025},
MARGINALMARK = {$\bullet$},
ABSTRACT = {Flow-based latent generative models such as Stable Diffusion 3 are able to<br>generate images with remarkable quality, even enabling photorealistic<br>text-to-image generation. Their impressive performance suggests that these<br>models should also constitute powerful priors for inverse imaging problems, but<br>that approach has not yet led to comparable fidelity. There are several key<br>obstacles: (i) the encoding into a lower-dimensional latent space makes the<br>underlying (forward) mapping non-linear; (ii) the data likelihood term is<br>usually intractable; and (iii) learned generative models struggle to recover<br>rare, atypical data modes during inference. We present FLAIR, a novel training<br>free variational framework that leverages flow-based generative models as a<br>prior for inverse problems. To that end, we introduce a variational objective<br>for flow matching that is agnostic to the type of degradation, and combine it<br>with deterministic trajectory adjustments to recover atypical modes. To enforce<br>exact consistency with the observed data, we decouple the optimization of the<br>data fidelity and regularization terms. Moreover, we introduce a time-dependent<br>calibration scheme in which the strength of the regularization is modulated<br>according to off-line accuracy estimates. Results on standard imaging<br>benchmarks demonstrate that FLAIR consistently outperforms existing diffusion-<br>and flow-based methods in terms of reconstruction quality and sample diversity.<br>},
}

Endnote

%0 Report
%A Erbach, Julius
%A Narnhofer, Dominik
%A Dombos, Andreas
%A Schiele, Bernt
%A Lenssen, Jan Eric
%A Schindler, Konrad
%+ External Organizations
External Organizations
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
%T Solving Inverse Problems with FLAIR : 
%G eng
%U http://hdl.handle.net/21.11116/0000-0011-5A9B-A
%U https://arxiv.org/abs/2506.02680
%D 2025
%X   Flow-based latent generative models such as Stable Diffusion 3 are able to<br>generate images with remarkable quality, even enabling photorealistic<br>text-to-image generation. Their impressive performance suggests that these<br>models should also constitute powerful priors for inverse imaging problems, but<br>that approach has not yet led to comparable fidelity. There are several key<br>obstacles: (i) the encoding into a lower-dimensional latent space makes the<br>underlying (forward) mapping non-linear; (ii) the data likelihood term is<br>usually intractable; and (iii) learned generative models struggle to recover<br>rare, atypical data modes during inference. We present FLAIR, a novel training<br>free variational framework that leverages flow-based generative models as a<br>prior for inverse problems. To that end, we introduce a variational objective<br>for flow matching that is agnostic to the type of degradation, and combine it<br>with deterministic trajectory adjustments to recover atypical modes. To enforce<br>exact consistency with the observed data, we decouple the optimization of the<br>data fidelity and regularization terms. Moreover, we introduce a time-dependent<br>calibration scheme in which the strength of the regularization is modulated<br>according to off-line accuracy estimates. Results on standard imaging<br>benchmarks demonstrate that FLAIR consistently outperforms existing diffusion-<br>and flow-based methods in terms of reconstruction quality and sample diversity.<br>
%K Computer Science, Computer Vision and Pattern Recognition, cs.CV,eess.IV

Paper

C. Wewer, B. Pogodzinski, B. Schiele, and J. E. Lenssen

“Spatial Reasoning with Denoising Models,” 2025. [Online]. Available: https://www.arxiv.org/abs/2502.21075.

Abstract

We introduce Spatial Reasoning Models (SRMs), a framework to perform
reasoning over sets of continuous variables via denoising generative models.
SRMs infer continuous representations on a set of unobserved variables, given
observations on observed variables. Current generative models on spatial
domains, such as diffusion and flow matching models, often collapse to
hallucination in case of complex distributions. To measure this, we introduce a
set of benchmark tasks that test the quality of complex reasoning in generative
models and can quantify hallucination. The SRM framework allows to report key
findings about importance of sequentialization in generation, the associated
order, as well as the sampling strategies during training. It demonstrates, for
the first time, that order of generation can successfully be predicted by the
denoising network itself. Using these findings, we can increase the accuracy of
specific reasoning tasks from 1% to >50%.

BibTeX

@online{Wewer2502.21075,
TITLE = {Spatial Reasoning with Denoising Models},
AUTHOR = {Wewer, Christopher and Pogodzinski, Bart and Schiele, Bernt and Lenssen, Jan Eric},
LANGUAGE = {eng},
URL = {https://www.arxiv.org/abs/2502.21075},
EPRINT = {2502.21075},
EPRINTTYPE = {arXiv},
YEAR = {2025},
MARGINALMARK = {$\bullet$},
ABSTRACT = {We introduce Spatial Reasoning Models (SRMs), a framework to perform<br>reasoning over sets of continuous variables via denoising generative models.<br>SRMs infer continuous representations on a set of unobserved variables, given<br>observations on observed variables. Current generative models on spatial<br>domains, such as diffusion and flow matching models, often collapse to<br>hallucination in case of complex distributions. To measure this, we introduce a<br>set of benchmark tasks that test the quality of complex reasoning in generative<br>models and can quantify hallucination. The SRM framework allows to report key<br>findings about importance of sequentialization in generation, the associated<br>order, as well as the sampling strategies during training. It demonstrates, for<br>the first time, that order of generation can successfully be predicted by the<br>denoising network itself. Using these findings, we can increase the accuracy of<br>specific reasoning tasks from <1% to >50%.<br>},
}

Endnote

%0 Report
%A Wewer, Christopher
%A Pogodzinski, Bart
%A Schiele, Bernt
%A Lenssen, Jan Eric
%+ Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
%T Spatial Reasoning with Denoising Models : 
%G eng
%U http://hdl.handle.net/21.11116/0000-0010-DADB-2
%U https://www.arxiv.org/abs/2502.21075
%D 2025
%X   We introduce Spatial Reasoning Models (SRMs), a framework to perform<br>reasoning over sets of continuous variables via denoising generative models.<br>SRMs infer continuous representations on a set of unobserved variables, given<br>observations on observed variables. Current generative models on spatial<br>domains, such as diffusion and flow matching models, often collapse to<br>hallucination in case of complex distributions. To measure this, we introduce a<br>set of benchmark tasks that test the quality of complex reasoning in generative<br>models and can quantify hallucination. The SRM framework allows to report key<br>findings about importance of sequentialization in generation, the associated<br>order, as well as the sampling strategies during training. It demonstrates, for<br>the first time, that order of generation can successfully be predicted by the<br>denoising network itself. Using these findings, we can increase the accuracy of<br>specific reasoning tasks from <1% to >50%.<br>
%K Computer Science, Computer Vision and Pattern Recognition, cs.CV,Computer Science, Learning, cs.LG

2024

Conference paper

W. Böttcher, L. Hoyer, O. Unal, J. E. Lenssen, and B. Schiele

“Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets,” in Advances in Neural Information Processing Systems 37 (NeurIPS 2024), Vancouver, Canada, 2024.

@inproceedings{Boettcher_Neurips24,
TITLE = {{S}cribbles for All: {B}enchmarking Scribble Supervised Segmentation Across Datasets},
AUTHOR = {B{\"o}ttcher, Wolfgang and Hoyer, Lukas and Unal, Ozan and Lenssen, Jan Eric and Schiele, Bernt},
LANGUAGE = {eng},
PUBLISHER = {Curran Associates, Inc.},
YEAR = {2024},
MARGINALMARK = {$\bullet$},
BOOKTITLE = {Advances in Neural Information Processing Systems 37 (NeurIPS 2024)},
EDITOR = {Globerson, A. and Mackey, L. and Belgrave, D. and Fan, A. and Paquet, U. and Tomczak, J. and Zhang, c.},
PAGES = {46002--46024},
ADDRESS = {Vancouver, Canada},
}

Endnote

%0 Conference Proceedings
%A B&#246;ttcher, Wolfgang
%A Hoyer, Lukas
%A Unal, Ozan
%A Lenssen, Jan Eric
%A Schiele, Bernt
%+ Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
%T Scribbles for All: Benchmarking Scribble Supervised Segmentation Across
  Datasets : 
%G eng
%U http://hdl.handle.net/21.11116/0000-0010-4548-0
%D 2024
%B 38th Conference on Neural Information Processing Systems
%Z date of event: 2024-12-10 - 2024-12-15
%C Vancouver, Canada
%B Advances in Neural Information Processing Systems 37
%E Globerson, A.; Mackey, L.; Belgrave, D.; Fan, A.; Paquet, U.; Tomczak, J.; Zhang, c.
%P 46002 - 46024
%I Curran Associates, Inc.
%U https://papers.nips.cc/paper_files/paper/2024/hash/51cd2b3608d7ae17a9fadcc6e1f68629-Abstract-Datasets_and_Benchmarks_Track.html

Conference paper

J. Chen, J. E. Lenssen, A. Feng, W. Hu, M. Fey, L. Tassiulas, J. Leskovec, and R. Ying

“From Similarity to Superiority: Channel Clustering for Time Series Forecasting,” in Advances in Neural Information Processing Systems 37 (NeurIPS 2024), Vancouver, Canada, 2024.

@inproceedings{Chen_Neurips24,
TITLE = {From Similarity to Superiority: Channel Clustering for Time Series Forecasting},
AUTHOR = {Chen, Jialin and Lenssen, Jan Eric and Feng, Aosong and Hu, Weihua and Fey, Matthias and Tassiulas, Leandros and Leskovec, Jure and Ying, Rex},
LANGUAGE = {eng},
PUBLISHER = {Curran Associates, Inc},
YEAR = {2024},
MARGINALMARK = {$\bullet$},
BOOKTITLE = {Advances in Neural Information Processing Systems 37 (NeurIPS 2024)},
EDITOR = {Globerson, A. and Mackey, L. and Belgrave, D. and Fan, A. and Paquet, U. and Tomczak, J. and Zhang, C.},
PAGES = {130635--130663},
ADDRESS = {Vancouver, Canada},
}

Endnote

%0 Conference Proceedings
%A Chen, Jialin
%A Lenssen, Jan Eric
%A Feng, Aosong
%A Hu, Weihua
%A Fey, Matthias
%A Tassiulas, Leandros
%A Leskovec, Jure
%A Ying, Rex
%+ External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
External Organizations
External Organizations
External Organizations
External Organizations
External Organizations
%T From Similarity to Superiority: Channel Clustering for Time Series Forecasting : 
%G eng
%U http://hdl.handle.net/21.11116/0000-0010-9121-4
%D 2024
%B 38th Conference on Neural Information Processing Systems
%Z date of event: 2024-12-10 - 2024-12-15
%C Vancouver, Canada
%B Advances in Neural Information Processing Systems 37
%E Globerson, A.; Mackey, L.; Belgrave, D.; Fan, A.; Paquet, U.; Tomczak, J.; Zhang, C.
%P 130635 - 130663
%I Curran Associates, Inc
%U https://proceedings.neurips.cc/paper_files/paper/2024/file/eb9b18ccb76a1156af5779ffdca1d91f-Paper-Conference.pdf

Conference paper

J. Robinson, R. Ranjan, W. Hu, K. Huang, J. Han, A. Dobles, M. Fey, J. E. Lenssen, Y. Yuan, Z. Zhang, X. He, and J. Leskovec

“RelBench: A Benchmark for Deep Learning on Relational Databases,” in Advances in Neural Information Processing Systems 37 (NeurIPS 2024), Vancouver, Canada, 2024.

@inproceedings{Robinson_Neurips24,
TITLE = {{RelBench}: {A} Benchmark for Deep Learning on Relational Databases},
AUTHOR = {Robinson, Joshua and Ranjan, Rishabh and Hu, Weihua and Huang, Kexin and Han, Jiaqi and Dobles, Alejandro and Fey, Matthias and Lenssen, Jan Eric and Yuan, Yiwen and Zhang, Zecheng and He, Xinwei and Leskovec, Jure},
LANGUAGE = {eng},
PUBLISHER = {Curran Associates, Inc.},
YEAR = {2024},
MARGINALMARK = {$\bullet$},
BOOKTITLE = {Advances in Neural Information Processing Systems 37 (NeurIPS 2024)},
EDITOR = {Globerson, A. and Mackey, L. and Belgrave, D. and Fan, A. and Paquet, U. and Tomczak, J. and Zhang, C.},
PAGES = {21330--21341},
ADDRESS = {Vancouver, Canada},
}

Endnote

%0 Conference Proceedings
%A Robinson, Joshua
%A Ranjan, Rishabh
%A Hu, Weihua
%A Huang, Kexin
%A Han, Jiaqi
%A Dobles, Alejandro
%A Fey, Matthias
%A Lenssen, Jan Eric
%A Yuan, Yiwen
%A Zhang, Zecheng
%A He, Xinwei
%A Leskovec, Jure
%+ External Organizations
External Organizations
External Organizations
External Organizations
External Organizations
External Organizations
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
External Organizations
External Organizations
External Organizations
%T RelBench: A Benchmark for Deep Learning on Relational Databases : 
%G eng
%U http://hdl.handle.net/21.11116/0000-0010-9060-E
%D 2024
%B 38th Conference on Neural Information Processing Systems
%Z date of event: 2024-12-10 - 2024-12-15
%C Vancouver, Canada
%B Advances in Neural Information Processing Systems 37
%E Globerson, A.; Mackey, L.; Belgrave, D.; Fan, A.; Paquet, U.; Tomczak, J.; Zhang, C.
%P 21330 - 21341
%I Curran Associates, Inc.
%U https://proceedings.neurips.cc/paper_files/paper/2024/file/25cd345233c65fac1fec0ce61d0f7836-Paper-Datasets_and_Benchmarks_Track.pdf

Article

D2D6

R. Yunus, J. E. Lenssen, M. Niemeyer, Y. Liao, C. Rupprecht, C. Theobalt, G. Pons-Moll, J.-B. Huang, V. Golyanik, and E. Ilg,

“Recent Trends in 3D Reconstruction of General Non-Rigid Scenes,” Computer Graphics Forum (Proc. EUROGRAPHICS 2024), vol. 43, no. 2, 2024.

@article{Yunus_Eurographics24,
TITLE = {Recent Trends in {3D} Reconstruction of General Non-Rigid Scenes},
AUTHOR = {Yunus, Raza and Lenssen, Jan Eric and Niemeyer, Michael and Liao, Yiyi and Rupprecht, Christian and Theobalt, Christian and Pons-Moll, Gerard and Huang, Jia-Bin and Golyanik, Vladislav and Ilg,, Eddy},
LANGUAGE = {eng},
ISSN = {0167-7055},
DOI = {10.1111/cgf.15062},
PUBLISHER = {Blackwell-Wiley},
ADDRESS = {Oxford},
YEAR = {2024},
MARGINALMARK = {$\bullet$},
DATE = {2024},
JOURNAL = {Computer Graphics Forum (Proc. EUROGRAPHICS)},
VOLUME = {43},
NUMBER = {2},
PAGES = {1--42},
EID = {e15062},
BOOKTITLE = {EUROGRAPHICS 2024},
EDITOR = {Bermano, A. and Kaolgerakis, E.},
}

Endnote

%0 Journal Article
%A Yunus, Raza
%A Lenssen, Jan Eric
%A Niemeyer, Michael
%A Liao, Yiyi
%A Rupprecht, Christian
%A Theobalt, Christian
%A Pons-Moll, Gerard
%A Huang, Jia-Bin
%A Golyanik, Vladislav
%A Ilg,, Eddy
%+ Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
External Organizations
External Organizations
Visual Computing and Artificial Intelligence, MPI for Informatics, Max Planck Society
External Organizations
External Organizations
Visual Computing and Artificial Intelligence, MPI for Informatics, Max Planck Society
External Organizations
%T Recent Trends in 3D Reconstruction of General Non-Rigid Scenes
 : 
%G eng
%U http://hdl.handle.net/21.11116/0000-000F-4DA5-1
%R 10.1111/cgf.15062
%7 2024-04-30
%D 2024
%J Computer Graphics Forum
%O Computer Graphics Forum : journal of the European Association for Computer Graphics Comput. Graph. Forum
%V 43
%N 2
%& 1
%P 1 - 42
%Z sequence number: e15062
%I Blackwell-Wiley
%C Oxford
%@ false
%B EUROGRAPHICS 2024
%O EUROGRAPHICS 2024 EG 2024 The European Association for Computer Graphics
45th Annual Conference ; Limassol, Cyprus, April 22-26

Conference paper

C. Wewer, K. Raj, E. Ilg, B. Schiele, and J. E. Lenssen

“latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D Reconstruction,” in Computer Vision -- ECCV 2024, Milano, Italy, 2024.

@inproceedings{WewerECCV24,
TITLE = {{latentSplat}: {A}utoencoding Variational {G}aussians for Fast Generalizable {3D} Reconstruction},
AUTHOR = {Wewer, Christopher and Raj, Kevin and Ilg, Eddy and Schiele, Bernt and Lenssen, Jan Eric},
LANGUAGE = {eng},
ISBN = {978-3-031-73020-7},
DOI = {10.1007/978-3-031-73021-4_27},
PUBLISHER = {Springer},
YEAR = {2024},
MARGINALMARK = {$\bullet$},
DATE = {2024},
BOOKTITLE = {Computer Vision -- ECCV 2024},
EDITOR = {Leonardis, Ale{\v s} and Ricci, Elisa and Roth, Stefan and Russakovsky, Olga and Sattler, Torsten and Varol, G{\"u}l},
PAGES = {456--473},
SERIES = {Lecture Notes in Computer Science},
VOLUME = {15145},
ADDRESS = {Milano, Italy},
}

Endnote

%0 Conference Proceedings
%A Wewer, Christopher
%A Raj, Kevin
%A Ilg, Eddy
%A Schiele, Bernt
%A Lenssen, Jan Eric
%+ Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
%T latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D Reconstruction : 
%G eng
%U http://hdl.handle.net/21.11116/0000-000F-D86F-2
%R 10.1007/978-3-031-73021-4_27
%D 2024
%B 18th European Conference on Computer Vision 
%Z date of event: 2024-09-29 - 2024-10-04
%C Milano, Italy
%B Computer Vision -- ECCV 2024
%E Leonardis, Ale&#353;; Ricci, Elisa; Roth, Stefan; Russakovsky, Olga; Sattler, Torsten; Varol, G&#252;l
%P 456 - 473
%I Springer
%@ 978-3-031-73020-7
%B Lecture Notes in Computer Science
%N 15145

Conference paper

Y. Yue, A. Das, F. Engelmann, S. Tang, and J. E. Lenssen

“Improving 2D Feature Representations by 3D-Aware Fine-Tuning,” in Computer Vision -- ECCV 2024, Milano, Italy, 2024.

@inproceedings{YueECCV24,
TITLE = {Improving {2D} Feature Representations by {3D}-Aware Fine-Tuning},
AUTHOR = {Yue, Yuanwen and Das, Anurag and Engelmann, Francis and Tang, Siyu and Lenssen, Jan Eric},
LANGUAGE = {eng},
ISBN = {978-3-031-72627-9},
DOI = {10.1007/978-3-031-72627-9_4},
PUBLISHER = {Springer},
YEAR = {2024},
MARGINALMARK = {$\bullet$},
DATE = {2024},
BOOKTITLE = {Computer Vision -- ECCV 2024},
EDITOR = {Leonardis, Ale{\v s} and Ricci, Elisa and Roth, Stefan and Russakovsky, Olga and Sattler, Torsten and Varol, G{\"u}l},
PAGES = {57--74},
SERIES = {Lecture Notes in Computer Science},
VOLUME = {15060},
ADDRESS = {Milano, Italy},
}

Endnote

%0 Conference Proceedings
%A Yue, Yuanwen
%A Das, Anurag
%A Engelmann, Francis
%A Tang, Siyu
%A Lenssen, Jan Eric
%+ External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
%T Improving 2D Feature Representations by 3D-Aware Fine-Tuning : 
%G eng
%U http://hdl.handle.net/21.11116/0000-000F-D86C-5
%R 10.1007/978-3-031-72627-9_4
%D 2024
%B 18th European Conference on Computer Vision 
%Z date of event: 2024-09-29 - 2024-10-04
%C Milano, Italy
%B Computer Vision -- ECCV 2024
%E Leonardis, Ale&#353;; Ricci, Elisa; Roth, Stefan; Russakovsky, Olga; Sattler, Torsten; Varol, G&#252;l
%P 57 - 74
%I Springer
%@ 978-3-031-72627-9
%B Lecture Notes in Computer Science
%N 15060

Conference paper

S. Paul, C. Wewer, B. Schiele, and J. E. Lenssen

“Sp2360: Sparse-view 360° Scene Reconstruction using Cascaded 2D Diffusion Priors,” in ECCV 2024 Workshop on Wild 3D (ECCV 2024 Wild3D), Milan, Italy, 2024.

@inproceedings{Paul_Wild3D,
TITLE = {Sp2360: {S}parse-view 360$^{\circ}$ Scene Reconstruction using Cascaded {2D} Diffusion Priors},
AUTHOR = {Paul, Soumava and Wewer, Christopher and Schiele, Bernt and Lenssen, Jan Eric},
LANGUAGE = {eng},
URL = {https://openreview.net/forum?id=XuNhNyHHwK},
PUBLISHER = {OpenReview.net},
YEAR = {2024},
MARGINALMARK = {$\bullet$},
BOOKTITLE = {ECCV 2024 Workshop on Wild 3D (ECCV 2024 Wild3D)},
ADDRESS = {Milan, Italy},
}

Endnote

%0 Conference Proceedings
%A Paul, Soumava
%A Wewer, Christopher
%A Schiele, Bernt
%A Lenssen, Jan Eric
%+ Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
%T Sp2360: Sparse-view 360&#176; Scene Reconstruction using Cascaded 2D Diffusion Priors : 
%G eng
%U http://hdl.handle.net/21.11116/0000-0010-1041-2
%U https://openreview.net/forum?id=XuNhNyHHwK
%D 2024
%8 09.09.2024
%B ECCV 2024 Workshop on Wild 3D
%Z date of event: 2024-09-30 - 2024-09-30
%C Milan, Italy
%B ECCV 2024 Workshop on Wild 3D

%I OpenReview.net

Conference paper

D. Das, C. Wewer, R. Yunus, E. Ilg, and J. E. Lenssen

“Neural Parametric Gaussians for Monocular Non-Rigid Object Reconstruction,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024), Seattle, WA, USA, 2024.

@inproceedings{2312.01196,
TITLE = {Neural Parametric {G}aussians for Monocular Non-Rigid Object Reconstruction},
AUTHOR = {Das, Devikalyan and Wewer, Christopher and Yunus, Raza and Ilg, Eddy and Lenssen, Jan Eric},
LANGUAGE = {eng},
ISBN = {979-8-3503-5301-3},
DOI = {10.1109/CVPR52733.2024.01019},
PUBLISHER = {IEEE},
YEAR = {2024},
MARGINALMARK = {$\bullet$},
DATE = {2024},
BOOKTITLE = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)},
PAGES = {10715--10725},
ADDRESS = {Seattle, WA, USA},
}

Endnote

%0 Conference Proceedings
%A Das, Devikalyan
%A Wewer, Christopher
%A Yunus, Raza
%A Ilg, Eddy
%A Lenssen, Jan Eric
%+ External Organizations
External Organizations
External Organizations
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
%T Neural Parametric Gaussians for Monocular Non-Rigid Object
  Reconstruction : 
%G eng
%U http://hdl.handle.net/21.11116/0000-000E-B10E-B
%R 10.1109/CVPR52733.2024.01019
%D 2024
%B 37th IEEE/CVF Conference on Computer Vision and Pattern Recognition
%Z date of event: 2024-06-17 - 2024-06-21
%C Seattle, WA, USA
%B IEEE/CVF Conference on Computer Vision and Pattern Recognition
%P 10715 - 10725
%I IEEE
%@ 979-8-3503-5301-3

Conference paper

Y. He, G. Tiwari, T. Birdal, J. E. Lenssen, and G. Pons-Moll

“NRDF: Neural Riemannian Distance Fields for Learning Articulated Pose Priors,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024), Seattle, WA, USA, 2024.

@inproceedings{he24nrdf,
TITLE = {{NRDF}: {N}eural {R}iemannian Distance Fields for Learning Articulated Pose Priors},
AUTHOR = {He, Yannan and Tiwari, Garvita and Birdal, Tolga and Lenssen, Jan Eric and Pons-Moll, Gerard},
LANGUAGE = {eng},
DOI = {10.1109/CVPR52733.2024.00164},
PUBLISHER = {IEEE},
YEAR = {2024},
MARGINALMARK = {$\bullet$},
DATE = {2024},
BOOKTITLE = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)},
PAGES = {1661--1671},
ADDRESS = {Seattle, WA, USA},
}

Endnote

%0 Conference Proceedings
%A He, Yannan
%A Tiwari, Garvita
%A Birdal, Tolga
%A Lenssen, Jan Eric
%A Pons-Moll, Gerard
%+ External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
%T NRDF: Neural Riemannian Distance Fields for Learning Articulated Pose Priors : 
%G eng
%U http://hdl.handle.net/21.11116/0000-000E-B120-5
%R 10.1109/CVPR52733.2024.00164
%D 2024
%B 37th IEEE/CVF Conference on Computer Vision and Pattern Recognition
%Z date of event: 2024-06-17 - 2024-06-21
%C Seattle, WA, USA
%B IEEE/CVF Conference on Computer Vision and Pattern Recognition
%P 1661 - 1671
%I IEEE

Conference paper

P. Schröppel, C. Wewer, J. E. Lenssen, E. Ilg, and T. Brox

“Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024), Seattle, WA, USA, 2024.

@inproceedings{schroppel24npcd,
TITLE = {Neural Point Cloud Diffusion for Disentangled {3D} Shape and Appearance Generation},
AUTHOR = {Schr{\"o}ppel, Philipp and Wewer, Christopher and Lenssen, Jan Eric and Ilg, Eddy and Brox, Thomas},
LANGUAGE = {eng},
ISBN = {979-8-3503-5300-6},
DOI = {10.1109/CVPR52733.2024.00839},
PUBLISHER = {IEEE},
YEAR = {2024},
MARGINALMARK = {$\bullet$},
DATE = {2024},
BOOKTITLE = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)},
PAGES = {8785--8794},
ADDRESS = {Seattle, WA, USA},
}

Endnote

%0 Conference Proceedings
%A Schr&#246;ppel, Philipp
%A Wewer, Christopher
%A Lenssen, Jan Eric
%A Ilg, Eddy
%A Brox, Thomas
%+ External Organizations
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
External Organizations
%T Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation : 
%G eng
%U http://hdl.handle.net/21.11116/0000-000E-B0D8-7
%R 10.1109/CVPR52733.2024.00839
%D 2024
%B 37th IEEE/CVF Conference on Computer Vision and Pattern Recognition
%Z date of event: 2024-06-17 - 2024-06-21
%C Seattle, WA, USA
%B IEEE/CVF Conference on Computer Vision and Pattern Recognition
%P 8785 - 8794
%I IEEE
%@ 979-8-3503-5300-6

Conference paper

X. Xie, B. L. Bhatnagar, J. E. Lenssen, and G. Pons-Moll

“Template Free Reconstruction of Human-object Interaction with Procedural Interaction Generation,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024), Seattle, WA, USA, 2024.

@inproceedings{xie24hdm,
TITLE = {Template Free Reconstruction of Human-object Interaction with Procedural Interaction Generation},
AUTHOR = {Xie, Xianghui and Bhatnagar, Bharat Lal and Lenssen, Jan Eric and Pons-Moll, Gerard},
LANGUAGE = {eng},
ISBN = {979-8-3503-5301-3},
DOI = {10.1109/CVPR52733.2024.00954},
PUBLISHER = {IEEE},
YEAR = {2024},
MARGINALMARK = {$\bullet$},
DATE = {2024},
BOOKTITLE = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)},
PAGES = {10003--10015},
ADDRESS = {Seattle, WA, USA},
}

Endnote

%0 Conference Proceedings
%A Xie, Xianghui
%A Bhatnagar, Bharat Lal
%A Lenssen, Jan Eric
%A Pons-Moll, Gerard
%+ External Organizations
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
%T Template Free Reconstruction of Human-object Interaction with Procedural Interaction Generation : 
%G eng
%U http://hdl.handle.net/21.11116/0000-000E-B0C2-F
%R 10.1109/CVPR52733.2024.00954
%D 2024
%B 37th IEEE/CVF Conference on Computer Vision and Pattern Recognition
%Z date of event: 2024-06-17 - 2024-06-21
%C Seattle, WA, USA
%B IEEE/CVF Conference on Computer Vision and Pattern Recognition
%P 10003 - 10015
%I IEEE
%@ 979-8-3503-5301-3
%U https://virtualhumans.mpi-inf.mpg.de/procigen-hdm

Conference paper

K. Zhou, B. L. Bhatnagar, J. E. Lenssen, and G. Pons-Moll

“GEARS: Local Geometry-aware Hand-object Interaction Synthesis,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024), Seattle, WA, USA, 2024.

@inproceedings{zhou24gears,
TITLE = {{GEARS}: {L}ocal Geometry-aware Hand-object Interaction Synthesis},
AUTHOR = {Zhou, Keyang and Bhatnagar, Bharat Lal and Lenssen, Jan Eric and Pons-Moll, Gerard},
LANGUAGE = {eng},
ISBN = {979-8-3503-5301-3},
DOI = {10.1109/CVPR52733.2024.01950},
PUBLISHER = {IEEE},
YEAR = {2024},
MARGINALMARK = {$\bullet$},
DATE = {2024},
BOOKTITLE = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)},
PAGES = {20634--20643},
ADDRESS = {Seattle, WA, USA},
}

Endnote

%0 Conference Proceedings
%A Zhou, Keyang
%A Bhatnagar, Bharat Lal
%A Lenssen, Jan Eric
%A Pons-Moll, Gerard
%+ External Organizations
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
%T GEARS: Local Geometry-aware Hand-object Interaction Synthesis : 
%G eng
%U http://hdl.handle.net/21.11116/0000-000E-B1C4-C
%R 10.1109/CVPR52733.2024.01950
%D 2024
%B 37th IEEE/CVF Conference on Computer Vision and Pattern Recognition
%Z date of event: 2024-06-17 - 2024-06-21
%C Seattle, WA, USA
%B IEEE/CVF Conference on Computer Vision and Pattern Recognition
%P 20634 - 20643
%I IEEE
%@ 979-8-3503-5301-3

Conference paper

M. Fey, W. Hu, K. Huang, J. E. Lenssen, R. Ranjan, J. Robinson, R. Ying, J. You, and J. Leskovec

“Position: Relational Deep Learning - Graph Representation Learning on Relational Databases,” in Proceedings of the 41st International Conference on Machine Learning (ICML 2024), Vienna, Austria, 2024.

@inproceedings{Fey_ICML24,
TITLE = {Position: {R}elational Deep Learning -- Graph Representation Learning on Relational Databases},
AUTHOR = {Fey, Matthias and Hu, Weihua and Huang, Kexin and Lenssen, Jan Eric and Ranjan, Rishabh and Robinson, Joshua and Ying, Rex and You, Jiaxuan and Leskovec, Jure},
LANGUAGE = {eng},
ISSN = {1938-7228},
URL = {https://proceedings.mlr.press/v235/},
PUBLISHER = {MLR Press},
YEAR = {2024},
MARGINALMARK = {$\bullet$},
DATE = {2024},
BOOKTITLE = {Proceedings of the 41st International Conference on Machine Learning (ICML 2024)},
EDITOR = {Salakhutdinov, Ruslan and Kolter, Zico and Heller, Katherine and Weller, Adrian and Oliver, Nuria and Scarlett, Jonathan and Berkenkamp, Felix},
PAGES = {13592--13607},
SERIES = {Proceedings of the Machine Learning Research},
VOLUME = {235},
ADDRESS = {Vienna, Austria},
}

Endnote

%0 Conference Proceedings
%A Fey, Matthias
%A Hu, Weihua
%A Huang, Kexin
%A Lenssen, Jan Eric
%A Ranjan, Rishabh
%A Robinson, Joshua
%A Ying, Rex
%A You, Jiaxuan
%A Leskovec, Jure
%+ External Organizations
External Organizations
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
External Organizations
External Organizations
External Organizations
External Organizations
%T Position: Relational Deep Learning - Graph Representation Learning on Relational Databases : 
%G eng
%U http://hdl.handle.net/21.11116/0000-0010-B3BC-0
%D 2024
%B 41st International Conference on Machine Learning
%Z date of event: 2024-07-21 - 2024-07-27
%C Vienna, Austria
%B Proceedings of the 41st International Conference on Machine Learning
%E Salakhutdinov, Ruslan; Kolter, Zico; Heller, Katherine; Weller, Adrian; Oliver, Nuria; Scarlett, Jonathan; Berkenkamp, Felix
%P 13592 - 13607
%I MLR Press
%U https://proceedings.mlr.press/v235/
%B Proceedings of the Machine Learning Research
%N 235
%@ false
%U https://proceedings.mlr.press/v235/fey24a.html

Paper

C. Braunstein, H. Petekkaya, J. E. Lenssen, M. Toneva, and E. Ilg

“SLayR: Scene Layout Generation with Rectified Flow,” 2024. [Online]. Available: https://arxiv.org/abs/2412.05003.

Abstract

We introduce SLayR, Scene Layout Generation with Rectified flow.
State-of-the-art text-to-image models achieve impressive results. However, they
generate images end-to-end, exposing no fine-grained control over the process.
SLayR presents a novel transformer-based rectified flow model for layout
generation over a token space that can be decoded into bounding boxes and
corresponding labels, which can then be transformed into images using existing
models. We show that established metrics for generated images are inconclusive
for evaluating their underlying scene layout, and introduce a new benchmark
suite, including a carefully designed repeatable human-evaluation procedure
that assesses the plausibility and variety of generated layouts. In contrast to
previous works, which perform well in either high variety or plausibility, we
show that our approach performs well on both of these axes at the same time. It
is also at least 5x times smaller in the number of parameters and 37% faster
than the baselines. Our complete text-to-image pipeline demonstrates the added
benefits of an interpretable and editable intermediate representation.

BibTeX

@online{Braunstein2412.05003,
TITLE = {{SLayR}: Scene Layout Generation with Rectified Flow},
AUTHOR = {Braunstein, Cameron and Petekkaya, Hevra and Lenssen, Jan Eric and Toneva, Mariya and Ilg, Eddy},
LANGUAGE = {eng},
URL = {https://arxiv.org/abs/2412.05003},
EPRINT = {2412.05003},
EPRINTTYPE = {arXiv},
YEAR = {2024},
MARGINALMARK = {$\bullet$},
ABSTRACT = {We introduce SLayR, Scene Layout Generation with Rectified flow.<br>State-of-the-art text-to-image models achieve impressive results. However, they<br>generate images end-to-end, exposing no fine-grained control over the process.<br>SLayR presents a novel transformer-based rectified flow model for layout<br>generation over a token space that can be decoded into bounding boxes and<br>corresponding labels, which can then be transformed into images using existing<br>models. We show that established metrics for generated images are inconclusive<br>for evaluating their underlying scene layout, and introduce a new benchmark<br>suite, including a carefully designed repeatable human-evaluation procedure<br>that assesses the plausibility and variety of generated layouts. In contrast to<br>previous works, which perform well in either high variety or plausibility, we<br>show that our approach performs well on both of these axes at the same time. It<br>is also at least 5x times smaller in the number of parameters and 37% faster<br>than the baselines. Our complete text-to-image pipeline demonstrates the added<br>benefits of an interpretable and editable intermediate representation.<br>},
}

Endnote

%0 Report
%A Braunstein, Cameron
%A Petekkaya, Hevra
%A Lenssen, Jan Eric
%A Toneva, Mariya
%A Ilg, Eddy
%+ External Organizations
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
External Organizations
%T SLayR: Scene Layout Generation with Rectified Flow : 
%G eng
%U http://hdl.handle.net/21.11116/0000-0010-A3F6-0
%U https://arxiv.org/abs/2412.05003
%D 2024
%X   We introduce SLayR, Scene Layout Generation with Rectified flow.<br>State-of-the-art text-to-image models achieve impressive results. However, they<br>generate images end-to-end, exposing no fine-grained control over the process.<br>SLayR presents a novel transformer-based rectified flow model for layout<br>generation over a token space that can be decoded into bounding boxes and<br>corresponding labels, which can then be transformed into images using existing<br>models. We show that established metrics for generated images are inconclusive<br>for evaluating their underlying scene layout, and introduce a new benchmark<br>suite, including a carefully designed repeatable human-evaluation procedure<br>that assesses the plausibility and variety of generated layouts. In contrast to<br>previous works, which perform well in either high variety or plausibility, we<br>show that our approach performs well on both of these axes at the same time. It<br>is also at least 5x times smaller in the number of parameters and 37% faster<br>than the baselines. Our complete text-to-image pipeline demonstrates the added<br>benefits of an interpretable and editable intermediate representation.<br>
%K Computer Science, Computer Vision and Pattern Recognition, cs.CV

2023

Conference paper

C. Wewer, E. Ilg, B. Schiele, and J. E. Lenssen

“SimNP: Learning Self-Similarity Priors Between Neural Points,” in IEEE/CVF International Conference on Computer Vision (ICCV 2023), Paris, France, 2023.

@inproceedings{Wewer_ICCV23,
TITLE = {{SimNP}: {L}earning Self-Similarity Priors Between Neural Points},
AUTHOR = {Wewer, Christopher and Ilg, Eddy and Schiele, Bernt and Lenssen, Jan Eric},
LANGUAGE = {eng},
ISBN = {979-8-3503-0718-4},
DOI = {10.1109/ICCV51070.2023.00812},
PUBLISHER = {IEEE},
YEAR = {2023},
MARGINALMARK = {$\bullet$},
DATE = {2023},
BOOKTITLE = {IEEE/CVF International Conference on Computer Vision (ICCV 2023)},
PAGES = {8807--8818},
ADDRESS = {Paris, France},
}

Endnote

%0 Conference Proceedings
%A Wewer, Christopher
%A Ilg, Eddy
%A Schiele, Bernt
%A Lenssen, Jan Eric
%+ Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
%T SimNP: Learning Self-Similarity Priors Between Neural Points : 
%G eng
%U http://hdl.handle.net/21.11116/0000-000D-C95E-8
%R 10.1109/ICCV51070.2023.00812
%D 2023
%B IEEE/CVF International Conference on Computer Vision
%Z date of event: 2023-10-02 - 2023-10-06
%C Paris, France
%B IEEE/CVF International Conference on Computer Vision
%P 8807 - 8818
%I IEEE
%@ 979-8-3503-0718-4

Paper

M. Fey, W. Hu, K. Huang, J. E. Lenssen, R. Ranjan, J. Robinson, R. Ying, J. You, and J. Leskovec

“Relational Deep Learning: Graph Representation Learning on Relational Databases,” 2023. [Online]. Available: https://arxiv.org/abs/2312.04615.

Abstract

Much of the world's most valued data is stored in relational databases and
data warehouses, where the data is organized into many tables connected by
primary-foreign key relations. However, building machine learning models using
this data is both challenging and time consuming. The core problem is that no
machine learning method is capable of learning on multiple tables
interconnected by primary-foreign key relations. Current methods can only learn
from a single table, so the data must first be manually joined and aggregated
into a single training table, the process known as feature engineering. Feature
engineering is slow, error prone and leads to suboptimal models. Here we
introduce an end-to-end deep representation learning approach to directly learn
on data laid out across multiple tables. We name our approach Relational Deep
Learning (RDL). The core idea is to view relational databases as a temporal,
heterogeneous graph, with a node for each row in each table, and edges
specified by primary-foreign key links. Message Passing Graph Neural Networks
can then automatically learn across the graph to extract representations that
leverage all input data, without any manual feature engineering. Relational
Deep Learning leads to more accurate models that can be built much faster. To
facilitate research in this area, we develop RelBench, a set of benchmark
datasets and an implementation of Relational Deep Learning. The data covers a
wide spectrum, from discussions on Stack Exchange to book reviews on the Amazon
Product Catalog. Overall, we define a new research area that generalizes graph
machine learning and broadens its applicability to a wide set of AI use cases.

BibTeX

@online{Fey2312.04615,
TITLE = {Relational Deep Learning: Graph Representation Learning on Relational Databases},
AUTHOR = {Fey, Matthias and Hu, Weihua and Huang, Kexin and Lenssen, Jan Eric and Ranjan, Rishabh and Robinson, Joshua and Ying, Rex and You, Jiaxuan and Leskovec, Jure},
LANGUAGE = {eng},
URL = {https://arxiv.org/abs/2312.04615},
EPRINT = {2312.04615},
EPRINTTYPE = {arXiv},
YEAR = {2023},
MARGINALMARK = {$\bullet$},
ABSTRACT = {Much of the world's most valued data is stored in relational databases and<br>data warehouses, where the data is organized into many tables connected by<br>primary-foreign key relations. However, building machine learning models using<br>this data is both challenging and time consuming. The core problem is that no<br>machine learning method is capable of learning on multiple tables<br>interconnected by primary-foreign key relations. Current methods can only learn<br>from a single table, so the data must first be manually joined and aggregated<br>into a single training table, the process known as feature engineering. Feature<br>engineering is slow, error prone and leads to suboptimal models. Here we<br>introduce an end-to-end deep representation learning approach to directly learn<br>on data laid out across multiple tables. We name our approach Relational Deep<br>Learning (RDL). The core idea is to view relational databases as a temporal,<br>heterogeneous graph, with a node for each row in each table, and edges<br>specified by primary-foreign key links. Message Passing Graph Neural Networks<br>can then automatically learn across the graph to extract representations that<br>leverage all input data, without any manual feature engineering. Relational<br>Deep Learning leads to more accurate models that can be built much faster. To<br>facilitate research in this area, we develop RelBench, a set of benchmark<br>datasets and an implementation of Relational Deep Learning. The data covers a<br>wide spectrum, from discussions on Stack Exchange to book reviews on the Amazon<br>Product Catalog. Overall, we define a new research area that generalizes graph<br>machine learning and broadens its applicability to a wide set of AI use cases.<br>},
}

Endnote

%0 Report
%A Fey, Matthias
%A Hu, Weihua
%A Huang, Kexin
%A Lenssen, Jan Eric
%A Ranjan, Rishabh
%A Robinson, Joshua
%A Ying, Rex
%A You, Jiaxuan
%A Leskovec, Jure
%+ External Organizations
External Organizations
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
External Organizations
External Organizations
External Organizations
External Organizations
%T Relational Deep Learning: Graph Representation Learning on Relational
  Databases : 
%G eng
%U http://hdl.handle.net/21.11116/0000-0010-A3EB-D
%U https://arxiv.org/abs/2312.04615
%D 2023
%X   Much of the world's most valued data is stored in relational databases and<br>data warehouses, where the data is organized into many tables connected by<br>primary-foreign key relations. However, building machine learning models using<br>this data is both challenging and time consuming. The core problem is that no<br>machine learning method is capable of learning on multiple tables<br>interconnected by primary-foreign key relations. Current methods can only learn<br>from a single table, so the data must first be manually joined and aggregated<br>into a single training table, the process known as feature engineering. Feature<br>engineering is slow, error prone and leads to suboptimal models. Here we<br>introduce an end-to-end deep representation learning approach to directly learn<br>on data laid out across multiple tables. We name our approach Relational Deep<br>Learning (RDL). The core idea is to view relational databases as a temporal,<br>heterogeneous graph, with a node for each row in each table, and edges<br>specified by primary-foreign key links. Message Passing Graph Neural Networks<br>can then automatically learn across the graph to extract representations that<br>leverage all input data, without any manual feature engineering. Relational<br>Deep Learning leads to more accurate models that can be built much faster. To<br>facilitate research in this area, we develop RelBench, a set of benchmark<br>datasets and an implementation of Relational Deep Learning. The data covers a<br>wide spectrum, from discussions on Stack Exchange to book reviews on the Amazon<br>Product Catalog. Overall, we define a new research area that generalizes graph<br>machine learning and broadens its applicability to a wide set of AI use cases.<br>
%K Computer Science, Learning, cs.LG,Computer Science, Databases, cs.DB

2022

Conference paper

G. Tiwari, D. Antic, J. E. Lenssen, N. Sarafianos, T. Tung, and G. Pons-Moll

“Pose-NDF: Modeling Human Pose Manifolds with Neural Distance Fields,” in Computer Vision -- ECCV 2022, Tel Aviv, Israel, 2022.

@inproceedings{Tiwari_ECCV22,
TITLE = {Pose-{NDF}: {M}odeling Human Pose Manifolds with Neural Distance Fields},
AUTHOR = {Tiwari, Garvita and Antic, Dimitrije and Lenssen, Jan Eric and Sarafianos, Nikolaos and Tung, Tony and Pons-Moll, Gerard},
LANGUAGE = {eng},
ISBN = {10.1007/978-3-031-20065-6{\textunderscore}33},
DOI = {10.1007/978-3-031-20065-6_33},
PUBLISHER = {Springer},
YEAR = {2022},
DATE = {2022},
BOOKTITLE = {Computer Vision -- ECCV 2022},
EDITOR = {Avidan, Shai and Brostow, Gabriel and Ciss{\'e}, Moustapha and Farinella, Giovanni and Hassner, Tal},
PAGES = {572--589},
SERIES = {Lecture Notes in Computer Science},
VOLUME = {13665},
ADDRESS = {Tel Aviv, Israel},
}

Endnote

%0 Conference Proceedings
%A Tiwari, Garvita
%A Antic, Dimitrije
%A Lenssen, Jan Eric
%A Sarafianos, Nikolaos
%A Tung, Tony
%A Pons-Moll, Gerard
%+ Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
External Organizations
External Organizations
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
%T Pose-NDF: Modeling Human Pose Manifolds with Neural Distance Fields : 
%G eng
%U http://hdl.handle.net/21.11116/0000-000A-B582-6
%R 10.1007/978-3-031-20065-6_33
%D 2022
%B 17th European Conference on Computer Vision
%Z date of event: 2022-10-23 - 2022-10-27
%C Tel Aviv, Israel
%B Computer Vision -- ECCV 2022
%E Avidan, Shai; Brostow, Gabriel; Ciss&#233;, Moustapha; Farinella, Giovanni; Hassner, Tal
%P 572 - 589
%I Springer
%@ 10.1007/978-3-031-20065-6_33
%B Lecture Notes in Computer Science
%N 13665
%U https://rdcu.be/c26RY

Conference paper

K. Zhou, B. L. Bhatnagar, J. E. Lenssen, and G. Pons-Moll

“TOCH: Spatio-Temporal Object Correspondence to Hand for Motion Refinement,” in Computer Vision -- ECCV 2022, Tel Aviv, Israel, 2022.

@inproceedings{Zhou_ECCV2022,
TITLE = {{TOCH}: {S}patio-Temporal Object Correspondence to Hand for Motion Refinement},
AUTHOR = {Zhou, Keyang and Bhatnagar, Bharat Lal and Lenssen, Jan Eric and Pons-Moll, Gerard},
LANGUAGE = {eng},
ISBN = {978-3-031-20061-8},
DOI = {10.1007/978-3-031-20062-5_1},
PUBLISHER = {Springer},
YEAR = {2022},
DATE = {2022},
BOOKTITLE = {Computer Vision -- ECCV 2022},
EDITOR = {Avidan, Shai and Brostow, Gabriel and Ciss{\'e}, Moustapha and Farinella, Giovanni and Hassner, Tal},
PAGES = {1--19},
SERIES = {Lecture Notes in Computer Science},
VOLUME = {13663},
ADDRESS = {Tel Aviv, Israel},
}

Endnote

%0 Conference Proceedings
%A Zhou, Keyang
%A Bhatnagar, Bharat Lal
%A Lenssen, Jan Eric
%A Pons-Moll, Gerard
%+ Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
%T TOCH: Spatio-Temporal Object Correspondence to Hand for Motion
  Refinement : 
%G eng
%U http://hdl.handle.net/21.11116/0000-000A-B586-2
%R 10.1007/978-3-031-20062-5_1
%D 2022
%B 17th European Conference on Computer Vision
%Z date of event: 2022-10-23 - 2022-10-27
%C Tel Aviv, Israel
%B Computer Vision -- ECCV 2022
%E Avidan, Shai; Brostow, Gabriel; Ciss&#233;, Moustapha; Farinella, Giovanni; Hassner, Tal
%P 1 - 19
%I Springer
%@ 978-3-031-20061-8
%B Lecture Notes in Computer Science
%N 13663
%U https://rdcu.be/c26JY

Paper

K. Zhou, B. Lal Bhatnagar, J. E. Lenssen, and G. Pons-Moll

“TOCH: Spatio-Temporal Object Correspondence to Hand for Motion Refinement,” 2022. [Online]. Available: https://arxiv.org/abs/2205.07982.

Abstract

We present TOCH, a method for refining incorrect 3D hand-object interaction
sequences using a data prior. Existing hand trackers, especially those that
rely on very few cameras, often produce visually unrealistic results with
hand-object intersection or missing contacts. Although correcting such errors
requires reasoning about temporal aspects of interaction, most previous work
focus on static grasps and contacts. The core of our method are TOCH fields, a
novel spatio-temporal representation for modeling correspondences between hands
and objects during interaction. The key component is a point-wise
object-centric representation which encodes the hand position relative to the
object. Leveraging this novel representation, we learn a latent manifold of
plausible TOCH fields with a temporal denoising auto-encoder. Experiments
demonstrate that TOCH outperforms state-of-the-art (SOTA) 3D hand-object
interaction models, which are limited to static grasps and contacts. More
importantly, our method produces smooth interactions even before and after
contact. Using a single trained TOCH model, we quantitatively and qualitatively
demonstrate its usefulness for 1) correcting erroneous reconstruction results
from off-the-shelf RGB/RGB-D hand-object reconstruction methods, 2) de-noising,
and 3) grasp transfer across objects. We will release our code and trained
model on our project page at virtualhumans.mpi-inf.mpg.de/toch/

BibTeX

@online{Zhou_2205.07982,
TITLE = {{TOCH}: Spatio-Temporal Object Correspondence to Hand for Motion Refinement},
AUTHOR = {Zhou, Keyang and Lal Bhatnagar, Bharat and Lenssen, Jan Eric and Pons-Moll, Gerard},
LANGUAGE = {eng},
URL = {https://arxiv.org/abs/2205.07982},
EPRINT = {2205.07982},
EPRINTTYPE = {arXiv},
YEAR = {2022},
ABSTRACT = {We present TOCH, a method for refining incorrect 3D hand-object interaction<br>sequences using a data prior. Existing hand trackers, especially those that<br>rely on very few cameras, often produce visually unrealistic results with<br>hand-object intersection or missing contacts. Although correcting such errors<br>requires reasoning about temporal aspects of interaction, most previous work<br>focus on static grasps and contacts. The core of our method are TOCH fields, a<br>novel spatio-temporal representation for modeling correspondences between hands<br>and objects during interaction. The key component is a point-wise<br>object-centric representation which encodes the hand position relative to the<br>object. Leveraging this novel representation, we learn a latent manifold of<br>plausible TOCH fields with a temporal denoising auto-encoder. Experiments<br>demonstrate that TOCH outperforms state-of-the-art (SOTA) 3D hand-object<br>interaction models, which are limited to static grasps and contacts. More<br>importantly, our method produces smooth interactions even before and after<br>contact. Using a single trained TOCH model, we quantitatively and qualitatively<br>demonstrate its usefulness for 1) correcting erroneous reconstruction results<br>from off-the-shelf RGB/RGB-D hand-object reconstruction methods, 2) de-noising,<br>and 3) grasp transfer across objects. We will release our code and trained<br>model on our project page at http://virtualhumans.mpi-inf.mpg.de/toch/<br>},
}

Endnote

%0 Report
%A Zhou, Keyang
%A Lal Bhatnagar, Bharat
%A Lenssen, Jan Eric
%A Pons-Moll, Gerard
%+ Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
%T TOCH: Spatio-Temporal Object Correspondence to Hand for Motion
  Refinement : 
%G eng
%U http://hdl.handle.net/21.11116/0000-000A-ACF3-2
%U https://arxiv.org/abs/2205.07982
%D 2022
%X   We present TOCH, a method for refining incorrect 3D hand-object interaction<br>sequences using a data prior. Existing hand trackers, especially those that<br>rely on very few cameras, often produce visually unrealistic results with<br>hand-object intersection or missing contacts. Although correcting such errors<br>requires reasoning about temporal aspects of interaction, most previous work<br>focus on static grasps and contacts. The core of our method are TOCH fields, a<br>novel spatio-temporal representation for modeling correspondences between hands<br>and objects during interaction. The key component is a point-wise<br>object-centric representation which encodes the hand position relative to the<br>object. Leveraging this novel representation, we learn a latent manifold of<br>plausible TOCH fields with a temporal denoising auto-encoder. Experiments<br>demonstrate that TOCH outperforms state-of-the-art (SOTA) 3D hand-object<br>interaction models, which are limited to static grasps and contacts. More<br>importantly, our method produces smooth interactions even before and after<br>contact. Using a single trained TOCH model, we quantitatively and qualitatively<br>demonstrate its usefulness for 1) correcting erroneous reconstruction results<br>from off-the-shelf RGB/RGB-D hand-object reconstruction methods, 2) de-noising,<br>and 3) grasp transfer across objects. We will release our code and trained<br>model on our project page at http://virtualhumans.mpi-inf.mpg.de/toch/<br>
%K Computer Science, Computer Vision and Pattern Recognition, cs.CV