New research area “Vision and Language Models (VLMs)” at the Saarbrücken Research Center for Visual Computing, Interaction and Artificial Intelligence under the direction of Professor Bernt Schiele Saarbrücken [...] launched in November 2022 at the MPI in Saarbrücken, is establishing a new research area “Vision and Language Models (VLMs)”, led by Professor Bernt Schiele. This was presented by the Max Planck Directors Bernt [...] and machine learning to the VIA Center. His team will advance the state of the art in vision and language models (VLMs), with a particular focus on zero-shot learning, self-supervised learning, novel p
Mattia and Piccinelli, Luigi and Li, Siyuan and Gool, Luc Van and Yu, Fisher and Schiele, B.}, LANGUAGE = {eng}, ISBN = {978-3-031-73241-6}, DOI = {10.1007/978-3-031-73242-3_1}, PUBLISHER = {Springer} [...] Mattia and Piccinelli, Luigi and Li, Siyuan and Yang, Yung-Hsu and Schiele, Bernt and Van Gool, Luc}, LANGUAGE = {eng}, URL = {https://arxiv.org/abs/2410.01806}, EPRINT = {2410.01806}, EPRINTTYPE = {arXiv}, [...] Mattia and Tai, Yu-Wing and Yu, Fisher and Tang, Chi-Keung and Schiele, Bernt and Dai, Dengxin}, LANGUAGE = {eng}, URL = {https://openreview.net/group?id=ICLR.cc/2023/Conference#poster}, PUBLISHER = {OpenReview
{C}ontextual Media Retrieval Using Natural Language Queries}, AUTHOR = {Nag Chowdhury, Sreyasi and Malinowski, Mateusz and Bulling, Andreas and Fritz, Mario}, LANGUAGE = {eng}, ISBN = {978-1-4503-4359-6}, DOI [...] Mateusz and Fritz, Mario}, LANGUAGE = {eng}, URL = {https://arxiv.org/abs/1501.03302}, EPRINT = {1501.03302}, EPRINTTYPE = {arXiv}, YEAR = {2015}, ABSTRACT = {Progress in language and image understanding [...] AUTHOR = {Malinowski, Mateusz and Fritz, Mario}, LANGUAGE = {eng}, EPRINT = {1410.8027}, EPRINTTYPE = {arXiv}, YEAR = {2014}, ABSTRACT = {As language and visual understanding by machines progresses rapidly
Automatic text alignment is an important problem in natural language processing. It can be used to create the data needed to train different language models. Most research about automatic summarization revolves [...]
Group Computational Biology RG1 Automation of Logic RG2 Network and Cloud Systems RG3 Multimodal Language Processing Databases and Information Systems People Former Members and Guests Research Commonsense
J. Keuper “Are Vision Language Models Texture or Shape Biased and Can We Steer Them?,” 2024. [Online]. Available: https://arxiv.org/abs/2403.09193. Abstract Vision language models (VLMs) have drastically [...] Keuper, Janis}, LANGUAGE = {eng}, URL = {https://arxiv.org/abs/2403.09193}, EPRINT = {2403.09193}, EPRINTTYPE = {arXiv}, YEAR = {2024}, MARGINALMARK = {$\bullet$}, ABSTRACT = {Vision language models (VLMs) [...] Vision Language Models Texture or Shape Biased and Can We Steer Them? : %G eng %U http://hdl.handle.net/21.11116/0000-0010-5DEE-B %U https://arxiv.org/abs/2403.09193 %D 2024 %X Vision language models (VLMs)
Introduction to Human Computer Systems Research Projects A Movie Description Dataset Generating natural language descriptions for videos and images Large-Scale Knowledge Transfer MPII Cooking Activities Dataset
Knowledge Mango: Multi-Cultural Commonsense Knowledge Distillation. Despite recent progress, large language models (LLMs) still face the challenge of appropriately reacting to the intricacies of social and
and Multilabel Classification}, AUTHOR = {Lapin, Maksim and Hein, Matthias and Schiele, Bernt}, LANGUAGE = {eng}, ISSN = {0162-8828}, DOI = {10.1109/TPAMI.2017.2751607}, PUBLISHER = {IEEE}, ADDRESS = [...] {Image Classification with Limited Training Data and Class Ambiguity}, AUTHOR = {Lapin, Maksim}, LANGUAGE = {eng}, URL = {urn:nbn:de:bsz:291-scidok-69098}, DOI = {10.22028/D291-26775}, SCHOOL = {Universit{\"a}t [...] Error: {A}nalysis and Insights}, AUTHOR = {Lapin, Maksim and Hein, Matthias and Schiele, Bernt}, LANGUAGE = {eng}, ISBN = {978-1-4673-8852-8}, DOI = {10.1109/CVPR.2016.163}, PUBLISHER = {IEEE Computer Society}