Large vision-language models: pre-training, prompting, and applications
Éditeur :
Springer International Publishing AG
ISBN :
9783031949685
Date de publication :
31 août 2025
Dimensions :
23,5 x 15,5 cm
Langue :
Anglais
Pays d'origine :
Suisse
The rapid progress in the field of large multimodal foundation models, especially vision-language models, has dramatically transformed the landscape of machine learning, computer vision, and natural language processing.