„Great ScrumMaster, The” został dodany do koszyka. Zobacz koszyk

save

30,05 zł

Multimodal Foundation Models

432,15 zł~~462,20 zł~~

Opis
Product Details

This monograph presents a comprehensive survey of the taxonomy and evolution of multimodal foundation models that demonstrate vision and vision-language capabilities, focusing on the transition from specialist models to general-purpose assistants.

The focus encompasses five core topics, categorized into two classes; (i) a survey of well-established research areas: multimodal foundation models pre-trained for specific purposes, including two topics – methods of learning vision backbones for visual understanding and text-to-image generation; (ii) recent advances in exploratory, open research areas: multimodal foundation models that aim to play the role of general-purpose assistants, including three topics – unified vision models inspired by large language models (LLMs), end-to-end training of multimodal LLMs, and chaining multimodal tools with LLMs.

The target audience of the monograph is researchers, graduate students, and professionals in computer vision and vision-language multimodal communities who are eager to learn the basics and recent advances in multimodal foundation models.

Opis
Product Details

SKU:	9781638283362
Category:	Informatyka

Podtytuł	From Specialists to General-Purpose Assistants
Autor	Li Chunyuan
Wydawca	Now Publishers
Język	angielski
Rok	2024
Stron	230
Oprawa	Miękka
ISBN	9781638283362
Infromacja GPSR	PROGMAR 40-748 Katowice ul.Strzelnica 60

Subtotal:

save

Multimodal Foundation Models