AI Paper Research

A survey and summary of AI papers

Multimodal: 2022

1 paper

NeurIPS 2022 · 3,000+

Flamingo: a Visual Language Model for Few-Shot Learning

Jean-Baptiste Alayrac, Jeff Donahue, Pauline Luc et al. (2022)
