📑

AI Paper Research

AI 논문 조사 및 정리

Foundations
경량화·효율화Efficient AI
QLoRA: Efficient Finetuning of Quantized...AWQ: Activation-aware Weight Quantizatio...Fast Inference from Transformers via Spe...Efficient Memory Management for Large La...FlashAttention-2: Faster Attention with ...
Switch Transformers: Scaling to Trillion...FlashAttention: Fast and Memory-Efficien...GPTQ: Accurate Post-Training Quantizatio...
LoRA: Low-Rank Adaptation of Large Langu...
Distilling the Knowledge in a Neural Net...
홈/경량화·효율화/2021

경량화·효율화 — 2021

1편의 논문

ICLR 20228,000+

LoRA: Low-Rank Adaptation of Large Language Models

LoRA: 대규모 언어 모델의 저순위 적응

Edward J. Hu, Yelong Shen, Phillip Wallis et al. (2021)

← 경량화·효율화 전체