AI Paper Research

AI paper survey and summaries


Audio & Speech: 2021

2 papers

IEEE/ACM TASLP 2021 · 3,000+

HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units

Wei-Ning Hsu, Benjamin Bolte, Yao-Hung Hubert Tsai et al. (2021)

IEEE/ACM TASLP 2022 · 1,500+

SoundStream: An End-to-End Neural Audio Codec

Neil Zeghidour, Alejandro Luebs, Ahmed Omran et al. (2021)
