AI Paper Research

AI paper survey and summaries

Audio & Speech: 2022

3 papers

ICML 2023 · 5,000+

Robust Speech Recognition via Large-Scale Weak Supervision

Alec Radford, Jong Wook Kim, Tao Xu et al. (2022)

IEEE/ACM TASLP 2023 · 1,000+

AudioLM: a Language Modeling Approach to Audio Generation

Zalán Borsos, Raphaël Marinier, Damien Vincent et al. (2022)

arXiv · 1,500+

High Fidelity Neural Audio Compression

Alexandre Défossez, Jade Copet, Gabriel Synnaeve et al. (2022)