📑

AI Paper Research

AI 논문 조사 및 정리

Foundations
음성·오디오Audio & Speech
Neural Codec Language Models are Zero-Sh...MusicLM: Generating Music From TextBark: Text-Prompted Generative Audio Mod...
Robust Speech Recognition via Large-Scal...AudioLM: a Language Modeling Approach to...High Fidelity Neural Audio Compression
HuBERT: Self-Supervised Speech Represent...SoundStream: An End-to-End Neural Audio ...
Natural TTS Synthesis by Conditioning Wa...
WaveNet: A Generative Model for Raw Audi...
홈/음성·오디오/2023

음성·오디오 — 2023

3편의 논문

arXiv1,000+

Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers

신경 코덱 언어 모델은 제로샷 텍스트 음성 합성기이다

Chengyi Wang, Sanyuan Chen, Yu Wu et al. (2023)

arXiv1,000+

MusicLM: Generating Music From Text

MusicLM: 텍스트로부터 음악 생성

Andrea Agostinelli, Timo I. Denk, Zalán Borsos et al. (2023)

GitHub / Suno AI500+

Bark: Text-Prompted Generative Audio Model

Bark: 텍스트 프롬프트 기반 생성 오디오 모델

Suno AI (2023)

← 음성·오디오 전체