Great audio AEs/codecs exist, but when you need structured latents or a tweaked bottleneck for a downstream task (e.g. generation), retraining is expensive & brittle.
We Re-Bottleneck👇
I'm thrilled to announce that our paper, "Generation or Replication: Auscultating Audio Latent Diffusion Models" 🩺 with the Speech & Audio team at MERL, has been accepted for publication at #ICASSP2024!
``Generation or Replication: Auscultating Audio Latent Diffusion Models. (arXiv:2310.10604v1 [eess.AS]),'' Dimitrios Bralios, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux, ift.tt/FTuqsxK
A little bit of bias by my side 🎶! We often neglect the variety in importance of training examples for separation models. Interestingly, we can solve important problems like robustness! Please check our work with @DBralios! arxiv.org/pdf/2010.13228…
50 Followers 102 FollowingResearcher in spatial audio & signal processing at Yamaha Corp. (2019~) / Ph.D student at Kyoto Univ. (2022~) / All tweets are my own.
235 Followers 243 FollowingResearch scientist @NVIDIA working on deep generative models for sequences, with a particular focus on speech and audio. Personal account.
1K Followers 220 FollowingResearching pixels @freepik. Independently created the Chroma & Radiance models as personal projects.
https://t.co/d8h2mIi9zS
https://t.co/RrPsPku3y6
259 Followers 291 FollowingPh.D. (Engineering)/Specially Appointed Researcher at Toyota Central R&D Labs., Inc./Vision & Language, Intelligent Robotics
477 Followers 378 FollowingPhD student in AI+Music @c4dm Expressive piano performance, computational musicology, music information retrieval, music education.
I play piano and theremin.
91K Followers 6K FollowingUpdates and commentary on startups, economics, geopolitics, political risk, global markets and tech. Focus on Greece & Europe.
2K Followers 413 FollowingSenior Researcher @category_xyz | MEV guy. Game Theorist who loves playing games. | Not financial, investment, legal, or tax advice; comedy, for fun only.
355 Followers 607 FollowingResearch engineer II @SonyAI_global Doing research on sound generation/restoration with diffusion models. Tweets are my own. Love fender 🎸
58K Followers 82 FollowingSolve all disease.
Developing and applying frontier AI to unlock deeper scientific insights, faster breakthroughs, and life-changing medicines.
50 Followers 102 FollowingResearcher in spatial audio & signal processing at Yamaha Corp. (2019~) / Ph.D student at Kyoto Univ. (2022~) / All tweets are my own.
1K Followers 124 FollowingWe are a research team on artificial intelligence for automotive applications working toward assisted and autonomous driving.
556K Followers 2K FollowingPolyagentmorous ClawFather. Came back from retirement to mess with AI and help a lobster take over the world.
@OpenClaw🦞 + @OpenAI
38K Followers 707 Followingex world model lead @xAI | ex @Nvidia @Meta | 30+ papers, 9k citations | talk about AI, LLM, video generation, multimodal, AGI
6K Followers 424 FollowingAssistant Professor of Computing Science @SFU. Ph.D. from @Berkeley_EECS and Bachelor's from @UofTCompSci. Formerly @GoogleAI and Member of @the_IAS.
8K Followers 21 FollowingGrad&Clip&EM is all you need @Kimi_Moonshot
Blog: https://t.co/YVxsWylklA , Cool Papers: https://t.co/scS1n1oyaO
Interesting link: https://t.co/7Tl4HaVajh, https://t.co/Y5qaxAA9Iy
3K Followers 427 FollowingAI Research @Cartesia. Prev: @NVIDIA, @GoogleAI, @Qualcomm, @Merl_news, PhD in Efficient Deep Learning @VUamsterdam. Opinions my own.
1K Followers 220 FollowingResearching pixels @freepik. Independently created the Chroma & Radiance models as personal projects.
https://t.co/d8h2mIi9zS
https://t.co/RrPsPku3y6