mittu1204 Profile Banner
Yuki Mitsufuji Profile
Yuki Mitsufuji

@mittu1204

Followers
4K
Following
42K
Statuses
3K

PhD, Distinguished Engineer @Sony, Lead Research Scientist/VP of AI Research @SonyAI_global, Head of Creative AI Lab, Associate Prof. @tokyotech_jp

Manhattan, NY
Joined December 2009
Don't wanna be here? Send us removal request.
@mittu1204
Yuki Mitsufuji
2 months
I'm very happy to see that MMAudio, which my talented colleagues (@mi141, A. Hayakawa, @yahshibu) and intern (@hkchengrex) at Sony AI have invested their time and effort into, is being tested by so many creative people in X arXiv:
@blizaine
Blaine Brown
2 months
Mario irl. Google Veo2 + MMAudio is magic. 🪄😁 (thread 🧵1/3) 🔊🔊
6
12
88
@mittu1204
Yuki Mitsufuji
2 days
1 spotlight, 5 posters from us at #ICLR2025 "Weighted Point Cloud Embedding for Multimodal Contrastive Learning Toward Optimal Similarity Metric" led by Toshimitsu Uesaka, with strong support from Prof. Taiji Suzuki (@btreetaiji), was selected as a spotlight. Congrats🎊
0
1
6
@mittu1204
Yuki Mitsufuji
6 days
Fast sampling methods for discrete diffusion from our lab 🏎️ Our sampling schedule optimization method (Jump Your Steps) is accepted at #ICLR2025 Another (Di4C) is about distllation for discrete diffusion!
@takiko_san
Yuhta Takida
6 days
📢Discrete diffusion models are trending! Check out the latest work from our group (@mittu1204) in this exciting field: 1️⃣ Di4C: Fast sampling through distillation 📄 2️⃣ Jump Your Step: Optimizing sampling schedules (ICLR'25) 📄
0
0
12
@mittu1204
Yuki Mitsufuji
17 days
1
0
1
@mittu1204
Yuki Mitsufuji
23 days
When you register for #ICASSP2025, don't forget to select our tutorial on diffusion models for audio: 🎶Transforming Chaos into Harmony: Diffusion Models in Audio Signal Processing🎶 See you in Hyderabad, India!🇮🇳
Tweet media one
1
8
44
@mittu1204
Yuki Mitsufuji
2 months
"これまでにevalaが発表してきた36の立体音響作品のサウンド・データを学習したサウンドエフェクト生成AIを用いて,空間的作品をつくる試みです.サウンド(evalaがシグネチャーとして作品の始まりに用いている汽笛の音)とテキスト(学習に用いられた作品のうち8作品のタイトルをチャンネルごとに選択)の二つをリファレンス(プロンプト)として,リアルタイムかつマルチチャンネルで「evalaのような音」が生成され続けます.このプロジェクトは,作家不在でも立体音響作品を永続的に継承・制作しうる新しいアーカイヴのかたちを探求する実験であり,本作品はその最初のスケッチとなります."
@NTTICC
NTT ICC
2 months
《Studies for》は,これまでにevala @evalaport が発表してきた36の立体音響作品のサウンド・データを学習したサウンドエフェクト生成AIを用い,空間的作品を創る試み. 作家不在でも立体音響作品を永続的に継承・制作しうる新しいアーカイヴの形を探求する実験. #DOMMUNE
0
3
10
@mittu1204
Yuki Mitsufuji
3 months
RT @SonyAI_global: 🚀 PaGoDA by Sony AI: High-res image generation without retraining! Fast, efficient, and quality-focused. #NeurIPS2024 h…
0
5
0
@mittu1204
Yuki Mitsufuji
3 months
RT @SonyAI_global: #GenWarp by Sony AI creates realistic perspectives from a single image! See how it works 🧵 #NeurIPS2024
0
3
0
@mittu1204
Yuki Mitsufuji
3 months
🎶Large music models from our team🎶: 1. SoniDo🎼 for music mixing, demixing, transcription, etc. pdf: 2. OpenMU🧙‍♂️ for music captioning, reasoning, lyric understanding, etc. pdf: code: demo @ISMIRConf : #ISMIR2024
@yukara_ikemiya
Yukara IKEMIYA
3 months
【🎸Music Foundation Model report by Sony AI】 Our team published a paper validating the effectiveness of a foundation model for music generation, showing that combining it with our music analysis techniques achieves higher performance. Paper:
Tweet media one
0
8
50
@mittu1204
Yuki Mitsufuji
3 months
A blog on our two papers has been published
0
0
2
@mittu1204
Yuki Mitsufuji
3 months
A list of diffusion works & tutorial (at #ISMIR2024) from our lab! [ML] #NeurIPS24 (GenWarp: Novel View Synthesis) #NeurIPS24 (PaGoDA: Multi-Scale 1 Step Generator) #ICLR24 (CTM: Fast Image Gen.) #ICLR24 (MPGD: Guided Diffusion) #ICML23 (FP-Diff: Consistency-type Model) #ICML23 (Blind Inverse) [Audio/NLP] #ACL24 (Knowledge Gen.) #IJCAI24 (Music Editing) #ICASSP24 (Declipping) #ICASSP24 (Speech Enh.) #ICASSP23 (Music Transcription) #ICASSP23 (Vocoder) #ICASSP23 (Dereverb) #INTERSPEECH23 (Speech Enh.)
@mittu1204
Yuki Mitsufuji
8 months
If you will be at #ismir2024, don't forget to register our tutorial led by ⁦⁦@JCJesseLai
Tweet media one
1
15
63
@mittu1204
Yuki Mitsufuji
5 months
Here's a sneak peek of a crowd-based competition on sounding video generation starting from Oct. 1st! #ECCV2024
0
2
20
@mittu1204
Yuki Mitsufuji
5 months
If you are working on a family of inverse problems using diffusion models and are confused about their relationships, this survey paper will clear your head!
@giannis_daras
Giannis Daras
5 months
Why are there so many different methods for using diffusion models for inverse problems? 🤔 And how do these methods relate to each other? In this survey, we review more than 35 different methods and we attempt to unify them into common mathematical formulations.
Tweet media one
1
4
58
@mittu1204
Yuki Mitsufuji
6 months
RT @SonyAI_global: Meet @mittu1204, Lead Research Scientist, overseeing music and sound research within our AI for Creators Flagship. Lear…
0
6
0
@mittu1204
Yuki Mitsufuji
7 months
A list of diffusion works + tutorial from our lab! [ML] #ACL2024 (Knowledge Generation) #ICLR2024 (Consistency Trajectory Model) #ICLR2024 (Manifold Preserving Guided Diffusion) #ICML2023 (Consistency-type Model) #ICML2023 (Blind Image Restoration) [Audio] #IJCAI2024 (Music Editing) #ICASSP2024 (Declipping) #ICASSP2024 (Speech Enhancement) #ICASSP2023 (Music Transcription) #ICASSP2023 (Vocoder) #ICASSP2023 (Dereverb) #INTERSPEECH2023 (Speech Enhancement)
@mittu1204
Yuki Mitsufuji
8 months
If you will be at #ismir2024, don't forget to register our tutorial led by ⁦⁦@JCJesseLai
Tweet media one
1
20
98
@mittu1204
Yuki Mitsufuji
8 months
@zicokolter Congrats! 🎉
0
0
1
@mittu1204
Yuki Mitsufuji
8 months
📢For researchers in the Audio-Visual field, the call for papers of our #ECCV2024 workshop (AVGenL) is out Don't miss the deadline July 15⏳
@shiqi_yang_147
Shiqi Yang
9 months
Initial CfP advertisement for the ECCV 2024 workshop "AVGenL: Audio-Visual Generation and Learning". It will cover a wide range of topics about audio-visual generation and learning. Paper submission deadline is 15 Jul 2024. #ECCV2024 #ECCV @eccvconf
Tweet media one
Tweet media two
Tweet media three
0
1
17
@mittu1204
Yuki Mitsufuji
8 months
If you will be at #ismir2024, don't forget to register our tutorial led by ⁦⁦@JCJesseLai
Tweet media one
0
5
28