@robrombach

Robin Rombach

Image generation

Krawallkrümel. Generative Models at https://t.co/1xqMb617gc, made with ❤️

13K Followers, 561 Following, 738 Posts

Recent posts

New paper out! We present a training method for multimodal generative models, called Self-Flow, which combines classic flow matching and representation learning.

Why? Unlike most representation alignment methods, our new approach does not require external, pretrained models and thus scales gracefully to joint multimodal training on images, videos and audio.

How? It combines per-timestep flow matching with dual-timestep representation learning, improving the models' internal representations. This approach outperforms prior methods and shows promising scaling behavior in multimodal pretraining. It also enables downstream applications such as action prediction for embodied AI.

webpage+paper: https://t.co/qzGQGj8JYk
code: https://t.co/edhfdVEqSf

Credit to @hila_chefer, @pess_r, Dominik, @dustin_podell, Vikash, @Vinh_Suhi and Antonio.

If you enjoy doing open research like this, come and join BFL! We are actively hiring 🌲
