Patrick Esser @pess_r profile

Patrick Esser

@pess_r

Followers

5,055

Following

314

Media

27

Statuses

1,119

Walking on the generative side of computer vision @bfl_ml . he/him

https://t.co/kcWF5UeMMa

Heidelberg

Joined September 2016

Don't wanna be here? Send us removal request.

Explore tweets Explore followers Explore following

Explore trending content on Musk Viewer

Brazil • 1180576 Tweets

Elon Musk • 721961 Tweets

Meu Twitter • 483724 Tweets

Alexandre de Moraes • 471351 Tweets

Bluesky • 214134 Tweets

Adeus Twitter • 183091 Tweets

Politico • 160259 Tweets

Xandão • 158295 Tweets

#BUS1stFANCON_KnockKnockKnock • 142808 Tweets

Wisconsin • 122381 Tweets

Tchau • 95672 Tweets

Caitlin Clark • 67630 Tweets

Temple • 59620 Tweets

ブラジル • 41051 Tweets

Angel Reese • 34390 Tweets

#プレイバックガチャ • 33552 Tweets

Michigan State • 23195 Tweets

#INZM_30Mviews • 21988 Tweets

Janta Kee Awaaz • 20278 Tweets

野菜の日 • 18524 Tweets

#GiftOfEducation • 17048 Tweets

Djokovic • 13593 Tweets

Stanford • 12119 Tweets

カネゴン • 11431 Tweets

#TheLastDriveIn • 10770 Tweets

台風一過 • 10188 Tweets

Luke Fickell

Chiles

Austin Wells

マサムネ

Laricel

Badgers

ラジエル

メデューサ

ウリエル

Rodrigo Ferreira

Van Dyke

カーショー

メドゥ子

スターリング

メドゥちゃん

Kershaw

ROTY

Joe Kelly

Western Michigan

メドゥーサ

Popyrin

土ブースト

ハルヒ新刊

Carille

Last Seen Profiles

@edward_caswell

@com_in_

@RandomArtz95

@asupan_sejati

@HagovSocialista

@Suresh_Krissna

@l_cbm

@PbfTogo

@christman22jenn

@DaddyDonor

@sanatanihindu_4

@GabrielaIb66916

@GoldenHornets

@hemirdesai

@Trinhnomics

@Stella_diazz

@Dream11

@ProfessionalEcc

@silencelabs_sl

@mpi_polymer

Pinned Tweet

Patrick Esser

@pess_r

2 years

#stablediffusion text-to-image checkpoints are now available for research purposes upon request at Working on a more permissive release & inpainting checkpoints. Soon™ coming to @runwayml for text-to-video-editing

102

1K

5K

Patrick Esser

@pess_r

2 years

The model behind Erase & Replace is now available at @runwayml jointly with @robrombach

9

124

823

Patrick Esser

@pess_r

4 years

Watching transformers do their thing in 3D

3

123

593

Patrick Esser

@pess_r

4 years

Pre-tamed transformers now at Just added a Colab demo to start sampling right away:

5

96

490

Patrick Esser

@pess_r

2 years

We present Latent Diffusion Models in tomorrow's Image & Video Synthesis and Generation session @CVPR in Hall B1 (Thur. 08:30-10:18). Join us for a chat at Poster #7 between 10:00-12:30 in Hall B2-C and sample some jazz! With @robrombach @andi_blatt D. Lorenz @runwayml B. Ommer

5

36

379

Patrick Esser

@pess_r

3 years

demo to run our GeoGPT models on images in the wild. Code at Detes at

8

69

350

Patrick Esser

@pess_r

5 months

wouldn't be where we are are without @EMostaque ❤️

Emad

@EMostaque

5 months

Self-sovereign AI

51

65

613

7

12

229

Patrick Esser

@pess_r

3 years

Jumping into a thread about video inpainting @runwayml 🦘👇 1/7

3

17

160

Patrick Esser

@pess_r

3 years

Geometry-Free View Synthesis: We don't need no 3D priors. Leave them transformers unbiased! Without coding 3D transformations into the model, they learn to synthesize novel views from a single input image.

1

30

148

Patrick Esser

@pess_r

3 years

Excited about ML for creative video editing? Come work with an amazing team @runwayml ! We have open positions for Research Scientists and other roles. The team is diverse and distributed all over the world with a great remote work culture.

Cristóbal Valenzuela

@c_valenzuelab

3 years

Thrilled to announce Runway’s $35M Series B led by Coatue. Content creation and video editing are being massively transformed by machine learning and the web. Excited to double down on our mission to keep reimagining how we tell stories. More here:

41

30

313

4

18

92

Patrick Esser

@pess_r

4 years

Denoising Diffusion Probabilistic Models converted to PyTorch with Streamlit Demo Run demo: pip install -e git+ pytorch_diffusion_demo

1

19

70

Patrick Esser

@pess_r

6 months

❤️ let's gooo 🚀

Robin Rombach

@robrombach

6 months

Party time! The SD3 paper made it to arxiv: Key takeaways: - flow matching is very nice. - back to work with @pess_r and a fantastic team ♥️ The paper is full of details on improved flow matching, scaling and engineering. Enjoy!

10

38

266

3

55

Patrick Esser

@pess_r

3 years

So many good news! 🥳 I joined @runwayml as a Research Scientist! I'll present Geometry-Free View Synthesis () at #ICCV2021 today 6 pm and Friday 11 am EDT 👋 ImageBART () was accepted at #NeurIPS2021 w/ @robrombach @andi_blatt & BO

6

55

Patrick Esser

@pess_r

2 months

sooooo good! 🤩 amazing work, congrats to the whole @runwayml team!!

Runway

@runwayml

2 months

Introducing Gen-3 Alpha: Runway’s new base model for video generation. Gen-3 Alpha can create highly detailed videos with complex scene changes, a wide range of cinematic choices, and detailed art directions. (1/10)

254

930

4K

2

1

54

Patrick Esser

@pess_r

6 years

Results from our #CVPR2018 paper on Shape and Appearance Conditional Image Generation. More results and code at

1

24

49

Patrick Esser

@pess_r

3 years

Packing things up. Our goal is to make creative experimentation possible for everyone. Turning prototypes into milestones on our way towards this goal requires a shared vision and true team efforts. Fast-forwarded prototype vs real-time app:

1

6

43

Patrick Esser

@pess_r

4 years

Demo of our #ECCV2020 work "Making Sense of CNNs" with @robrombach project: run @streamlit demo in 5 lines: git clone cd invariances conda env create -f environment.yaml conda activate invariances streamlit run invariances/demo.py

2

16

37

Patrick Esser

@pess_r

4 years

Training state-of-the-art models is getting too expensive. We present Net2Net at #NeurIPS2020 to efficiently recombine existing models for new tasks. Try it and translate between low- and high-resolution models to obtain a new super-resolution model!

1

5

24

Patrick Esser

@pess_r

3 years

Generating videos. Videos introduce additional challenges: Searching for frames where the background is visible is expensive and exposure changes between frames require a seamless blending strategy that produces temporally consistent results without flickering.

1

22

Patrick Esser

@pess_r

3 years

Moving forward. Anticipating every possible use case is impossible. That's why we release an early open beta to learn more about how inpainting will be used. If you are excited about boosting video creativity with new tools, join our team:

Careers | Runway

We're looking for talented and open-minded individuals from all backgrounds who are passionate about advancing human creativity.

runwayml.com

1

0

21

Patrick Esser

@pess_r

6 years

Reenactment results on Penn Action sport sequences from our HBUGEN2018 workshop paper "Towards Learning a Realistic Rendering of Human Behavior" #ECCV2018

0

7

16

Patrick Esser

@pess_r

3 years

Filling unknown regions of images. Filling in a masked region of an image is an underdetermined problem: There are many possible options and we need to come up with one of them. More about our research on generative models for inpainting and other tasks:

1

17

Patrick Esser

@pess_r

2 years

🤗

Runway

@runwayml

2 years

Stable Diffusion Inpainting is now available in the latest version of the @huggingface diffusers library (v0.6.0) 🎉 Get started here:

0

20

114

0

4

16

Patrick Esser

@pess_r

8 years

Lesson 2 of the #carND from @udacity . Exploring parameters.

1

16

Patrick Esser

@pess_r

3 years

What to remove? To avoid excessive manual masking, we follow a similar approach as in GreenScreen, except that we use a simpler & faster model, because we don't need very accurate masks, but we do need some extra time for the actual inpainting. More at

1

15

Patrick Esser

@pess_r

3 years

Using temporal information. For video inpainting, we don't necessarily need to come up with plausible content if we can see what's behind a masked region in other parts of the video. This becomes more clear if we explicitly align a video to its first frame:

1

0

15

Patrick Esser

@pess_r

3 years

Sampling with small top-k values doesn't give very good FID scores but a nice "prototypical" look. #CVPR2021 poster session starts at 9am UTC ( one hour) ╰( ^o^)╮

1

0

8

Patrick Esser

@pess_r

4 years

Great article "Understanding the Role of Individual Units in a Deep Network" on network dissection for classifiers and generators by @davidbau et al. New code with pretrained segmentation model analyzes 1825 semantic concepts (1.5x)

GitHub - davidbau/dissect: Code for the Proceedings of the National Academy of Sciences 2020...

Code for the Proceedings of the National Academy of Sciences 2020 article, "Understanding the Role of Individual Units in a Deep Neural Network" - davidbau/dissect

github.com

0

1

7

Patrick Esser

@pess_r

5 years

Robust mutual information minimization enables unsupervised disentangling of pose and appearance for arbitrary objects. More in our #iccv2019 paper "Unsupervised Robust Disentangling of Latent Characteristics for Image Synthesis".

1

6

Patrick Esser

@pess_r

3 years

"Maya the bee as an astronaut"

Rivers Have Wings

@RiversHaveWings

3 years

I finally managed to find hosting for my CLIP conditioned transformer (it's on GitHub as a release right now), so here's the Colab:

11

34

264

0

1

5

Patrick Esser

@pess_r

3 years

@Merzmensch @highqualitysh1t The interface is a bit more clumsy but you can dive right in :)

Google Colab Notebook

Run, share, and edit Python notebooks

colab.research.google.com

2

0

4

Patrick Esser

@pess_r

4 years

Great talks and papers at the AI for Content Creation workshop at #CVPR2020 We'll present our work "Network Fusion for Content Creation with Conditional INNs". Live Q&A @ 4pm PDT. Joint work with @robrombach & B. Ommer

0

3

Patrick Esser

@pess_r

3 years

Correction: Starts at 11 AM EDT / 5 PM CET / 3 PM UTC (ʘ‿ʘ)╯

0

3

Patrick Esser

@pess_r

4 years

We have also added code and a @streamlit demo for unpaired image translation tasks from our work "A Note on Data Biases" which will be presented at the #neurips4creativity workshop.

0

2

Patrick Esser

@pess_r

7 years

@MonaJalal_ No GAN, it's an autoregressive model scaled to large images via a multiscale approach

Parallel Multiscale Autoregressive Density Estimation

PixelCNN achieves state-of-the-art results in density estimation for natural images. Although training is fast, inference is costly, requiring one network evaluation per pixel; O(N) for N pixels....

arxiv.org

0

1