Kwang Moo Yi @kwangmoo_yi profile

Kwang Moo Yi

@kwangmoo_yi

Followers

2K

Following

333

Statuses

471

Assistant Professor of Computer Science at the University of British Columbia. I also post my daily finds on arxiv.

Joined August 2019

Don't wanna be here? Send us removal request.

Kwang Moo Yi

@kwangmoo_yi

1 day

Preprint of today: Chen et al., "Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach" -- Let video models also diffuse pixel-aligned point clouds via augmentation and regularization --> videos that make more sense

0

14

68

Kwang Moo Yi

@kwangmoo_yi

2 days

Preprint of today: Shen et al., "Seeing World Dynamics in a Nutshell" -- Feed-forward Dynamic Gaussian Estimator for videos. Estimates pixel-aligned Gaussians, then deforms them to adhere to depth and flow from Foundational models.

0

10

50

Kwang Moo Yi

@kwangmoo_yi

3 days

Preprint of today: Chefer et al., "VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models" -- Ensemble conditioning of text, motion, and both, and teaching a video model to both results in much more natural video.

0

5

41

Kwang Moo Yi

@kwangmoo_yi

4 days

Preprint of today: Govindarajan, Rebain et al., "Radiant Foam: Real-Time Differentiable Ray Tracing" -- You can do volumetric ray tracing with a learned representation using meshes (foams), even faster than rasterizing with Gaussian Splats.

Andrea Tagliasacchi 🇨🇦

@taiyasaki

5 days

📢📢📢 "𝐑𝐚𝐝𝐢𝐚𝐧𝐭 𝐅𝐨𝐚𝐦: Real-Time Differentiable Ray Tracing", a mesh-based 3D represention. Co-lead by my PhD students Shrisudhan Govindarajan and Daniel Rebain, and w/ @kwangmoo_yi

0

1

29

Kwang Moo Yi

@kwangmoo_yi

5 days

Preprint of today: Guizilini et al., "Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion" -- Generalizable view-conditioned novel-view synthesis diffusion model. The results aren't perfect, but give some nice gains from SOTA.

0

15

115

Kwang Moo Yi

@kwangmoo_yi

9 days

Preprint of (not) today: Liang et al., "Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos" -- A ViT that takes multiple frames (and their time andpose in plücker representation) outputs 3D Gaussians at queried time.

3

33

146

Kwang Moo Yi

@kwangmoo_yi

9 days

Preprint of today: Jäger et al., "FeatureGS: Eigenvalue-Feature Optim. in 3D Gaussian Splatting for Geom. Accurate and Artifact-Reduced Recon." -- Aligns Gaussian shapes with how neighbouring Gaussians are arranged via eigenvalues. Byebye floaters!

0

11

68

Kwang Moo Yi

@kwangmoo_yi

10 days

RT @ducha_aiki: While you are finishing CVPR rebuttal - consider submitting papers to Image Matching Workshop 2025 @CVPR #CVPR2025 Deadli…

0

6

0

Kwang Moo Yi

@kwangmoo_yi

10 days

Paper of (not) today: Chen et al., "On the Trajectory Regularity of ODE-based Diffusion Sampling" -- An interesting paper that highlights the regularity of diffusion trajectories. And how we can do better than simply going about this in equal steps.

0

11

89

Kwang Moo Yi

@kwangmoo_yi

11 days

@DavidSHolz I can't edit because I don't pay :( the proper link. Sorry!

0

1

Kwang Moo Yi

@kwangmoo_yi

11 days

Preprint 2/2 of today: Ren et al., "Improving Tropical Cyclone Forecasting With Video Diffusion Models" -- Video diffusion models can help predict cyclones! Note: baseline is also a diffusion model.

0

2

14

Kwang Moo Yi

@kwangmoo_yi

11 days

Preprint 1/2 of today: Elflein et al, "Light3R-SfM: Towards Feed-forward Structure-from-Motion" -- Feed-forward SfM using Dust3r style arch, but with latent-space alignment. Somewhat similar direction as Fast3R (yesterday's post)

1

23

105

Kwang Moo Yi

@kwangmoo_yi

15 days

Preprint of the day: Kinakh et al., "Binary Diffusion Probabilistic Model" -- Images are discrete and binary. Shouldn't we take that into account when we do probabilistic modeling? This work uses XOR-based noise transformations instead of Gaussians.

1

15

82

Kwang Moo Yi

@kwangmoo_yi

16 days

Paper of the day: Wang et al., "Continuous 3D Perception Model with Persistent State" -- RNN state model that can be "read out" as 3D pointmaps.

2

64

334

Kwang Moo Yi

@kwangmoo_yi

17 days

Preprint of the day: Raphaeli et al., "SILO: Solving Inverse Problems with Latent Operators" -- Learn an image degrader in the latent space for inverse problems for both speed and quality.

1

13

64

Kwang Moo Yi

@kwangmoo_yi

19 days

Preprint of the day: Wen et al., "FoundationStereo: Zero-Shot Stereo Matching" -- Extending DepthAnythingV2 to do stereo with a learned cost-volume filter. Cool visuals on the website!

0

24

193

Kwang Moo Yi

@kwangmoo_yi

22 days

Preprint of the day: Motamed et. al, "Physics IQ Benchmark: Do generative video models learn physical principles from watching videos?" -- A benchmark to test whether video models really learned physics -- spoiler: they didn't (yet)

2

20

81

Kwang Moo Yi

@kwangmoo_yi

23 days

repost with video since i forgot to put that up the first time :(

0

1