![Fangrui Zhu Profile](https://pbs.twimg.com/profile_images/1693344909175263232/df8Jo6W3_x96.jpg)
Fangrui Zhu (@Fangrui_Zhu)
Followers: 25 · Following: 23 · Statuses: 10
RT @praeclarumjj: 💠How do MLLMs improve their visual perception with more training data or visual inputs (depth/seg map)? 👉 Performance co…
Welcome to our poster at East Exhibit Hall #1711. Thanks to my collaborators Huaizu @HuaizuJiang and Jianwei @jw2yang4ai!
#NeurIPS2024 Can we make SAM-like models understand visual relationships? Excited to share FleVRS, which supports unified (HOI and scene graph), promptable, and open-vocabulary visual relationship segmentation. Paper: Code:
RT @HuaizuJiang: #NeurIPS2024 Can we make SAM-like models to understand visual relationships? Excited to share FleVRS, which supports unifi…
RT @MuCai7: Now TemporalBench is fully public! See how your video understanding model performs on TemporalBench before CVPR! 🤗 Dataset: h…
RT @jw2yang4ai: 🔥Check out our new LMM benchmark TemporalBench! Our world is temporal, dynamic and physical, which can only be captured i…
RT @HuaizuJiang: Excited to share our recent work HouseCrafter, which can lift a floorplan into a complete large 3D indoor scene (e.g. a ho…
Come and say hi at Arch 4A-E 10:30-noon on Thursday!!
#CVPR2024 We propose to solve zero-shot visual grounding by considering the structural similarities between images and captions, modeling the relationships of entities across two modalities. Paper: Code:
RT @HuaizuJiang: #CVPR2024 We propose to solve zero-shot visual grounding by considering the structural similarities between images and cap…
Check out our recent work on diagnosing human-object interaction detectors. The toolbox is simple and intuitive to use! Thanks to my advisor @HuaizuJiang for the great guidance and support! Thanks to @YimingXie4 and @WeidiXie for meaningful discussions and advice!
Overwhelmed by the progress of human-object interaction (HOI) detection? Ever wondered why one HOI model performs better than another? Check out our recent work on diagnosing human-object interaction detectors. Paper: Code: 🛢️ 1/N