Xingyu Fu Profile
Xingyu Fu

@XingyuFu2

Followers: 705 · Following: 405 · Media: 15 · Statuses: 68

PhD student at UPenn @cogcomp | Focused on Vision+Language Multimodal learning | Previous: B.S. @UIUC | ⛳️😺

Joined September 2020
Pinned Tweet
@XingyuFu2
Xingyu Fu
5 months
Can GPT-4V and Gemini-Pro perceive the world the way humans do? 🤔 Can they solve the vision tasks that humans can in the blink of an eye? 😉 tldr; NO, they are far worse than us 💁🏻‍♀️ Introducing BLINK👁 , a novel benchmark that studies visual perception
[image attached]
@_akhaliq
AK
5 months
BLINK: Multimodal Large Language Models Can See but Not Perceive. We introduce Blink, a new benchmark for multimodal large language models (LLMs) that focuses on core visual perception abilities not found in other evaluations. Most of the Blink tasks can be solved by humans
[image attached]
4 replies · 91 retweets · 374 likes
8 replies · 128 retweets · 403 likes
@XingyuFu2
Xingyu Fu
2 months
[Quoted tweet: the pinned BLINK announcement above]
6 replies · 18 retweets · 160 likes
@XingyuFu2
Xingyu Fu
3 months
Can Text-to-Image models understand common sense? 🤔 Can they generate images that fit everyday common sense? 🤔 tldr; NO, they are far less intelligent than us 💁🏻‍♀️ Introducing Commonsense-T2I 💡 , a novel evaluation and benchmark designed to measure
[image attached]
6 replies · 39 retweets · 127 likes
@XingyuFu2
Xingyu Fu
2 months
🔥 Check out MuirBench! 🚀 Robust multi-image understanding across 12 tasks 🤔 GPT-4o and Gemini Pro are worse than humans. More details in
[image attached]
@fwang_nlp
Fei Wang
2 months
Can GPT-4o and Gemini-Pro handle 𝐦𝐮𝐥𝐭𝐢𝐩𝐥𝐞 𝐢𝐦𝐚𝐠𝐞𝐬? Introducing MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding. 🌐 Explore here: 📄 Paper: 📊 Data:
[image attached]
2 replies · 42 retweets · 94 likes
1 reply · 21 retweets · 70 likes
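For readers who want to try this kind of multi-image setting themselves, here is a minimal sketch of sending several images plus one question to GPT-4o in a single request via the OpenAI chat completions API. The image URLs and the question are placeholders, and this is not the MuirBench evaluation code.

```python
# Minimal sketch of a multi-image query (the setting MuirBench evaluates):
# several images plus one question in a single request. Not MuirBench's
# official eval code; the URLs and the question are placeholders.
from openai import OpenAI

client = OpenAI()  # expects OPENAI_API_KEY in the environment

image_urls = [
    "https://example.com/frame1.jpg",  # placeholder
    "https://example.com/frame2.jpg",  # placeholder
]

content = [{"type": "text",
            "text": "Which object appears in every image? Answer with one word."}]
content += [{"type": "image_url", "image_url": {"url": u}} for u in image_urls]

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": content}],
)
print(response.choices[0].message.content)
```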
@XingyuFu2
Xingyu Fu
2 months
🔥 Commonsense-T2I is accepted to the first @COLM_conf with review scores of 8/8/7/7 😼 🎉 Congratulations to the team!! @yujielu_10 @muyuhe @WilliamWangNLP @DanRothNLP Excited to see you in the beautiful Philly 😊 Do current text-to-image models align with the everyday real
[Quoted tweet: the Commonsense-T2I announcement above]
2 replies · 12 retweets · 58 likes
@XingyuFu2
Xingyu Fu
3 months
🔥Error Examples from DALL-E 3 👀More Visualizations: (3/n)
[image attached]
1 reply · 2 retweets · 13 likes
@XingyuFu2
Xingyu Fu
5 months
🔥Highlights of the BLINK benchmark: 👩🏻‍🏫14 vision tasks, ranging from low-level perception to high-level visual reasoning 📚3.8K multiple-choice questions and 7.9K images carefully derived from various datasets and sources 📄Enable the study of various visual prompts to enhance
[image attached]
1 reply · 0 retweets · 14 likes
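As a rough illustration of how a multiple-choice benchmark like BLINK is typically scored: generate one answer letter per question and compute accuracy against the gold letter. The dataset/config identifiers, field names, and the stubbed query_model below are assumptions for illustration, not the official BLINK evaluation code.

```python
# Sketch of scoring a multimodal LLM on a BLINK-style multiple-choice task.
# Dataset id/config and field names are assumed for illustration; consult the
# official BLINK release for the real data layout and eval script.
from datasets import load_dataset

def query_model(image, question, choices):
    # Stub: replace with a real multimodal-LLM call that returns "A"/"B"/"C"/"D".
    return "A"

def evaluate(config="Relative_Depth", split="val"):
    ds = load_dataset("BLINK-Benchmark/BLINK", config, split=split)  # assumed id/config
    correct = 0
    for ex in ds:
        pred = query_model(ex["image_1"], ex["question"], ex["choices"])  # assumed fields
        gold = ex["answer"].strip("() ")  # normalize e.g. "(A)" -> "A"
        correct += int(pred.strip("() ") == gold)
    return correct / len(ds)

# print(evaluate())
```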
@XingyuFu2
Xingyu Fu
3 months
Thanks @ashkamath20 for testing the Gemini 1.5 Pro model on BLINK. We're happy to see the large improvements from GPT-4o and Gemini 1.5! @huyushi98 and I are presenting BLINK at #CVPR2024. Come chat with us! We will give a talk on: 📅Jun 17 🕙10 am CinW workshop 📍Arch 3B Poster
@huyushi98
Yushi Hu
3 months
Glad that GPT-4 and Gemini have come so far on BLINK! GPT-4V, 4-turbo, 4o: 51.1 ➡️ 54.6 ➡️ 60.0 📈 Gemini 1.0 Pro, 1.0 Ultra, 1.5 Pro: 45.1 ➡️ 51.7 ➡️ 61.4 📈 @XingyuFu2 and I will present BLINK at #CVPR2024. Find us if you want to chat! Jun 17 10 am, 2:30-3:30 pm CinW
[image attached]
0 replies · 7 retweets · 23 likes
0 replies · 3 retweets · 14 likes
@XingyuFu2
Xingyu Fu
3 months
🔥Highlights of the Commonsense-T2I benchmark: 📚Pairwise text prompts with minimal token changes ⚙️Rigorous automatic evaluation with descriptions of the expected outputs ❗️Even DALL-E 3 only achieves below 50% accuracy (2/n)
[image attached]
1 reply · 3 retweets · 12 likes
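A minimal sketch of the pairwise scoring idea described above, assuming the evaluation is pair-level (both minimally different prompts must yield images matching their expected-output descriptions). The field names and the generate/judge callables are illustrative assumptions, not the released evaluation code.

```python
# Pairwise scoring sketch in the spirit of Commonsense-T2I: a pair is correct
# only if BOTH of its minimally different prompts produce images that match
# their expected-output descriptions. Field names are assumptions.
from typing import Callable, Sequence

def pairwise_accuracy(
    pairs: Sequence[dict],
    generate: Callable[[str], str],     # text-to-image call; returns an image path
    judge: Callable[[str, str], bool],  # e.g. a multimodal-LLM check of image vs. description
) -> float:
    correct = 0
    for pair in pairs:
        ok1 = judge(generate(pair["prompt1"]), pair["expected1"])
        ok2 = judge(generate(pair["prompt2"]), pair["expected2"])
        correct += int(ok1 and ok2)
    return correct / len(pairs)
```

If scoring is pair-level as sketched here, it is stricter than per-prompt accuracy, which is consistent with even DALL-E 3 landing below 50%.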
@XingyuFu2
Xingyu Fu
3 months
🔥Results and Takeaways: 💻<🧠 The Commonsense-T2I benchmark poses a great challenge to existing T2I models: 🥇DALL-E 3: 49% 🥈Playground v2.5: 26% 🥉Stable Diffusion XL: 25% 🤖GPT-revised prompts cannot solve the problem! Check out more details and visualizations of T2I model outputs
[image attached]
1 reply · 3 retweets · 11 likes
@XingyuFu2
Xingyu Fu
5 months
🔥Results and Takeaways: 💻<🧠 The BLINK benchmark poses a great challenge to existing multimodal LLMs: 🥇Human: 96% 🥈GPT-4V: 51% 🥉Gemini Pro: 45% 4️⃣Claude Opus: 43% 5️⃣Random guess: 38% ⭕️ Visual prompting can have a big impact on multimodal LLM performance: circle sizes and
[image attached]
3 replies · 1 retweet · 11 likes
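The "visual prompting" mentioned above refers to marking up the image itself (for example with a circle) before querying the model. A minimal sketch with Pillow follows; the file path, circle position, radius, color, and line width are placeholders, and BLINK's own experiments sweep such choices rather than fix them.

```python
# Sketch of a circle visual prompt: draw a circle on the image before sending
# it to a multimodal LLM. Path and circle parameters are placeholders.
from PIL import Image, ImageDraw

def add_circle_prompt(path, cx, cy, radius=40, color="red", width=4):
    img = Image.open(path).convert("RGB")
    draw = ImageDraw.Draw(img)
    draw.ellipse((cx - radius, cy - radius, cx + radius, cy + radius),
                 outline=color, width=width)
    return img

# Example usage (placeholder file):
# add_circle_prompt("example.jpg", cx=320, cy=180, radius=30).save("example_marked.jpg")
```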
@XingyuFu2
Xingyu Fu
5 months
😺 This work is done with my amazing collaborators: @huyushi98 @BangzhengL @AnnieFeng6 @Haoyu_Wang_97 @Xudong_Lin_AI @DanRothNLP @nlpnoah @weichiuma @RanjayKrishna YOU ARE THE BEST!!! 😎🔥 (5/n)
1 reply · 0 retweets · 10 likes
@XingyuFu2
Xingyu Fu
5 months
😊 The data of BLINK👁️ is curated from a wide range of sources and datasets. We thank the authors for making them available. 😊 Specifically, we would like to shout out to 1. HPatches by Vassileios Balntas, @LencKarel , Andrea Vedaldi, and Krystian Mikolajczyk (
0 replies · 0 retweets · 8 likes
@XingyuFu2
Xingyu Fu
5 months
🔥Comparison with previous benchmarks: ⭕️ Diverse visual prompting: Besides text prompts, we support various visual prompts, such as points, boxes, and masks. 👓Beyond recognition: We study a wide range of tasks beyond visual recognition. 🧑Visual commonsense: Our tasks
[image attached]
1 reply · 0 retweets · 8 likes
@XingyuFu2
Xingyu Fu
3 months
😺 This work is done with my amazing collaborators: @yujielu_10, Muyu He, @WilliamWangNLP @DanRothNLP YOU ARE THE BEST!!! 😎🔥 (n/n)
3 replies · 1 retweet · 8 likes
@XingyuFu2
Xingyu Fu
2 months
Shoutout to its sister benchmark BLINK, which claims that multimodal LLMs are bad at visual-perception-focused tasks.
0 replies · 0 retweets · 4 likes
@XingyuFu2
Xingyu Fu
5 months
@Yossi_Dahan_ @penn_nlp @uwnlp @cogcomp @ai2_allennlp Yeah, I agree with you! That’s a great question. I believe more high-quality data is needed, and also more supervision beyond image-caption mapping, e.g. feedback from specialist models, as we discussed in the paper 😺
1 reply · 0 retweets · 3 likes
@XingyuFu2
Xingyu Fu
5 months
@DongfuJiang @penn_nlp @uwnlp @cogcomp @ai2_allennlp 😺 The eval code on the validation set is already out at and we'll have the test-set evaluation ready this weekend. I'll keep you posted! I'm excited to see the Mantis results!
[image attached]
0 replies · 0 retweets · 2 likes
@XingyuFu2
Xingyu Fu
3 months
@keviv9 @penn_nlp @ucsbNLP Thanks Prof. Gupta lol 😄
0 replies · 0 retweets · 1 like
@XingyuFu2
Xingyu Fu
5 months
@XueFz Sure, this task evaluates the ability of multimodal LLMs to engage in graphical reasoning. There are mainly two kinds of problems: 1. 3D imagination, e.g. linking a 2D paper to its 3D form after folding it; 2. Pattern following, e.g. finding the image that follows the same pattern or
0 replies · 0 retweets · 1 like
@XingyuFu2
Xingyu Fu
2 years
🌟 Can large multi-modal models reason about the time and location given a single image? Check out our #acl2022nlp paper, “There’s a Time and Place for Reasoning Beyond the Image”, for more details! Paper available at @cogcomp (1/N)
[image attached]
3 replies · 0 retweets · 1 like
@XingyuFu2
Xingyu Fu
2 months
@ZhiHuangPhD @Penn @PennPathLabMed @UPennDBEI Congratulations Prof. Huang! Welcome to Philly 😃
0 replies · 0 retweets · 1 like
@XingyuFu2
Xingyu Fu
2 years
@cogcomp (3/N) We show that there exists a 70% gap between a state-of-the-art CLIP model and human performance, motivating higher-level vision-language joint models that can conduct open-ended reasoning with world knowledge.
[image attached]
0 replies · 0 retweets · 1 like
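For context on what a CLIP baseline for this kind of task can look like, here is a hedged sketch: score the image against candidate location (or time) captions with CLIP and take the highest-scoring one. The checkpoint, caption template, and candidate list are illustrative choices, not necessarily the paper's exact setup.

```python
# Sketch of a CLIP zero-shot baseline for image-to-location reasoning:
# rank candidate location captions by similarity to the image.
# Checkpoint, caption template, and candidates are illustrative.
import torch
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-large-patch14")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-large-patch14")

def rank_locations(image, candidates):
    texts = [f"A photo taken in {c}." for c in candidates]
    inputs = processor(text=texts, images=image, return_tensors="pt", padding=True)
    with torch.no_grad():
        probs = model(**inputs).logits_per_image.softmax(dim=-1)[0]
    return sorted(zip(candidates, probs.tolist()), key=lambda kv: -kv[1])

# Example usage (placeholder image path):
# from PIL import Image
# print(rank_locations(Image.open("photo.jpg"), ["Paris", "Tokyo", "New York City"]))
```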