Xingyu Fu Profile
Xingyu Fu

@XingyuFu2

Followers
834
Following
773
Statuses
111

PhD student @Penn @cogcomp. | Focused on Vision+Language | Previous: @MSFTResearch @AmazonScience B.S. @UofIllinois | โ›ณ๏ธ๐Ÿ˜บ

Philadelphia, PA
Joined September 2020
Don't wanna be here? Send us removal request.
@XingyuFu2
Xingyu Fu
28 days
Teach GPT-4o to edit on charts and tables to ReFocus ๐Ÿ” and facilitate reasoning ๐Ÿง ! ๐Ÿ”ฅ We introduce ReFocus, which edits input table and chart images to better reason visually ๐Ÿค” Can we teach smaller models to learn such visual CoT reasoning? ๐Ÿš€ Yes -- They are better than QA and CoT data! ๐Ÿ“ˆ ReFocus + GPT-4o brings +11.0% on tables and +6.8% on charts without using any tools๐Ÿ”ง! ๐Ÿ“Š We release a 14K Visual CoT Reasoning *Training Dataset* that provides intermediate refocusing supervision. ๐Ÿค–+๐Ÿ” > CoT: ReFocus VCoT is 8.0% better than QA data and 2.6% better CoT data with supervised Finetuning on Phi3.5v. Trained model also released. ๐Ÿ“‘ Check out This work is done during intern @Microsoft with amazing coauthors @minqian_liu @zhengyuan_yang @JCorring36990 @YijuanLu @jw2yang4ai @DanRothNLP @DineiFlorencio @ChaZhang. A huge shoutout to everyone!
Tweet media one
5
32
148
@XingyuFu2
Xingyu Fu
17 days
RT @sheng_zh: Muirbench has been accepted to #ICLR2025! ๐Ÿš€ Companies like Apple, TikTok, and Salesforce are already evaluating their LMMs onโ€ฆ
0
8
0
@XingyuFu2
Xingyu Fu
18 days
RT @fwang_nlp: ๐— ๐˜‚๐—ถ๐—ฟ๐—•๐—ฒ๐—ป๐—ฐ๐—ต is officially accepted at #ICLR2025! ๐ŸŽ‰ Recent VLMs/MLLMs such as LLaVA-OneVision, MM1.5, and MAmmoTH-VL have demoโ€ฆ
0
8
0
@XingyuFu2
Xingyu Fu
26 days
RT @XingyuFu2: Teach GPT-4o to edit on charts and tables to ReFocus ๐Ÿ” and facilitate reasoning ๐Ÿง ! ๐Ÿ”ฅ We introduce ReFocus, which edits inpuโ€ฆ
0
32
0
@XingyuFu2
Xingyu Fu
27 days
@gabrielchua_ This is a hard problem for models and cannot solved by visual Sketchpad ! I think itโ€™s really an exciting direction and please keep me tuned๐Ÿ˜ƒ
0
0
0
@XingyuFu2
Xingyu Fu
28 days
@im_ashishsinha5 Try our finetuned model! Its free :D
0
0
1
@XingyuFu2
Xingyu Fu
28 days
@gabrielchua_ Unfortunately we finished the project before 4o could be finetuned with images๐Ÿ˜” But we release all the training data with intermediate visual outputs, feel free to try with them!
1
0
0
@XingyuFu2
Xingyu Fu
28 days
@astro_nolan Intersting problem! To be honost I think similar to problems in ReFocus, python code + low-level vision tools can be very helpful, e.g. use cv2 tools to find curve coordinates and provide to GPT models.
1
0
2
@XingyuFu2
Xingyu Fu
28 days
@WenhuChen Thanks Wenhu! Solute to the important foundation TabFact paper lol๐Ÿซก
0
0
1
@XingyuFu2
Xingyu Fu
28 days
ReFocus is inspired by many brilliant prior works, especially Visual SketchPad from @huyushi98 @WeijiaShi2 @LukeZettlemoyer @nlpnoah @RanjayKrishna, Visprog from @tanmay2099 @anikembhavi , ViperGPT from @SachitMenon @Surisdi @cvondrick , and many more!
0
0
5
@XingyuFu2
Xingyu Fu
1 month
RT @weichiuma: How to build an AI system that can generate 3D worlds from a single image? All you need is the **RIGHT** data! By trainingโ€ฆ
0
114
0
@XingyuFu2
Xingyu Fu
1 month
RT @WeijiaShi2: Introducing ๐‹๐ฅ๐š๐ฆ๐š๐…๐ฎ๐ฌ๐ข๐จ๐ง: empowering Llama ๐Ÿฆ™ with diffusion ๐ŸŽจ to understand and generate text and images in arbitrary sequenโ€ฆ
0
178
0
@XingyuFu2
Xingyu Fu
3 months
RT @Xiaodong_Yu_126: Life update: I defended my Ph.D. thesis today and have joined @AMD GenAI as a research scientist. ๐ŸŽ‰๐ŸŽ‰ #UPenn #AMD httpโ€ฆ
0
18
0
@XingyuFu2
Xingyu Fu
3 months
RT @thoma_gu: Life update: Excited to share that I will be joining @CIS_Penn @PennEngineers as an Assistant Professor in Fall 2025!๐Ÿคฏ Iโ€™mโ€ฆ
0
52
0
@XingyuFu2
Xingyu Fu
3 months
RT @cmalaviya11: Excited to share โœจ Contextualized Evaluations โœจ! Benchmarks like Chatbot Arena contain underspecified queries, which canโ€ฆ
0
28
0
@XingyuFu2
Xingyu Fu
4 months
RT @thoma_gu: We (@zhaisf @jsusskin) are looking for PhD interns to join @Apple MLR late 2024 or early 2025 on generative modeling or multiโ€ฆ
0
56
0
@XingyuFu2
Xingyu Fu
4 months
RT @xiangyue96: ๐ŸŒ Iโ€™ve always had a dream of making AI accessible to everyone, regardless of location or language. However, current open MLโ€ฆ
0
78
0
@XingyuFu2
Xingyu Fu
4 months
RT @yuntiandeng: How many reasoning tokens does OpenAI o1 use? It turns out they are almost always multiples of 64 (99+% of the time in 100โ€ฆ
0
47
0