minqian_liu Profile Banner
Minqian Liu Profile
Minqian Liu

@minqian_liu

Followers
430
Following
1K
Statuses
89

PhD student @VT_CS | Previous Research Intern at Microsoft and AWS AI | he/him

Blacksburg, VA
Joined August 2017
Don't wanna be here? Send us removal request.
@minqian_liu
Minqian Liu
3 months
Thrilled to announce that our work has been accepted by #EMNLP2024 (Main Conference)! We introduced a highly challenging benchmark for interleaved text-and-image generation along with a strong multi-aspect evaluator. See you in Miami! 🌴 Paper: Dataset:
@minqian_liu
Minqian Liu
8 months
🚨 New paper alert! We introduce InterleavedBench📚, the first comprehensive evaluation benchmark for interleaved text-and-image generation, as well as InterleavedEval🔍, a powerful GPT-based evaluator that supports multi-aspect assessment. arXiv: (1/n)
0
0
4
@minqian_liu
Minqian Liu
27 days
🔥Thrilled to announce ReFocus, our latest work led by @XingyuFu2 that teaches MLLMs to generate “visual thoughts” 🧠 via visual editing on tables and charts 📊 to improve reasoning. Huge thanks to all the amazing co-authors! 🙌
@XingyuFu2
Xingyu Fu
27 days
Teach GPT-4o to edit on charts and tables to ReFocus 🔍 and facilitate reasoning 🧠! 🔥 We introduce ReFocus, which edits input table and chart images to better reason visually 🤔 Can we teach smaller models to learn such visual CoT reasoning? 🚀 Yes -- They are better than QA and CoT data! 📈 ReFocus + GPT-4o brings +11.0% on tables and +6.8% on charts without using any tools🔧! 📊 We release a 14K Visual CoT Reasoning *Training Dataset* that provides intermediate refocusing supervision. 🤖+🔍 > CoT: ReFocus VCoT is 8.0% better than QA data and 2.6% better CoT data with supervised Finetuning on Phi3.5v. Trained model also released. 📑 Check out This work is done during intern @Microsoft with amazing coauthors @minqian_liu @zhengyuan_yang @JCorring36990 @YijuanLu @jw2yang4ai @DanRothNLP @DineiFlorencio @ChaZhang. A huge shoutout to everyone!
Tweet media one
0
0
2
@minqian_liu
Minqian Liu
2 months
RT @tuvllms: 📢✨ I am recruiting 1-2 PhD students at Virginia Tech this cycle. If you are interested in efficient model development (inclu…
0
78
0
@minqian_liu
Minqian Liu
6 months
@hengjinlp @Glaciohound Well deserved! Congratulations! 🎉
0
0
2
@minqian_liu
Minqian Liu
6 months
@mandarsharma @profnaren Congratulations Mandar! 🎉🎉
1
0
1
@minqian_liu
Minqian Liu
6 months
0
0
1
@minqian_liu
Minqian Liu
8 months
👏 Big thanks to @zhiyangx11, @lifu_huang, and all other awesome collaborators for their great effort in this work! The dataset and code will be released soon at Stay tuned! (6/n)
0
0
5
@minqian_liu
Minqian Liu
8 months
RT @GuoOctavia: Ever wondered if style lexicons still play a role in the era of LLMs? 🤔 We tested 13 established and 63 novel language sty…
0
9
0
@minqian_liu
Minqian Liu
8 months
Thank you for the great work! Will definitely check it out.
@XingyuFu2
Xingyu Fu
8 months
Can Text-to-Image models understand common sense? 🤔 Can they generate images that fit everyday common sense? 🤔 tldr; NO, they are far less intelligent than us 💁🏻‍♀️ Introducing Commonsense-T2I 💡 a novel evaluation and benchmark designed to measure commonsense reasoning in T2I models 🔥🔥 Paper: (1/n)
Tweet media one
0
0
1
@minqian_liu
Minqian Liu
8 months
RT @XingyuFu2: Can Text-to-Image models understand common sense? 🤔 Can they generate images that fit everyday common sense? 🤔 tldr; NO, t…
0
39
0
@minqian_liu
Minqian Liu
10 months
RT @YingShen_ys: 🚀 Excited to introduce my internship work at @Apple MLR : Many-to-many Image Generation with Auto-regressive Diffusion Mo…
0
34
0
@minqian_liu
Minqian Liu
1 year
RT @_akhaliq: Vision-Flan Scaling Human-Labeled Tasks in Visual Instruction Tuning Despite vision-language models' (VLMs) remarkable capa…
0
42
0
@minqian_liu
Minqian Liu
1 year
RT @DrogoKhal4: Knowledge Conflict gets accepted in #ICLR24 #Spotlight! We updated more results on 5 open- and 3 close-source LLMs and foun…
0
12
0
@minqian_liu
Minqian Liu
1 year
RT @barry_yao0: Our entity linking work has been accepted by #EACL2024. Check our work: Ameli: Enhancing Multimodal Entity Linking with Fin…
0
1
0