Shujin Wu Profile Banner
Shujin Wu Profile
Shujin Wu

@shujin_wu

Followers
358
Following
125
Media
10
Statuses
21

Graduate Student @USCViterbi . Natural Language Processing & AI Alignment

Los Angeles, CA
Joined July 2022
Don't wanna be here? Send us removal request.
Explore trending content on Musk Viewer
@shujin_wu
Shujin Wu
4 months
Is your Vision-Language Model really helpful at all times? Can we instruct them to interact with users during conversations to avoid hallucinations or biased responses? 🍰Take some bites of PIE and MACAROON! We present a benchmark to evaluate LVLMs’ proactive engagement
Tweet media one
4
31
189
@shujin_wu
Shujin Wu
4 months
🧵(1/n) Existing Large vision-language models (LVLMs) invariably generate detailed responses even when questions are ambiguous or unanswerable, leading to hallucinations and bias issues. Thus, it is essential for LVLMs to proactively engage with humans to ask for clarifications
Tweet media one
1
2
8
@shujin_wu
Shujin Wu
4 months
Really appreciate the help and guidance from my best mentor @YiFung10 , advisor @hengjinlp , and of course all the amazing collaborators @ZoeyLi20 @yixin_wan_ @kaiwei_chang .🥰🫶
0
0
4
@shujin_wu
Shujin Wu
4 months
🧵(5/n) Our case study further proves that MACAROON not only successfully gains proactive engagement capabilities, but also exhibits promising multi-turn conversational skills.
Tweet media one
Tweet media two
1
0
4
@shujin_wu
Shujin Wu
4 months
🧵(2/n) We create a new benchmark PIE, which consists of 853 questions across six distinct, fine-grained question types based on a three-tiered hierarchy of model’s interactive behaviors: challenging invalid question settings, seeking clarifications, and uncovering latent human
Tweet media one
1
0
3
@shujin_wu
Shujin Wu
4 months
🧵(4/n) Experimental results on PIE and other general benchmarks such as MME show that MACAROON demonstrates significantly better proactive engagement capabilities while maintaining the general visual-language performance.
Tweet media one
Tweet media two
1
0
3
@shujin_wu
Shujin Wu
4 months
🧵(3/n) We then present MACAROON, self-iMaginAtion for ContrAstive pReference OptimizatiON. MACAROON avoids using extensive human or teacher model supervision via self-imagined desirable and undesirable responses based on human-written criteria. The contrastive response pairs,
Tweet media one
1
0
2
@shujin_wu
Shujin Wu
4 months
@lucy3_li wow I didn't know that! Thanks for letting me know😆😆😆
0
0
1