![Linjie (Lindsey) Li Profile](https://pbs.twimg.com/profile_images/1671429119379398657/bZwDN9t-_x96.jpg)
Linjie (Lindsey) Li
@LINJIEFUN
Followers: 2K
Following: 817
Statuses: 137
researching @Microsoft, @UW, contributing to https://t.co/a3zper7NJG
Seattle, WA
Joined August 2012
Sorry for leaving out one important detail in this job posting. The research area is multimodal understanding and generation.
We are hiring full-time/part-time research interns all year round. If you are interested, please send your resume to linjli@microsoft.com.
RT @yining_hong: SlowFast-VGen has been accepted to ICLR as a spotlight paper with scores of 8-8-8-6
Our ShowUI has reached 100K+ downloads! Kudos to the team and especially our amazing intern @KevinQHLin. If you are interested in developing computer-use agents, we encourage you to try ShowUI!
Exciting milestone! Our ShowUI model on Hugging Face just surpassed 100K downloads! Big thanks to everyone exploring GUI automation with our vision-language-action models.
RT @ManlingLi_: Reasoning Agent: A General-Purpose Agent Learning Framework. DeepSeek-R1 training approach for agents, no SFT, just RL tr…
RT @KevinQHLin: The OpenAI Operator/CUA looks like an end-to-end Vision-Language Model (VLM) via RL training that takes an image as input, …
RT @will_wang_whc: (5/8) We also try various test-time compute scaling strategies. While they tend to boost model performance, they are far…
RT @will_wang_whc: (1/8) Can your MLLM actually reason over text and images? ✨ Introducing EMMA: An Enhanced MultiModal ReAsoning Benchmark…
RT @XiyaoWang10: Our code and model are now available: Code | Model. Feel free to contact me if…
RT @KevinQHLin: I am honored that our work ShowUI was recognized with one of the Outstanding Paper Awards at the NeurIPS Open-World Agent Wor…
RT @drjingjing2026: 2/3 The speaker chose to specify that the perpetrator is Chinese. To me, this implies the speaker's assumption that bei…
RT @furongh: I saw a slide circulating on social media last night while working on a deadline. I didn't comment immediately because I wante…
RT @drjingjing2026: 1/3 Today, an anecdote shared by an invited speaker at #NeurIPS2024 left many Chinese scholars, myself included, feelin…
RT @xyz2maureen: Poster: Fri 13 Dec, 4:30 pm - 7:30 pm PST (West). It is the first time for me to try to sell a new concept that I believe but…
RT @furongh: Test-time reasoning for LLMs is trending! GPT-O1 and follow-ups have shown the power of inference-time search. But what abou…
RT @oodgnas: We're thrilled to announce the _Workshop on Video-Language Models_ at #NeurIPS2024! Join us, along with leading researchers a…
RT @1jaskiratsingh: Prompt: Realistic, real life photo of person, ultra realistic facial details. Top: Flux-Dev. Bottom: Flux-Dev + NegToMe…
RT @1jaskiratsingh: Negative Token Merging: Image-based Adversarial Feature Guidance. NegToMe is training-free and leads to better: diver…
Check out our latest work on building a lightweight vision-language-action model for GUI agent control!
Interested in developing a local multimodal model to control your screen (PC or phone) like the Claude API? Introducing "ShowUI", an open-source, lightweight 2B vision-language-action model designed for GUI agent control!
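For anyone who wants to try it, here is a minimal sketch of asking ShowUI to locate a click target on a screenshot. It assumes the checkpoint is published on Hugging Face as showlab/ShowUI-2B and is Qwen2-VL-compatible, so it loads through the Qwen2-VL classes in transformers; the file name, prompt wording, and coordinate format are illustrative, not a documented interface.

```python
# Minimal sketch (assumptions: the checkpoint lives at "showlab/ShowUI-2B"
# on Hugging Face and is Qwen2-VL-compatible; the prompt and the expected
# x, y answer format are illustrative, not the model's documented interface).
import torch
from PIL import Image
from transformers import AutoProcessor, Qwen2VLForConditionalGeneration

model = Qwen2VLForConditionalGeneration.from_pretrained(
    "showlab/ShowUI-2B",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
processor = AutoProcessor.from_pretrained("showlab/ShowUI-2B")

screenshot = Image.open("screenshot.png")  # hypothetical local screenshot
messages = [{
    "role": "user",
    "content": [
        {"type": "image"},
        {"type": "text", "text": "Locate the 'Sign in' button and answer with its x, y position."},
    ],
}]

# Build the chat prompt, then pack text + image into model inputs.
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(text=[prompt], images=[screenshot], return_tensors="pt").to(model.device)

with torch.no_grad():
    output_ids = model.generate(**inputs, max_new_tokens=64)

# Strip the prompt tokens and decode only the generated answer.
answer = processor.batch_decode(
    output_ids[:, inputs["input_ids"].shape[1]:], skip_special_tokens=True
)[0]
print(answer)
```

The 2B parameter count is what makes a local setup plausible here: with device_map="auto" the model fits comfortably on a single consumer GPU and falls back to CPU otherwise.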
RT @yuyangzhao_: Excited to introduce GenXD: Generating Any 3D and 4D Scenes! A joint framework for general 3D and 4D generation, support…