Haoning Wu Profile
Haoning Wu

@HaoningTimothy

Followers
710
Following
444
Statuses
238

PhD Nanyang Technological University 🇸🇬, BS @PKU1898

Singapore
Joined December 2020
@HaoningTimothy
Haoning Wu
2 months
We are releasing the BASE models of Aria! Aria-Base-64K: after 64k long-context multimodal training, before post-training; Aria-Base-8K: after 8k native multimodal pre-training, base of Base-64K. @DongxuLi_ @LiJunnan0409
2
21
83
@HaoningTimothy
Haoning Wu
4 days
RT @dyhTHU: 🔥🔥 Introducing Ola! State-of-the-art omni-modal understanding model with advanced progressive modality alignment strategy! Ola r…
0
29
0
@HaoningTimothy
Haoning Wu
18 days
RT @LiJunnan0409: Video-MMMU is a great benchmark with meticulous data collection and annotation processes. Very happy to see Aria ranking…
0
2
0
@HaoningTimothy
Haoning Wu
18 days
RT @BoLi68567011: VideoMMMU is a meticulously crafted benchmark designed to evaluate multimodal models’ video understanding abilities for c…
0
5
0
@HaoningTimothy
Haoning Wu
1 month
My posters in 2024.
[4 images]
2
0
14
@HaoningTimothy
Haoning Wu
1 month
@_TobiasLee Hi Lei, which row shall I regard as the final aggregated score?
1
0
0
@HaoningTimothy
Haoning Wu
1 month
RT @BoLi68567011: After nearly a year of development, LMMs-Eval has reached 2K+ stars and 60+ contributors! 🚀 Now with integrated image, v…
0
9
0
@HaoningTimothy
Haoning Wu
1 month
Glad to be one among these!
@KennyUTC
Haodong Duan
2 months
After 1 yr of building, VLMEvalKit now reaches 100+ contributors. On the journey of exploring LMM capabilities, we will go further.
[1 image]
0
0
3
@HaoningTimothy
Haoning Wu
2 months
Magic powers! Excellent work from my fellow colleagues. Note that this model is fine-tuned from Aria-Base, the base model of Aria, to reach optimal performance on UI tasks. Hope to see more domain-specific models fine-tuned from the Aria-Base series!
@itsyuhao
Yuhao Yang
2 months
🚀 Introducing Aria-UI – a cutting-edge grounding LMM for GUI agents with a lightning-fast 3.9B parameters activated backbone! 🌐 Try it yourself: 📄 Project page: 📂 Explore on GitHub:
1
1
10
@HaoningTimothy
Haoning Wu
2 months
Glad to contribute to some milestones in this domain~
@_vztu
Zhengzhong Tu
2 months
Our newest, most comprehensive survey on Video Quality Assessment - led by my legendary advisor, Alan Bovik, who has pioneered this field for over three decades, and myself, dedicated my (almost) entire PhD journey to this topic - is now live on arXiv!
📎 Paper: 🖥️ GitHub:
In this work, we’ve curated a panoramic, deeply-researched view of the Video Quality Assessment (VQA) landscape. We cover the evolution from classic methods to cutting-edge deep learning solutions, offering a clear guide for both newcomers and seasoned experts.
Key highlights include:
📚 A holistic categorization and analysis of existing VQA models, with insights into how techniques have evolved and where they’re headed.
🧐 A thorough look at subjective evaluation fundamentals, including major datasets and what they mean for real-world applications.
🤖 A deep dive into loss functions and architectural innovations, illuminating how modern frameworks are pushing the frontier of #VQA.
📊 Broad comparisons across emergent data types, shedding light on the importance of modeling spatiotemporal details and leveraging prior knowledge.
🎯 Real-world applications and future directions that underscore how these advancements can revolutionize streaming platforms, social media, and beyond.
We hope this survey catalyzes new research avenues, encourages innovative solutions, and serves as a catalyst for the potential industry-university cooperation to foster fast and practical integration of such essential technologies into the social media, video streaming, or even the generative imagery/videography industry!
🚀 Dive in, share your thoughts, and let’s drive the future of #VQA together!
[1 image]
4
0
8
@HaoningTimothy
Haoning Wu
2 months
RT @LiJunnan0409: Introducing 🔥Aria-Chat🔥, our latest multimodal chat model optimized for open-ended and multi-round dialogs! It outperform…
0
5
0
@HaoningTimothy
Haoning Wu
2 months
RT @mervenoyann: VLMs go MoE ✨ @deepseek_ai dropped three new commercially permissive vision LMs based on SigLIP encoder and their DeepSee…
0
27
0
@HaoningTimothy
Haoning Wu
2 months
Hiking in a skiing resort in Vancouver. #NeurIPS2024
[4 images]
0
0
6
@HaoningTimothy
Haoning Wu
2 months
RT @wenhaocha1: This is crazy. I hope it’s not cherry pick. Definitely another big step to “true” later multimodal models for Gemini-2!
0
1
0
@HaoningTimothy
Haoning Wu
2 months
Finally a really capable any2any model in 2024!
@osanseviero
Omar Sanseviero
2 months
Gemini 2.0 Flash is out and it has lots of exciting things
- Audio (multilingual)+image generation
- Image editing
- Multimodal real-time API
- 2D/3D spatial understanding 🤯
- Great code capabilities
Try it: Docs:
[2 images]
0
0
2
@HaoningTimothy
Haoning Wu
2 months
RT @JustinLin610: I advise you to look at this. This is more huge for me!
0
5
0
@HaoningTimothy
Haoning Wu
2 months
Goodbye Singapore and see you in Vancouver! #NeurIPS2024
[1 image]
0
0
20
@HaoningTimothy
Haoning Wu
2 months
@MuCai7 Looking forward to discussions at the Workshop!
0
0
3
@HaoningTimothy
Haoning Wu
2 months
Looking forward to exciting discussions!
@LiJunnan0409
Li Junnan
2 months
I'll be at #NeurIPS 2024. Join me at the LongVideoBench Poster (East Exhibit Hall A-C, #4611) on Wed, Dec 11, 4:30–7:30 PM. Let’s chat about all things related to multimodal research!
0
0
1
@HaoningTimothy
Haoning Wu
2 months
RT @LiJunnan0409: Excited to share that Aria is now officially supported by Transformers! Huge thanks to @AymericRoucher and the @huggingfa…
0
2
0
@HaoningTimothy
Haoning Wu
2 months
RT @JustinLin610: 😓 I almost forgot we released something tonight... Yes, just the base models for Qwen2-VL lah. Not a big deal actually.…
0
134
0