Ziniu Hu

@acbuller

Followers
2,184
Following
747
Media
19
Statuses
75
@acbuller
Ziniu Hu
2 months
Thrilled to receive the KDD Dissertation Award Runner-Up for my PhD work on Neural-Symbolic Reasoning. Sincere thanks to my PhD advisors @YizhouSun and @kaiwei_chang, and my letter supporters @yisongyue and @jhamrick. Thanks to the award committee @kdd_news for such an honor.
18
11
180
@acbuller
Ziniu Hu
10 months
Interested in LLM + Tool-Use via Tree-Search? This afternoon at #NeurIPS2023, #215, I'll present "AVIS: Autonomous Visual Information Seeking with Large Language Model Agent". Feel free to drop by and chat.
2
26
149
@acbuller
Ziniu Hu
1 year
🤔 How can a Large Language Model (LLM) agent utilize diverse tools via Tree Search 🔍? In AVIS, we enable the LLM agent to dynamically traverse a transition graph with a self-critic (when one path is not informative, it backtracks to the previous state). This achieves SOTA VQA results.
@GoogleAI
Google AI
1 year
Today on the blog, read all about AVIS — Autonomous Visual Information Seeking with Large Language Models — a novel method that iteratively employs a planner and reasoner to achieve state-of-the-art results on visual information seeking tasks →
36
212
812
2
25
130
@acbuller
Ziniu Hu
1 year
Can LLMs play the hidden-identity board game "Resistance Avalon"? Check out: Code: In this work, we built a game engine, AvalonBench, with several fixed-rule baselines. We found that ChatGPT 3.5 still cannot beat simple rules.
1
13
76
@acbuller
Ziniu Hu
4 months
How to control LLM behavior with LLM-as-a-judge? Check our paper: "Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller" Website: Paper: Code:
2
12
46
@acbuller
Ziniu Hu
2 months
🪫🚀 🔋
@xai
xAI
2 months
3K
2K
9K
4
0
42
@acbuller
Ziniu Hu
2 years
Excited to receive the #SoCalNLP Best Paper Award for our paper "Empowering Language Models with Knowledge Graph Reasoning for Question Answering". The paper link is: Thanks to the organizers and all the great collaborators!
@ucsbNLP
UC Santa Barbara NLP Group
2 years
Our @MegagonLabs Best Paper Award winner was "Empowering Language Models with Knowledge Graph Reasoning for Question Answering" by Ziniu Hu et al from UCLA! Paper link: Thank you to award sponsor @MegagonLabs for supporting our event! (4/4)
0
0
12
3
8
30
@acbuller
Ziniu Hu
4 months
Can an LLM be the "world model🌍" to predict the future? Our new benchmark evaluates LLM agents on forecasting international events (conflict ⚔️ vs. mediation 🤝). We provide agents with Python APIs to interact with diverse tools and knowledge sources, and SoTA GPT-4o + code achieves 32% F1.
@chenchenye_ccye
Chenchen Ye
4 months
📢New LLM Agents Benchmark! Introducing 🌟MIRAI🌟: A groundbreaking benchmark crafted for evaluating LLM agents in temporal forecasting of international events with tool use and complex reasoning! 📜 Arxiv: 🔗 Project page: 🧵1/N
15
72
304
0
2
26
@acbuller
Ziniu Hu
8 months
🎹 How can we make a music diffusion model a really useful tool for music composition? Check out our work on non-differentiable rule-guided diffusion. It controls music generation with rules (e.g., chord progression) in a training-free, plug-and-play manner.
@YujiaHuangC
Yujia Huang
8 months
Excited to share our work on symbolic music generation! We introduce a symbolic music generator with non-differentiable rule guided diffusion models, enabling musicians to effectively use it as a compositional tool. Website: . 🧵👇
4
34
197
0
3
19
@acbuller
Ziniu Hu
2 years
Hi #NeurIPS2022! This afternoon at 4pm at poster 211, we'll present our work "Improving Multi-Task Generalization via Regularizing Spurious Correlation". If you're interested in the unique challenges of out-of-distribution generalization for Multi-Task Learning, please come and chat!
@acbuller
Ziniu Hu
2 years
Interested in how Spurious Correlation affects Multi-Task Generalization (especially in the out-of-distribution setting)? Check out our #NeurIPS2022 spotlight paper: I will present the poster at Hall J 211 on Thursday 2-4pm (Dec 1). Please drop by and chat!
1
0
12
0
0
16
@acbuller
Ziniu Hu
1 year
#CVPR2023 This afternoon from 4pm, I will present our CVPR ✨highlight paper, REVEAL, at Exhibit Halls ABC 264. If you're interested in augmenting large visual-language models with external and up-to-date knowledge, please drop by and chat~
@GoogleAI
Google AI
1 year
Learn how REVEAL, an end-to-end retrieval-augmented visual-language model that learns to use multi-source multi-modal data to answer knowledge-intensive queries, achieves state-of-the-art results on visual question answering and image caption tasks.
16
89
279
1
1
14
@acbuller
Ziniu Hu
1 year
How can physical laws help GraphODE? Check out our recent work that incorporates time-symmetry as a regularization for multi-agent dynamical systems.
@HuangZi71008374
Zijie Huang@Neurips2024
1 year
🧐 Can neural simulators softly satisfy multiple physical constraints? Check out: We propose TANGO, a physics-informed GraphODE that injects time-reversal symmetry and in the meanwhile numerically benefits various dynamical systems.
1
8
37
0
0
9
@acbuller
Ziniu Hu
1 year
Interested in building 🚀fast and efficient #GraphNeuralNetworks for large-scale data? Check out our recent survey led by Shichang, which provides a clear taxonomy of GNN acceleration and suggests future directions.
@ShichangZhang
Shichang (Ray) Zhang
1 year
Our survey on #GraphNeuralNetwork acceleration is now on arXiv: . We have consolidated #GNN acceleration algorithms, systems, and customized hardware. Any comments or questions are highly appreciated! @YizhouSun @acbuller @HZJ_jingjing @eiclab @UCLA_DM
0
4
16
0
1
8
@acbuller
Ziniu Hu
10 months
Tweet media one
0
0
7
@acbuller
Ziniu Hu
1 year
The game requires both complicated decision-making and language skills (including cooperation, deception and deduction). We hope it can serve as a test-bed for future research of LLM Agent. Thanks for the contributions by @JonathanMLight , Min Cai and @shengs1123
0
0
4
@acbuller
Ziniu Hu
2 months
Try "sus-column-r" if you're interested in playing with Grok:
@lmarena_ai
lmarena.ai (formerly lmsys.org)
2 months
Come chat with the model at !
4
3
66
0
0
4
@acbuller
Ziniu Hu
4 months
Self-Control enables fine-grained control for a wide range of tasks, including emotional modulation, ensuring harmlessness, and enhancing complex reasoning (GSM8K, comparable with CoT-Decoding), and achieves 0 error on the privacy leakage task.
1
0
0
@acbuller
Ziniu Hu
2 years
After proving our hypothesis via theoretical and empirical analysis, we propose a Multi-Task Causal Representation Learning (MT-CRL) framework that 1) learns disentangled neural modules; 2) learns a Task-to-Module Causal Graph; and 3) regularizes spurious correlation over the learned causal graph.
1
0
2
@acbuller
Ziniu Hu
10 months
@dezhou Yeah it can. The tree search framework is quite general; one concurrent work, Tree-of-Thought (ToT), has many interesting results on pure-text problems. The major differences from ToT are: 1) a transition graph as a prior to narrow down the search space; 2) working memory to avoid taking the same paths.
0
0
1
@acbuller
Ziniu Hu
4 months
The searched gradient for each task can also provide insights for interpreting LLMs: different tasks involve different Transformer layers, suggesting that such behaviors are "stored" in different places in the Transformer.
1
0
1
@acbuller
Ziniu Hu
2 years
MT-CRL could improve multi-task generalization (especially with distribution shift), and we show that it could indeed help alleviate spurious correlation. Without MT-CRL, the children's movie recommendation could be associated with violent words. MT-CRL could address this issue.
0
0
1