Chenchen Ye @chenchenye_ccye profile

Chenchen Ye

@chenchenye_ccye

Followers

806

Following

840

Media

12

Statuses

28

CS PhD student @UCLA | Research Intern @Microsoft | Prev Undergrad @NUSingapore | LLM

https://t.co/6ur5dGv54k

Los Angeles, CA

Joined August 2022

Don't wanna be here? Send us removal request.

Explore tweets Explore followers Explore following

Explore trending content on Musk Viewer

#KawalPutusanMK • 2114271 Tweets

#TolakPilkadaAkal2an • 1376627 Tweets

#TolakPolitikDinasti • 1351916 Tweets

Tim Walz • 640378 Tweets

#jjk267 • 308712 Tweets

Nobara • 208736 Tweets

Oprah • 164684 Tweets

Botafogo • 160424 Tweets

Reza Rahadian • 105592 Tweets

Mulyono • 82604 Tweets

Sian • 70284 Tweets

Gus Walz • 69002 Tweets

Gege • 67577 Tweets

BOTO PARA SA SB19 • 37578 Tweets

Sabine • 30887 Tweets

TRAP U MEMBER PHOTO 1 • 30246 Tweets

チャイルドシート • 28708 Tweets

PilihDAMAI BarengPRABOWO • 24641 Tweets

LebihSEJUK LebihNYAMAN • 20412 Tweets

ポケットキャンプ • 19974 Tweets

Bolt • 19902 Tweets

ムアラニ • 19057 Tweets

KITA SEMUA TURUN • 16894 Tweets

BBFA LAST HURRAH • 15658 Tweets

鯉登少尉 • 14814 Tweets

首都高バトル • 14165 Tweets

ポケ森サ終 • 11423 Tweets

KKT熊本県民テレビ

서울시의회의장

Orlen

チッケム

Pastor Jerry

ゲーム先行

リーフチケット

中川くん

有料アプリ

学校の普通科

木村知事

一般事務職

悪魔ほむら

ポケキャン

Widjiatno Notomihardjo

買い切りアプリ

花道囲い席

オフライン版

乱舞音曲祭

#剣持リクエスト

BATALKAN BUKAN TUNDA

GCSE

中川大志

Last Seen Profiles

@DHSsection1

@UpscaleLimited

@Acid_130

@jgregorydesign

@thegailmuller

@u5b83

@MoniqueMaria24

@m8ng9zhang6pe4y

@MuskRifky

@estebanlozs

@GKambile

@BVB_Fanb

@Unite__0204

@Tejakaran4

@sjr38

@NillkinOfficial

@anthooooooo71

@sokanburuk31

@Deep_Chand_

@Skyjudge4NFL

Pinned Tweet

Chenchen Ye

@chenchenye_ccye

2 months

📢New LLM Agents Benchmark! Introducing 🌟MIRAI🌟: A groundbreaking benchmark crafted for evaluating LLM agents in temporal forecasting of international events with tool use and complex reasoning! 📜 Arxiv: 🔗 Project page: 🧵1/N

14

71

304

Chenchen Ye

@chenchenye_ccye

11 days

🚀 Excited to introduce our #ACL2024 Main Paper TCELongBench: Analyzing Temporal Complex Events with Large Language Models? A Benchmark towards Temporal, Long Context Understanding! 📰 As online news grows, the challenge of swiftly understanding complex events, spread across

1

12

82

Chenchen Ye

@chenchenye_ccye

2 months

🧵2/N We released our code, data and an iteractive demo: 💻 GitHub Repo: 📁 Dataset: 📊 Interactive Demo Notebook:

1

13

Chenchen Ye

@chenchenye_ccye

2 months

🧵11/N Sincere thanks to all amazing collaborators and advisors @acbuller , @Yihe__Deng , @HuangZi71008374 , @mingyu_ma , @Zhu_Yanqiao , and @WeiWang1973 for their invaluable advice and efforts! 🙏❤️

0

11

Chenchen Ye

@chenchenye_ccye

2 months

🧵 8/N Forecasting with Temporal Distance Our ablation study let agents predicts 1, 7, 30, and 90 days ahead. 📊Results: As days increases, F1📉and KL📈. Agent's accuracy drops for distant events. Longer ones anticipate trend shifts influenced by more factors and complexities.

1

0

10

Chenchen Ye

@chenchenye_ccye

2 months

🧵 4/N Forecasting Task 🔮 Forecasting involves collecting essential historical data and performing temporal reasoning to predict future events. 📅 Example: Forecasting cross-country relations on 2023-11-18 using event and news information up to 2023-11-17.

1

0

10

Chenchen Ye

@chenchenye_ccye

2 months

🧵 7/N Forecasting with Different Base LLMs 1️⃣ 📈 Code Block benefits stronger LLMs but hurts weaker models. 2️⃣ 🏆GPT-4o consistently outperforms other models. 3️⃣ 💪 Self-consistency makes a small model stronger.

1

0

10

Chenchen Ye

@chenchenye_ccye

2 months

🧵 6/N Agent Framework 💡 Think: Agent analyzes and plans the next action using API specs. ⚡ Act: Generates Single Function or Code Block to retrieve data. 🚀 Execute: Python interpreter runs the code for observations. These steps are repeated until reaching a final forecast.

1

0

10

Chenchen Ye

@chenchenye_ccye

2 months

🧵10/N Check our paper out for more details! 🌟 Code error analysis, different event types, variation of API types, and different agent planning strategies! Join us in advancing the capabilities of LLM agents in forecasting and understanding complex international events! 🚀

1

0

10

Chenchen Ye

@chenchenye_ccye

2 months

🧵 9/N Tool-Use Ordering in Forecasting 🗂️Tool-Use Transition Graph: Agents start with recent events for key info and end with news for context. 🧠 Freq.(correct) - Freq.(incorrect): Highlight the need for strategic planning in LLM agents for effective forecasting.

1

0

9

Chenchen Ye

@chenchenye_ccye

2 months

🧵 5/N APIs & Environment 💻 Our comprehensive APIs empower agents to generate code and access the database. 🔧 APIs include data classes and functions for various info types and search conditions. 🔄 Agents can call a single function or generate a code block at each step.

1

0

9

Chenchen Ye

@chenchenye_ccye

2 months

🧵3/N Data 🌐With 59,161 unique events and 296,630 unique news articles, we curate a test set of 705 forecasting query-answer pairs. (a)📊 Circular Chart: The relation hierarchy and distribution in MIRAI. (b-c) 🔥 Heatmap: Intensity of global events, from conflict to mediation.

1

0

9

Chenchen Ye

@chenchenye_ccye

11 days

Zhihan is at #ACL2024 in Bangkok to present our paper on new LLM benchmark TCELongBench for evaluating Temporal, Long Context Understanding! Catch her at Poster 📍Poster Session 1 ⏰ 8/12 at 11 AM (local time) For more details, check out our paper::

Analyzing Temporal Complex Events with Large Language Models? A...

The digital landscape is rapidly evolving with an ever-increasing volume of online news, emphasizing the need for swift and precise analysis of complex events. We refer to the complex events...

arxiv.org

Zhihan Zhang@ACL 2024

@zhihan72

11 days

Excited to attend #ACL2024 in Bangkok! I will present our newest LLM benchmark **TCELongBench**: 💥 Analyzing Temporal Complex Events with Large Language Models? A Benchmark towards Temporal, Long Context Understanding 💥 Come to Poster Session 1 on 8/12 at 11:00AM!

0

1

6

0

8

Chenchen Ye

@chenchenye_ccye

2 months

@nicolayr_ Thanks for sharing your thoughts, Nicolay! Your idea about forecasting from literature sounds really interesting!

1

0

2

Chenchen Ye

@chenchenye_ccye

2 months

@aviaviavi__ Thank you so much, Avi! Looking forward to hearing your thoughts!

0

1