![Cheng Luo Profile](https://pbs.twimg.com/profile_images/1815960820037431296/Gw18xCy5_x96.jpg)
Cheng Luo
@ChengLuo_lc
Followers: 14
Following: 29
Statuses: 13
RT @ChengLuo_lc: 🤩🤩 we introduce MST, a memory-efficient transformer, reducing intermediate memory usage and enabling longer sequence train…
RT @rohanpaul_ai: Really 👀 new Paper, MINI-SEQUENCE TRANSFORMER claims to extend the maximum context length of Qwen, Mistral, and Gemma-2 b…
RT @papers_anon: Training Ultra Long Context Language Model with Fully Pipelined Distributed Transformer. Saw a 16x increase in sequence le…
RT @AnimaAnandkumar: Introducing long-context transformer using mini sequences. It is a simple and effective method for highly efficient an…