Ge Zhang Profile
Ge Zhang

@GeZhang86038849

Followers
2K
Following
714
Statuses
638

Founder: M-A-P(https://t.co/CGWz8JrHzH) Contribution @Tiger_Lab @Yi-01.AI @Bytedance MAP-Neo, MMMU, MAmmoTH, MMLU-pro, Yi, COIG, YuE

Joined April 2021
Don't wanna be here? Send us removal request.
@GeZhang86038849
Ge Zhang
5 hours
@laurence_ai @xidulu If talking about the brief intuition, broadly speaking, it is indeed MoE. But if you check the mechanism detailedly, it is way more beautiful. And it is practical.
0
0
0
@GeZhang86038849
Ge Zhang
6 hours
@_akhaliq More details found here๏ผš
0
0
3
@GeZhang86038849
Ge Zhang
7 hours
@cognitivecompai check the email affiliation maybe. I have no idea but these talented folks may already have it training in progress.
0
0
0
@GeZhang86038849
Ge Zhang
8 hours
Thanks for sharing our work!
@_akhaliq
AK
10 hours
Generating Symbolic World Models via Test-time Scaling of Large Language Models
Tweet media one
0
0
5
@GeZhang86038849
Ge Zhang
6 days
Around one week remaining for submission! Share your insights about foundation models with us in SCI-FM @ ICLR 2025!
@sivil_taram
Qian Liu
1 month
๐ŸŽ‰ Announcing the first Open Science for Foundation Models (SCI-FM) Workshop at #ICLR2025! Join us in advancing transparency and reproducibility in AI through open foundation models. ๐Ÿค Looking to contribute? Join our Program Committee: ๐Ÿ” Learn more at: #OpenScience #MachineLearning #FoundationModels 1/N
Tweet media one
0
2
10
@GeZhang86038849
Ge Zhang
6 days
[7/n] Resources: I personally own very little credit of the paper. Amazing work still, glad to contribute to it and help the Bytedance's engineers. We do notice that Code-LLM has significantly changed how coders work!
0
0
2
@GeZhang86038849
Ge Zhang
14 days
@soldni Wish that Yi and deepseek have the support of investigators and are truly with more than 10k H800s. Then MAP may not be soooooo poor as well.
1
0
1
@GeZhang86038849
Ge Zhang
14 days
@soldni The rumor is just crazy. At least in the last half of 2024, I donโ€™t think that they own much more GPUs than Yi. These Chinese entrepreneur LLM teams are just poor.
0
0
0
@GeZhang86038849
Ge Zhang
14 days
M-A-Pโ€˜s Chinese New Year gift to the Open-Source Community. LLaMA moment of music foundation model! The first open-source model with Suno-v3.5 Level performance! Your personal music AI assistant is already here.
@abc43992899
Ruibin Yuan
14 days
1/n: ๐Ÿš€ Announcing YuE (ไน) โ€“ the most powerful open-source full-song music generation model! ๐ŸŽต Tackle the lyrics-to-song task (like with support for diverse genres, stunning vocals, & multiple languages. Bonus? Itโ€™s Hugging Face & LLAMA-compatible for easy fine-tuning. ๐Ÿ› ๏ธ Code: Demo:
0
1
14
@GeZhang86038849
Ge Zhang
14 days
@alexandr_wang This manโ€˜s success is a representative tragedy of Chinese Americans. African Americans and Jews win their rights by fighting. You folks win by kneeing and faking no bias against you in academics, career, etc while there is too much.
2
0
14
@GeZhang86038849
Ge Zhang
14 days
Amazing! Love the work!
@linzhengisme
Lin Zheng
19 days
๐Ÿš€ Meet EvaByte: The best open-source tokenizer-free language model! Our 6.5B byte LM matches modern tokenizer-based LMs with 5x less data & 2x faster decoding, naturally extending to multimodal tasks while fixing tokenization quirks. ๐Ÿ’ป Blog: ๐Ÿงต 1/9
Tweet media one
0
0
2
@GeZhang86038849
Ge Zhang
16 days
These idiots saying this know nothing about training an LLM and don't read paper. They only know emotional geopolitics instead of rational engineering and science. The cost of Yi-Lightning and MiniMax-01 is no larger and even smaller. Answer is here for a long time.
@nealkhosla
Neal Khosla
17 days
deepseek is a ccp state psyop + economic warfare to make american ai unprofitable they are faking the cost was low to justify setting price low and hoping everyone switches to it damage AI competitiveness in the us dont take the bait
0
0
9