RELIGN AI
@relignai
Followers: 3K
Following: 61
Statuses: 44
OPEN SOURCE REINFORCEMENT LEARNING FRAMEWORK. $RELIGN: 7AkSKHomPcrJHSgnKFmbrqKARR7PyDk9XoE1PHrtpump
Joined July 2022
about multi-step learning inference: a multi-step learning inference strategy decomposes a complex task into smaller, sequential steps. at each step, the model refines its understanding, incorporates the results of previous steps, and uses them to guide its reasoning. this iterative approach enables deeper analysis, reduces error propagation, and can produce more accurate decisions or predictions than single-step methods. GRPO math benchmarks loading.
1
5
28
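The multi-step strategy described in the post above can be sketched as a chain of steps that share a running state. This is only an illustration under stated assumptions: `run_steps`, `extract_numbers`, `total`, and `average` are hypothetical names, not part of relign, and each step here is an ordinary function where a real system would call a model.

```python
from typing import Any, Callable, Dict, List

State = Dict[str, Any]
Step = Callable[[State], State]


def run_steps(task: str, steps: List[Step]) -> State:
    """Run steps in order; each step sees everything earlier steps produced."""
    state: State = {"task": task, "trace": []}
    for step in steps:
        state = step(state)
    return state


def extract_numbers(state: State) -> State:
    # Step 1: pull the numeric tokens out of the task text.
    nums = [
        float(tok)
        for tok in state["task"].split()
        if tok.replace(".", "", 1).isdigit()
    ]
    return {**state, "numbers": nums, "trace": state["trace"] + ["extract_numbers"]}


def total(state: State) -> State:
    # Step 2: build on step 1's result instead of re-reading the task.
    return {**state, "total": sum(state["numbers"]), "trace": state["trace"] + ["total"]}


def average(state: State) -> State:
    # Step 3: combine the results of both earlier steps.
    count = len(state["numbers"])
    avg = state["total"] / count if count else None
    return {**state, "average": avg, "trace": state["trace"] + ["average"]}


result = run_steps("average of 4 8 and 12", [extract_numbers, total, average])
print(result["average"], result["trace"])
```

Keeping a `trace` of completed steps makes it possible to inspect where an intermediate result went wrong, which is one way the step-by-step structure limits error propagation.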
RT @adamcreates_: open source flywheel: 1. work on difficult problems 2. difficult problems attract high-caliber talent 3. high-caliber ta…
0
4
0
the relign framework turns base models into reasoning models. this allows all ai applications built on top of these models to think harder about complex problems, improving performance and output. relign is framework agnostic, so it can be used to post-train any open source base model.
1
13
43
deepseek GRPO implementation in progress. GRPO (group relative policy optimization) is the core rl algorithm that was used to drive deepseek's reasoning abilities. it helps a model learn by comparing a group of sampled actions against each other and making small, controlled updates based on those observations. it is a way to learn from experience without making drastic changes that could interfere with the goals of the training. developer bounty program active.
4
12
55
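A minimal sketch of the group-relative idea in the GRPO post above, in plain Python. It assumes one scalar reward per sampled response and sequence-level log-probabilities. The function names and the `clip_eps=0.2` default are illustrative assumptions and are not taken from the relign codebase. The KL penalty against a reference policy that GRPO normally includes is left out for brevity.

```python
import math
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Score each response relative to its own group: (r - mean) / std."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


def clipped_objective(logp_new, logp_old, advantages, clip_eps=0.2):
    """PPO-style clipped surrogate, averaged over the group (to be maximized)."""
    terms = []
    for lp_new, lp_old, adv in zip(logp_new, logp_old, advantages):
        ratio = math.exp(lp_new - lp_old)  # new policy / old policy
        clipped = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Taking the minimum keeps each update small and controlled.
        terms.append(min(ratio * adv, clipped * adv))
    return mean(terms)


# Four sampled answers to one prompt; reward 1.0 means the answer was correct.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
objective = clipped_objective(
    logp_new=[-1.0, -1.2, -0.9, -1.1],  # hypothetical log-probabilities
    logp_old=[-1.1, -1.1, -1.0, -1.2],
    advantages=advantages,
)
print(advantages, objective)
```

Because advantages are computed against the group mean, no separate value network is needed: responses that beat their siblings get a positive advantage and the rest get a negative one.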
roadmap
phase i - launch | complete
phase ii - develop a strong reasoning library | grpo, full docs, whitepaper, modularization, expand reasoning stack, expand dev community
phase iii - scaling up | cloud runner, 70B+ parameter reasoning model, relign toolkit, research program, reasoning-as-a-service platform
5
19
60
relign gitbook v1 live - covering concept, vision, mission, roadmap, and use cases. notable quote: "relign is a step towards artificial general intelligence. the technology has far-reaching applications, because it is a fundamental tool that teaches base models how to think. this allows the reasoning model to do anything a human is capable of - scientific research, software engineering, financial analysis, psychological analysis, the list goes on. reasoning models don't have a defined list of use cases. it is the future."
8
16
57
@BoochaoNA1 @bookwormengr hey @BoochaoNA1 we're working on an open source post-training framework for open source llms, would love to get you involved
0
0
2
@theanakin87 @picampusschool hey @theanakin87 we're working on an open source post-training framework for open source llms, would love to get you involved
0
0
2
@nickcdryan hey @nickcdryan we're working on an open source post-training framework for open source llms, would love to get you involved
0
0
4
@lateinteraction hey @omarkhattab we're working on an open source post-training framework for open source llms, would love to get you involved
0
0
4
@xlr8harder hey @xlr8harder we're working on an open source post-training framework for open source llms, would love to get you involved
1
0
3