til17792 Profile Banner
noni Profile
noni

@til17792

Followers
4
Following
15
Statuses
121

training LLMs, and building agents.

Joined October 2023
Don't wanna be here? Send us removal request.
@til17792
noni
1 month
SmolAgents by Hugging Face is here! Your lightweight Pythonic toolkit for building powerful AI agents. Let’s trace the origins of the universe with this new release. 🌌 Here’s how to get started. 👇
1
0
0
@til17792
noni
2 days
@paraschopra lead the way
0
0
0
@til17792
noni
2 days
but I just woke up
@bryan_johnson
Bryan Johnson /dd
2 days
Go to bed. Now.
0
0
0
@til17792
noni
3 days
one more 'attention is all you need'
@naval
Naval
3 days
The currency of life isn’t money. It is not even time. It’s attention.
0
0
1
@til17792
noni
4 days
@manasjsaloi tried learnlm by Google for learning?
0
0
0
@til17792
noni
7 days
@pdhsu dude, come on give some actual novel findings instead of just hyping
1
0
3
@til17792
noni
7 days
@elonmusk 'doge support' sounds better
0
0
0
@til17792
noni
7 days
@tunguz yes, uncompresses cot ftw!
0
0
0
@til17792
noni
7 days
@mignano what made you convinced? anything you want to share?
0
0
0
@til17792
noni
7 days
@BasedBeffJezos Hmm, would like to see some novel findings instead of the whole CoT if you can share.
1
0
6
@til17792
noni
7 days
@emollick seems like what entropix was trying to do
0
0
0
@til17792
noni
14 days
just google it moron
@nisten
nisten - e/acc
15 days
it's touch filename you f*****g moron read a book
0
0
0
@til17792
noni
14 days
@tannuiscoding
Tannu Choudhary
15 days
I'm seeing people getting into FAANG, and I'm here doing basics of DSA and struggling academically...
0
0
0
@til17792
noni
16 days
o3 live in GitHub copilot already and it goes brrrrr
@kimmonismus
Chubby♨️
16 days
o3-mini tomorrow confirmed it seems. Reposted by Adam, OpenAI researcher.
Tweet media one
0
0
0
@til17792
noni
16 days
open the laptop
@sajal_batra
Sajal Batra
16 days
Just bought my first mac ever🥳 What to do first?
Tweet media one
0
0
1
@til17792
noni
17 days
love it
@SauravDassss
Saurav Das
17 days
USA gave the world Open AI China now gives DeepSeek India meanwhile— “A-I”
0
0
0
@til17792
noni
18 days
@shaneguML
Shane Gu
19 days
In 2022, after feeling AGI I wrote 2 papers as a newcomer from RL: - "LLMs are zero-shot reasoners" - "LLMs can self-improve" The last two academic papers I worked on before I moved on to ChatGPT and Gemini. So yeah o1/DeepSeek/gemini-thinking are amazing but not surprising :)
Tweet media one
Tweet media two
0
0
0
@til17792
noni
18 days
apart from gemini, is there any other provider which provides free but limited llm api access? please help in comments
0
0
1
@til17792
noni
18 days
groq groq
@chamath
Chamath Palihapitiya
19 days
Several important questions/comments come to my mind as I read more about DeepSeek. Listing them here: 1) Let’s give 1% probability to all the conspiracy theories upfront so we can address it and move on. If it is possible for China/Chinese companies to use shell companies in Singapore or other countries to be a “beard” to buy otherwise export controlled chips from Nvidia and use them for AI training, this likely needs to be investigated and adjudicated. 2) The battle of usage is now more about AI inference vs Training. We always knew this day would come but it probably surprised many that it could be this weekend. With a model this cheap, many new products and experiences can now emerge trying to win the hearts and minds of the global populace. Team USA needs to win here. To that point, while we may still want to export control AI Training chips, we should probably view Inference chips differently - we should want everyone around the world using our solutions over others. I can explain my reasoning as follows: we should never export our knowledge of enriching uranium to be weapons grade to other countries but we should export our ability to build nuclear energy (which requires far less sophistication) if it can help advance American priorities and leadership abroad. Training and Inference can be roughly equated this way. (Disclaimer: Groq, of which I’m a shareholder, is in this game so this benefits me tbf.) 3) We need to cooperate with our allies (especially those in the ME) to stand up the necessary infrastructure to enable Inference - Data centers, subsidized energy etc. all around the world ASAP.. They pay to build it, we supply the Inference hardware and the software to run the clouds. We need this buildout to happen ASAP. This is clearly our version of Belt and Road and we need to take it as seriously as China took their version, similarly named. 4) There will be volatility in the stock market as capital markets absorb all of this information and re-price the values of the Mag7. Tesla is the least exposed, the rest are exposed as a direct function of the amount of CapEx they have publicly announced. Nvidia is the most at risk for obvious reasons. That said, markets will love it if Meta, Microsoft, Google etc can win WITHOUT having to spend $50-80B PER YEAR. 5) The innovation from China speaks to how “asleep” we’ve been for the past 15 years. We’ve been running towards the big money/shiny object spending programs (AI is not the first and it likely won’t be the last) where we (Team USA) have thrown hundreds of billions of dollars at a problem vs thinking through the problem more cleverly and using resource constraints as an enabler. Let’s get our act together. We need all the bumbling middle managers out of the way - let the engineers and the brilliant folks we have actually working on this stuff to cook! More spending, more meetings, more oversight, more weekly reports and the like does not equate to more innovation. Unburden our technical stars to do their magic. 6) Startups need to realize that they are “default dead” companies. This means that they must, by definition, grasp victory from the jaws of defeat. Meanwhile, VCs are asleep at the switch - massively overfunding marginal ideas. We need to get better at taking huge shots on goal and allocating capital to the best of these ideas. I worry that in this current melee, we’ve overspent billions on dumb features which these next-gen models will roll over in the next 12months or earlier. Lots of capital losses are coming. Crazily, I initially posted about DeepSeek a month ago! Comments/reactions appreciated.
0
0
0
@til17792
noni
18 days
Tweet media one
0
0
0