![Mikhail Parakhin Profile](https://abs.twimg.com/sticky/default_profile_images/default_profile_x96.png)
Mikhail Parakhin
@MParakhin
Followers
20K
Following
5K
Statuses
2K
Joined March 2022
Off-topic for me, but that’s how it should be done. C++, of course, not C, but something fast and debuggable. For anything remotely complicated I never write scripts - just C++ and system()
I don't use Makefiles. I build my C projects with C. I literally use C code as Bash script to run the C compiler. Keep thinking you need CMake or Package Managers.
0
0
12
@Plinz The mistranslation of “gnosiology” as “epistemology” in English versions of Lenin’s works had a devastating effect on people using the word :-) It’s especially funny to read China Miéville’s modern rehashings.
0
0
4
@PseudoProphet @RhysSullivan I would claim most interactions with the Mac crowd are like this :-). Unlike iOS, Macs are incredibly outdated.
0
0
0
@max77sabers I think it is a really big step forward. I was very happy to see GRPO - very similar to NPO It is clearly a distillation to a large degree - that makes it look optically better than it really is.
0
0
5
@Dinilein01 @vinscribedotcom @gdb I assume Operator is crashing. @gdb, let me know if you need anything from us.
0
0
2
@K3vn_C @cosminnegruseri From the paper, Titans is a dynamic gisting - a way to compress the previously seen context on the fly. It is quite clever (gisting tends to work well, but I only ever saw it in a static form - compressing the prompt). It is a performance optimization, not a new capability.
0
0
1
@arivero True, but you need those sentinels in the model then and you need to train with those sentinels present. I kind of suspect that multistream will work better, but maybe adding sentinels is enough.
1
0
1
@cosminnegruseri No, it's the same amount of information, same quadratic algorithm. At a lower level it's about giving the model information about what's the coherent piece of text it needs to continue and what is a supplemental/parallel information
1
0
3
@cHHillee Totally. I'm in a violent agreement with this statement: use the correct programming model, optimize manually the hell out of the primitives. Only recently I started to think that MAYBE LLMs are able to do the optimization part, so the programming model is between Human <-> LLM.
1
0
9
@Abhijee14150265 I suspect we will always need readable code. We already have compilers, the real machine code is very hard to read, and yes, with LLMs we will shift to even higher level of abstraction, but you still need to communicate and review your instructions later.
1
0
1