![Aravind Srinivas Profile](https://pbs.twimg.com/profile_images/1886811342780403712/F6IPogbf_x96.jpg)
Aravind Srinivas
@AravSrinivas
Followers
216K
Following
26K
Statuses
6K
Perplexity is building a very focused inference team that wants to host custom versions of open source models like DeepSeek and Llama with as high throughput and as low latency as possible. This blog post talks about our work on building in house networking. We will be sharing more soon on our post training work.
Using a custom RDMA-based networking library, we've been able to achieve 3200 Gbps GPU memory transfers, bypassing NCCL limits for 97.1% theoretical bandwidth efficiency. Our latest blog shares our journey of building a custom high-performance networking solution on AWS.
16
21
326
@caviterginsoy Weird. Could you send a screen record? Are you on the latest version of the app? @fayfers
1
0
4
@Klotzkette Sure. But it will also use a ton of other sources and can build way more cards than Wikipedia can.
1
0
5
@Pulkit_Saraf Will start showing up for celebrities and well known individuals, and then to businesses and events and related entities.
1
0
3