Carlos Lassance Profile
Carlos Lassance

@cadurosar

Followers
438
Following
277
Statuses
249

MTS @ Cohere, constantly trying to make Information Retrieval work better, while making mistakes in the process.

Grenoble
Joined March 2018
@cadurosar
Carlos Lassance
13 days
RT @nadiinchi: Excited to share that Provence is accepted to #ICLR2025! Provence is a method for training an efficient & high-performing c…
0
5
0
@cadurosar
Carlos Lassance
1 month
RT @cohere: Today, we’re launching early access for North! Our all-in-one secure AI workspace platform combines LLMs, search, and agents i…
0
99
0
@cadurosar
Carlos Lassance
2 months
RT @Nils_Reimers: 𝐋𝐚𝐮𝐧𝐜𝐡 𝐨𝐟 𝐂𝐨𝐡𝐞𝐫𝐞 𝐑𝐞𝐫𝐚𝐧𝐤 𝟑.𝟓 - 𝐁𝐨𝐨𝐬𝐭 𝐲𝐨𝐮𝐫 𝐒𝐞𝐚𝐫𝐜𝐡 🚀 What is new: - Large gains in multilingual retrieval 🇺🇳 - Reasoning Ca…
0
21
0
@cadurosar
Carlos Lassance
4 months
RT @Nils_Reimers: Aya-Expanse, the strongest open weights multilingual LLM, was just released by @CohereForAI It beats Llama 70B multilin…
0
41
0
@cadurosar
Carlos Lassance
4 months
RT @aidangomez: Your search can see now. We're excited to release fully multimodal embeddings for folks to start building with! https://t.…
0
72
0
@cadurosar
Carlos Lassance
4 months
RT @nadiinchi: Do not miss an application deadline for #ALPS2025 on October 15! ALPS is an Advanced Language Proce…
0
7
0
@cadurosar
Carlos Lassance
5 months
@antonio_mallia @prithivida @MrParryParry No they are not the same, I was going over the data that @prithivida shared, but from looking at the opensearch post they are not using sparse embed
2
0
0
@cadurosar
Carlos Lassance
5 months
@antonio_mallia @prithivida @MrParryParry What I mean is that they have more than 1 dimension per sparse embedding. For example SparseEmbed𝐿 64 has 64 dimensions per embedding. In my view, this is like storing 64 times the information per token you store on the database
1
0
2
@cadurosar
Carlos Lassance
5 months
@prithivida @antonio_mallia @MrParryParry Just my two cents: 1. They use less expansion, but way more information, their actual smallest flops on the table is 11.86 (0.74 flops with 16 dims) 2. It is easy to reduce FLOPS in domain, it is hard to make it work OOD (SparseEmbed𝐿 64 = SPLADE++ on BEIR)
1
0
1
@cadurosar
Carlos Lassance
5 months
@antonio_mallia To me it is simply a question of data, data, data. They are using pretraining data and might be using more than just msmarco to train the model (even if not using data that is on BEIR)
1
0
0
@cadurosar
Carlos Lassance
5 months
@antonio_mallia @MrParryParry Hey Antonio, that's an old study and on in-domain data. FLOPS becomes more important as you go out-of-domain and it is correlated with inference speed (but not perfectly, it really depends on the internal search algorithm).
0
0
1
@cadurosar
Carlos Lassance
6 months
@alexlimh23 @cohere @Nils_Reimers Awesome to have you join! Looking forward to working together
0
0
0
@cadurosar
Carlos Lassance
6 months
RT @nadiinchi: I will present our study on Multilingual Retrieval-augmented generation, tomorrow at #ACL2024NLP workshop on Knowledgeable L…
0
4
0
@cadurosar
Carlos Lassance
10 months
0
0
1
@cadurosar
Carlos Lassance
10 months
@srchvrs It also reminds me of HyDE where instead of using the query you use a hypothetical document as your search anchor:
1
0
4
@cadurosar
Carlos Lassance
10 months
RT @sylvieshi00: Looking for a team lead to join our search team at @cohere working with @Nils_Reimers and many other kind & smart people.…
0
13
0
@cadurosar
Carlos Lassance
10 months
RT @cohere: Announcing the private beta of our newest foundation embedding model, Cohere Compass: designed specifically for multi-aspect da…
0
51
0
@cadurosar
Carlos Lassance
11 months
RT @Nils_Reimers: 0⃣ 𝐖𝐨𝐫𝐥𝐝 𝐅𝐢𝐫𝐬𝐭 𝐁𝐢𝐧𝐚𝐫𝐲 𝐕𝐞𝐜𝐭𝐨𝐫 𝐃𝐚𝐭𝐚𝐛𝐚𝐬𝐞 1⃣ Happy to annouce the world first 𝐁𝐢𝐧𝐚𝐫𝐲 𝐕𝐞𝐜𝐭𝐨𝐫 𝐃𝐚𝐭𝐚𝐛𝐚𝐬𝐞 (for educational purpos…
0
62
0
@cadurosar
Carlos Lassance
11 months
@andersonbcdefg check dm's
0
0
1