jn2clark Profile Banner
jesse Profile
jesse

@jn2clark

Followers
278
Following
610
Statuses
351

Founder @marqo_ai, multimodal search, https://t.co/VVojrbcdBF. Made robots see & learn @ Amazon RAI. Ex physicist @ Stanford & UCL, https://t.co/xkht6fb7XQ

Joined December 2022
Don't wanna be here? Send us removal request.
@jn2clark
jesse
3 months
Today @marqo_ai open-weight (Apache 2.0) released the two best embedding models for ecommerce search and recommendations available anywhere. Marqo ecommerce models significantly outperform models from Amazon, Google, Cohere and Jina (see below). Fun fact: we had to create a significantly smaller and easier evaluation dataset just to accommodate some of the private models! + Up to 88% improvement on the best private model, Amazon-Titan-Multimodal (and better than Google Vertex, Cohere). + Up to 31% improvement on the best open source model, ViT-SO400M-14-SigLIP. + 5ms single text/image inference (A10g). + Up to 231% improvement over other bench-marked models (see blog below). + Evaluated on over 4M products across 10,000's of categories. + Detailed performance comparisons across three major tasks: Text2Image, Category2Image, and AmazonProducts-Text2Image. + Released 2 evaluation datasets: GoogleShopping-1m and AmazonProducts-3m. + Released evaluation code. + Apache 2.0 model weights available on @huggingface and to test out on Hugging Face Spaces.
Tweet media one
Tweet media two
4
5
26
@jn2clark
jesse
1 month
@SeunghyunSEO7 We see this a lot in contrastive learning which also couples problem difficulty
0
0
1
@jn2clark
jesse
1 month
Seems to be quite noticeable style differences between o1 and gpt-4o for python. o1 is technically correct but seems to completely ignore a lot of popular ways of doing things. I suspect it is the training data, generated vs scraped?
0
0
1
@jn2clark
jesse
2 months
@AravSrinivas Bluetooth connectivity
1
0
2
@jn2clark
jesse
2 months
a few more
@elder_plinius
Pliny the Liberator 🐉
2 months
🧃 THE FORBIDDEN JUICE 🧃 OpenAI’s reasoning models won’t process these perfectly harmless tokens! Why is that? 🤔 Juice: 128
Tweet media one
0
0
0
@jn2clark
jesse
2 months
Tweet media one
0
0
8
@jn2clark
jesse
2 months
@karpathy patches
0
0
1
@jn2clark
jesse
2 months
@Spotify did you kill the DJ?
0
0
0
@jn2clark
jesse
2 months
I just...asked it...
Tweet media one
1
0
1
@jn2clark
jesse
3 months
@indigoorton Cloudflare sales playing 4D chess
1
0
0
@jn2clark
jesse
3 months
Tweet media one
1
0
0
@jn2clark
jesse
3 months
RT @marqo_ai: Personalised feeds, starter packs, and lists? This platform is next level for creative content sharing Check out our @bluesk
0
2
0
@jn2clark
jesse
3 months
Come say hi!
Tweet media one
0
0
1
@jn2clark
jesse
3 months
Tweet media one
0
0
1
@jn2clark
jesse
3 months
RT @elsleightholm: breaking down @marqo_ai's new, state-of-the art embedding models for #ecommerce 🔥 the best part, these models are open…
0
5
0
@jn2clark
jesse
3 months
Love to see more data tooling!
@dvilasuero
Daniel Vila Suero
3 months
Let's go @marqo_ai team! Here's how to start using their latest open dataset for exploration, labelling, and/or curation without leaving the @huggingface Hub.
0
1
2
@jn2clark
jesse
3 months
RT @marqo_ai: What a week for @marqo_ai on @huggingface! 🤗 1. We released two of the best embedding models for #Ecommerce Search and Recom…
0
8
0
@jn2clark
jesse
3 months
@leavittron @datologyai Very interesting! Why not use SigLIP instead of CLIP?
1
0
2