
William J.B. Mattingly
@wjb_mattingly
Followers
3K
Following
4K
Media
658
Statuses
4K
Digital Nomad · Historian · Data Scientist · NLP · Machine Learning Cultural Heritage Data Scientist @Yale Former @SIDataScience @huggingface Fellow 🤗
Fort Myers, FL
Joined May 2020
Want to do a full finetune of Dots.OCR? I've got a fork working! It handles the conversion of data from PageXML (Transkribus) to Dots.OCR format for you! (Link down below.) The first models are already on @huggingface and working as expected. Still training them.
5
5
58
Throwback to OpenAI 5 years ago. This appeared in my YouTube feed. By the way, @twominutepapers is a great channel to watch.
0
0
3
Yessssss.
SAM 2 by @AIatMeta has finally been integrated into @huggingface Transformers! 🔥 It's a generalization of SAM 1 to video, allowing you to segment and track something you care about across a sequence of frames. SOTA performance, Apache 2.0 license.
0
0
1
RT @vllm_project: 🚀 Amazing community project! vLLM CLI, a command-line tool for serving LLMs with vLLM: ✅ Interactive menu-driven UI & s…
0
186
0
RT @Prince_Canuma: LFM2-VL is done ✅ M3 Max stats: full precision (~250 tok/s), 4-bit quant (~530 tok/s)
0
40
0
RT @osanseviero: Introducing Gemma 3 270M 🔥 🤏 A tiny model! Just 270 million parameters. 🧠 Very strong instruction following. 🤖 Fine-tune in…
0
334
0
I'm working on a few finetunes for LFM2-VL for medieval texts. Any app developers interested in teaming up and building a prototype that would use these models to take pics of a manuscript and transcribe it/generate metadata about it? Also open to non-medieval stuff and general.
Two Weeks. $10K. No Excuses. Hack-01 is still live, and you’ve got until Aug 20, 2025 at 12 PM PST to ship your on-device AI build. ⚙️Tools: LFM2 + LEAP.💰Prizes: $10K every 2 weeks.📍Where: Discord (yes, you need to join). Build private, real-time AI on the edge — or just keep
0
0
1
@LiquidAI_ 3) Speed and size. This is the real appeal of this model. It's tiny and fast. I see this potentially replacing TrOCR for some line-level transcription workflows. It's smaller, faster, and has a broader understanding of language (vibes only here).
0
0
0
@LiquidAI_ 2) Word-level accuracy is clearly improving. This will only get better as the text decoder learns medieval languages better. This is where I believe future checkpoints will improve.
0
0
0