![SkalskiP Profile](https://pbs.twimg.com/profile_images/1557421659925237763/RzJu2YI3.jpg)
SkalskiP
@skalskip92
Followers
30K
Following
15K
Media
2K
Statuses
8K
Open-source Lead @roboflow. VLMs. GPU poor. Dog person. Coffee addict. Dyslexic. | GH: https://t.co/dEmzMDGq5H | HF: https://t.co/4Lx1Yw34W7
Kraków, Polska
Joined February 2014
.@Arsenal is looking for people to help them build football Al
football AI code is finally open-source. - player detection and tracking.- team clustering.- camera calibration. I still need to work on README; don't judge me on that. code:
30
141
1K
I fine-tuned my first vision-language model. PaliGemma is an open-source VLM released by @GoogleAI last week. I fine-tuned it to detect bone fractures in X-ray images. thanks to @mervenoyann and @__kolesnikov__ for all the help!. ↓ read more
30
193
1K
tomorrow, I'll be hosting a talk for @MIT; I'll be speaking about open-source computer vision tools. 12:00 PM PST / 03:00 PM EST / 09:00 PM CET. we'll be streaming on X:
15
126
1K
YOLOv9 is out. looks like a new SOTA real-time object detector. I'm already working on a custom training tutorial
YOLOv9. Learning What You Want to Learn Using Programmable Gradient Information. Today's deep learning methods focus on how to design the most appropriate objective functions so that the prediction results of the model can be closest to the ground truth. Meanwhile, an appropriate
24
159
1K
Florence-2 + SAM-2. SAM-2 doesn't understand language on its own, but Florence-2 does. I'm having a lot of fun with this combo! The first version of my @huggingface space is already online. link:
16
139
917
would you watch a stream where I show how to build analytics system like this with supervision?. - detection filtering with polygon zones.- object tracking.- customizable annotators.- line zones with in/out counters.- per-class counts. supervision repo:
supervision-0.24.0 is out! you can finally count per-class line crossings. many of you have been asking for this, now we have it! . it took me barely 30 minutes to make this demo using supervision!. link:
47
72
913
analyzing store traffic to find the most frequently visited areas. super demo created by @Hine__Po - member of Supervision community. link to repo if you want to build something over the weekend:
13
145
801
I managed to fine-tune @OpenAI GPT-4o for object-detection task!!!. here's a veeeery dirty colab:
21
68
801
parking occupancy analysis. calculation of percentage occupancy in individual parking zones. all this was done with supervision: btw, @UenoLeo is cooking a blog post covering this project, so stay tuned!. ↓ read more
13
94
692
Sports Analytics with GPT-4 Vision. I wondered whether GPT-4V had the capability to automatically separate players into teams based on the color of their uniforms. It took me a ridiculously long time to create this image, but in the meantime, I learned a lot about GPT-4V.
supervision-0.13.0 is out! We added ByteTrack support! Now you can easily plug in any object detector and use it for tracking. GitHub repository:
19
86
656
- Object detection over HTTP? .- Easy! . We just open-sourced our inference server under Apache 2.0. Left terminal: @roboflow inference.Right terminal: video client
6
76
652
YOLO-World + EfficientSAM + StableDiffusion for language-guided inpainting. I was inspired yesterday by the work of @MrDravcan (see attached), and I decided to try to replicate it. SPOILER ALERT: it didn't quite work out for me. ↓ read more
16
95
596
segment anything 2 (SAM2) is out; I have been waiting for this for a long time!. I spent most of my morning playing with the model. here's the initial version of my tutorial notebook. I'll be updating it to include all the cool stuff.
Introducing Meta Segment Anything Model 2 (SAM 2) — the first unified model for real-time, promptable object segmentation in images & videos. SAM 2 is available today under Apache 2.0 so that anyone can use it to build their own experiences. Details ➡️
12
61
600
player clustering component of my Football AI project is pushed to GitHub. - feature extraction with SigLIP.- dimensionality reduction with UMAP.- clustering with KMeans . code:
no more new VLMs? . I'm finally working on a YouTube tutorial for my football AI project; the tutorial should be out next week. stay tuned:
12
59
595