![Donald Shenaj Profile](https://pbs.twimg.com/profile_images/1874869931243630592/3VUhZkMh_x96.jpg)
Donald Shenaj
@DonaldShenaj
Followers
41
Following
63
Statuses
7
AI Research Intern @SamsungResearch UK | Ph.D. Student @UniPadova | Previously @Mila_Quebec @Concordia
Earth, Solar System
Joined March 2021
๐ธExcited to release ๐๐ผ๐ฅ๐.๐ฟ๐ฎ๐ฟ, a groundbreaking method for personalized content and style image generation ๐ฆ. ๐ Paper and video: Huge thanks to the co-authors: @OBohdal, Mete Ozay, Pietro Zanuttigh, and @umbertomichieli
4
3
9
@OBohdal @umbertomichieli ๐๐ผ๐ฅ๐.๐ฟ๐ฎ๐ฟ outperforms the current state of the art in both content and style fidelity, as validated by MLLM assessments and human evaluations.
0
0
0
@OBohdal @umbertomichieli โก๏ธ We achieved improved generation fidelity and footprints compared to ZipLoRA (2024 SOTA). ๐๐ผ๐ฅ๐.๐ฟ๐ฎ๐ฟ is 4000x faster in the merging process, uses 3x fewer parameters than a single subject-style combination of ZipLoRA.
0
0
0
@OBohdal @umbertomichieli ๐ค We proposed a new evaluation protocol with MARSยฒ, a new metric based on Multimodal Large Language Models, for better content-style fidelity assessment, which aligns closely with user preferences.
0
0
0
@OBohdal @umbertomichieli โจ We pre-trained a hypernetwork to enable zero-shot merging of unseen content-style LoRAs.
0
0
0
Awesome work from @joanrod_ai for Image-to-SVG code generation ๐ฆ
StarVector: Generating Scalable Vector Graphics Code from Images paper page: Scalable Vector Graphics (SVGs) have become integral in modern image rendering applications due to their infinite scalability in resolution, versatile usability, and editing capabilities. SVGs are particularly popular in the fields of web development and graphic design. Existing approaches for SVG modeling using deep learning often struggle with generating complex SVGs and are restricted to simpler ones that require extensive processing and simplification. This paper introduces StarVector, a multimodal SVG generation model that effectively integrates Code Generation Large Language Models (CodeLLMs) and vision models. Our approach utilizes a CLIP image encoder to extract visual representations from pixel-based images, which are then transformed into visual tokens via an adapter module. These visual tokens are pre-pended to the SVG token embeddings, and the sequence is modeled by the StarCoder model using next-token prediction, effectively learning to align the visual and code tokens. This enables StarVector to generate unrestricted SVGs that accurately represent pixel images. To evaluate StarVector's performance, we present SVG-Bench, a comprehensive benchmark for evaluating SVG methods across multiple datasets and relevant metrics. Within this benchmark, we introduce novel datasets including SVG-Stack, a large-scale dataset of real-world SVG examples, and use it to pre-train StarVector as a large foundation model for SVGs. Our results demonstrate significant enhancements in visual quality and complexity handling over current methods, marking a notable advancement in SVG generation technology.
1
0
4
RT @CoLLAs_Conf: Ditch the traditional single-distribution learning and soak up some fresh perspectives in Montreal this summer! ๐Join us aโฆ
0
13
0