Shuai Kyle Zheng @bittnt profile

Shuai Kyle Zheng

@bittnt

Followers

1K

Following

15K

Statuses

483

Researcher at @cruise. Interests in: computer vision and machine learning. All opinions are my own.

California, USA

Joined May 2012

Shuai Kyle Zheng

@bittnt

2 months

@doomie @willccbb Another maybe important question: which 75% of the training samples are selected? Are they randomly selected?

1

0

Shuai Kyle Zheng

@bittnt

2 months

That's actually not true. It uses a JSON representation for all the data:
```
messages=[
    {"role": "user", "content": [
        {"type": "text", "text": prompt},
        {"type": "image_url", "image_url": {"url": f"data:image/png;base64,{base64_image}"}}
    ]}
],
```
The user has to encode the raw image as base64 and then decode the resulting base64 bytes into a UTF-8 string, e.g.
```
import base64

def encode_image(image_path):
    # Read the image in binary mode
    with open(image_path, "rb") as image_file:
        return base64.b64encode(image_file.read()).decode("utf-8")
```
I think this can later be read using OpenCV.
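A minimal sketch of the round trip described in this tweet, using only the Python standard library. The helper names `to_data_url` and `from_data_url` and the sample payload are hypothetical, chosen for illustration; the only detail taken from the tweet is the `data:image/png;base64,` prefix.

```python
import base64

# Same prefix as in the message payload quoted in the tweet.
PREFIX = "data:image/png;base64,"

def to_data_url(image_bytes: bytes) -> str:
    """Hypothetical helper: wrap raw image bytes in a base64 data URL."""
    return PREFIX + base64.b64encode(image_bytes).decode("utf-8")

def from_data_url(url: str) -> bytes:
    """Hypothetical helper: recover the raw image bytes from a data URL."""
    if not url.startswith(PREFIX):
        raise ValueError("not a PNG data URL")
    return base64.b64decode(url[len(PREFIX):])

# Stand-in payload: the 8-byte PNG signature, not a complete image.
sample = b"\x89PNG\r\n\x1a\n"
url = to_data_url(sample)
restored = from_data_url(url)
print(restored == sample)
```

The recovered bytes are the original file contents, so they could then be handed to an image decoder. With OpenCV that would likely be `cv2.imdecode` on a NumPy byte buffer, which fits the tweet's guess but is not exercised here.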

0

Shuai Kyle Zheng

@bittnt

3 months

@chriswolfvision Following your analogy, switching to Camtasia is like hopping onto an electric bike: straightforward, smooth, and still powerful enough to get you where you need to go without breaking a sweat. No need for pilot training, just start pedaling and enjoy the ride!

1

0

1

Shuai Kyle Zheng

@bittnt

4 months

@nntsn Congratulations on the new role! I know nothing about nuclear fusion. Which introductory machine learning papers related to nuclear fusion do you recommend reading? Or r u going to write one?

0

Shuai Kyle Zheng

@bittnt

5 months

@YiMaTweets I want to agree. But how can you prove that claim? One way it could go wrong is that the existing mathematical languages humans have invented are not sufficient to explain the progress made in AI through “engineering hacks”.

0

Shuai Kyle Zheng

@bittnt

8 months

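@mtrainier2020 I feel it's mainly because swift horses are common but a Bole (someone who can recognize them) is rare. Maybe here the Bole is the engineer. One of the hottest directions in AI right now, the diffusion model, is based on a paper about solving a differential equation, BDO Anderson 1982. For decades that paper had only a few dozen citations and nobody paid attention to it; after the image generation model Sora came out, its citations started to soar. Maybe working hard at calculus could one day be used to make Netflix movies.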

0

1

7

Shuai Kyle Zheng

@bittnt

8 months

Text2image is the coolest app in town! 🚀 Just when I thought trainable MRFs were irrelevant, they prove otherwise, making text2image inference efficient/cheaper! Check out this #CVPR2024 paper: MarkovGen: Structured Prediction with Trainable MRFs.

0

2

Shuai Kyle Zheng

@bittnt

8 months

Excited to head to Seattle for #CVPR Sun-Fri! Looking forward to connecting and chatting about the latest in foundational models and autonomous robotics. DM me to meet up. Hoping to catch up with old friends and make new ones!

0

3

Shuai Kyle Zheng

@bittnt

10 months

Love the Google Scholar PDF Reader for its quick access to paper abstracts while reading! It's great but lacks support for arXiv links like chrome-extension://xxx/ Currently, you must download and reopen the PDF to activate this feature.

0

1

Shuai Kyle Zheng

@bittnt

1 year

My toddler ate paper today. To convince him that paper isn’t food, I used #DALLE3 to generate some cute images. They’re quite convincing. Check them out! The second one isn’t as compelling as the first.

0

1

Shuai Kyle Zheng

@bittnt

2 years

@chriswolfvision Is this using the same trick as the good old integral image in HOG?

1

0

Shuai Kyle Zheng

@bittnt

2 years

I asked ChatGPT what its favorite CV/ML papers from before 2010 are. It mentioned: Eigenfaces for Recognition (1991), A Few Useful Things to Know About Machine Learning (1997), t-SNE (2008), Conditional Random Fields for Object Recognition (2005), and SIFT (1999).

1

0

4

Shuai Kyle Zheng

@bittnt

2 years

@ducha_aiki Definitely “A discriminatively trained, multiscale, deformable part model”, CVPR 2008.

0

1

3

Shuai Kyle Zheng

@bittnt

2 years

@eric_brachmann “He who fights with monsters should see to it that he does not become a monster in the process. And if you gaze long into an abyss, the abyss also gazes into you.” Lol

1

0

3

Shuai Kyle Zheng

@bittnt

2 years

@ftm_guney Very cool work! A bit disappointed that there is no discussion of things like unknown unknowns. It would be useful to have that uncertainty measure.

0

Shuai Kyle Zheng

@bittnt

2 years

Yes, at 8 USD per month, @elonmusk Twitter needs to have at least Markdown/LaTeX support, so that we can tweet code, equations, and more.

Ligeng Zhu

@LigengZhu

2 years

🤣everyone should switch from openreview to twitter-review

0

1

Shuai Kyle Zheng

@bittnt

2 years

@sqcai It is not totally new. People can become paid IEEE members for a fee of 210+ USD per year. That is 17.5 USD per month.

0

1

Shuai Kyle Zheng

@bittnt

2 years

I really like parrots, which can speak whatever language after some training. Maybe it is a path for AI. Check out the latest workshop @icdm2022, Foundation Models in Vision and Language, and our paper (w/ code).

0

2

Shuai Kyle Zheng

@bittnt

2 years

Holger Caesar from @tudelft gives a talk titled “Autonomous vehicles from imperfect and limited labels”. The talk introduces a new autonomous vehicle dataset called nuPlan.

1

0

3

Shuai Kyle Zheng

@bittnt

2 years

Yu Cheng from @MSFTResearch presents a talk titled “Towards data efficient vision-language (VL) models”. It covers methods such as FewVLM and Grounded-FewVLM.

0

1