Lin Tan @Lin0Tan profile

Lin Tan

@Lin0Tan

Followers

505

Following

942

Statuses

93

Elmore New Frontiers Professor @PurdueCS | Ex @Meta @UWaterloo @IllinoisCDS @MSFTResearch @IBMResearch | #SE #TextAnalytics #LLM4Code #AI #Security

Joined August 2024

Don't wanna be here? Send us removal request.

Lin Tan

@Lin0Tan

3 months

Can #LLMs replace developers? Introducing RepoCod-Lite 🐟 for faster evaluation to answer this: 200 of the toughest #RepoCod #code-generation tasks: - GPT-4o and other LLMs have < 10% accuracy/pass@1 on RepoCod-Lite tasks - Leaderboard - 67 repository-level, 67 file-level, and 66 self-contains tasks - Detailed problem descriptions (967 tokens) and long canonical solutions (918 tokens) - Dataset: Thanks to the great feedback from #swe-bench’s @OfirPress, here are some clarifications about #RepoCod (: Compared to #SWE-Bench, RepoCod tasks are - General code generation tasks, while SWE-Bench tasks resolve pull requests from GitHub issues - With 2.6X more tests per task (313.5 compared to SWE-Bench’s 120.8) Compared to #HumanEval, #MBPP, #CoderEval, and #ClassEval, RepoCod has 980 instances from 11 Python projects, with - Whole function generation - Repository-level context - Validation with test cases, and - Real-world complex tasks: longest average canonical solution length (331.6 tokens) and the highest average cyclomatic complexity (9.00) #LLMs #LLM4Code #security #codegen

Lin Tan

@Lin0Tan

3 months

Can language models replace developers? RepoCod says “Not Yet”, because GPT-4o and other LLMs have <30% accuracy/pass@1 on real-world method-level code generation tasks. Leaderboard #LLM4code #LLM #CodeGeneration #Security @cerias @PurdueScience

1

16

68

Lin Tan

@Lin0Tan

3 days

@anikbera @yiwu5cs @huyiran1007 @NanJiang719 @cerias @PurdueCS @ieee_ras_icra Thank you! @anikbera It was a fun and productive collaboration for interdisciplinary research!

0

1

Lin Tan

@Lin0Tan

3 days

@yiwu5cs @huyiran1007 @NanJiang719 @anikbera @cerias @PurdueCS @ieee_ras_icra 3/3 📊 Our experiments demonstrate SELP’s effectiveness across diverse tasks. In drone navigation, SELP outperforms state-of-the-art LLM planners by 10.8% in safety rate and by 19.8% in plan efficiency. For robot manipulation, SELP achieves a 20.4% improvement in safety rate.

0

2

Lin Tan

@Lin0Tan

3 days

@yiwu5cs @huyiran1007 @NanJiang719 @anikbera @cerias @PurdueCS @ieee_ras_icra 2/3 3️⃣ Domain-Specific Fine-Tuning: Customizes LLMs for specific robotic tasks, boosting both safety and efficiency.

0

1

Lin Tan

@Lin0Tan

3 days

@yiwu5cs @huyiran1007 @NanJiang719 @anikbera @cerias @PurdueCS @ieee_ras_icra 1/3 💡SELP has 3 key insights: 1️⃣ Equivalence Voting: Ensures robust translations from natural language instructions into LTL specifications. 2️⃣ Constrained Decoding: Uses the generated LTL formula to guide the inference of plans, ensuring the generated plans conform to the LTL.

0

1

Lin Tan

@Lin0Tan

4 days

@FSEconf This is awesome! Isn’t the new conference name just FSE 2025 now? :-)

1

0

3

Lin Tan

@Lin0Tan

11 days

2/3 For example, when performing tasks such as identifying attacks and malware, security analysts often only have access to assembly, i.e., the human-readable representation of binary code, which is extremely difficult to understand.

0

1

Lin Tan

@Lin0Tan

11 days

3/3 Thus, combined with the increasing sophistication of cybercrime that poses significant threats worldwide (e.g., cybercrime is predicted to cost the world $10.5 trillion annually by 2025 (Sausalito, 2020)), effective binary analysis techniques are in high demand.

0

1

Lin Tan

@Lin0Tan

11 days

1/3 Binaries are a form of code. Think about binaries when you work on LLM4Code. Why binaries and binary models? Binary code plays an irreplaceable role in crucial tasks including vulnerability detection, malware detection, binary recovery, and legacy software maintenance.

0

1

Lin Tan

@Lin0Tan

12 days

@AbhikRoychoudh1 Abhik, congratulations! Very well deserved!

0

2

Lin Tan

@Lin0Tan

18 days

RT @AbhikRoychoudh1: Shonan meeting 217 on Trusted Automatic Programming is currently underway in Japan this week! The discussions are buzz…

0

4

0

Lin Tan

@Lin0Tan

28 days

RT @sg1753: 📢📢📢 1st IEEE Intl. Conference on LLM-Aided Design (ICLAD) papers due Feb. 28th, 2025! We invite papers on a broad range of topi…

0

2

0

Lin Tan

@Lin0Tan

1 month

RT @chun_yang_chen: 🚨 Join @TU_Muenchen's School of CIT! We're hiring: 1️⃣ W3 Associate/Full Professor in Healthcare Robotics 2️⃣ Tenure Tr…

0

4

0

Lin Tan

@Lin0Tan

1 month

@timmenzies Tim, Congratulations! Well deserved!

0

2

Lin Tan

@Lin0Tan

1 month

RT @chun_yang_chen: 🚀 Call for Papers! 📢 Excited to announce our new special issue @emsejournal: 🎯 “When Software Security Meets #LLM: Opp…

0

5

0

Lin Tan

@Lin0Tan

1 month

RT @Kexin_Pei: The 8th Deep Learning Security and Privacy workshop co-located with IEEE S&P @IEEESSP May 15, 2025, San Francisco ( https://t…

0

9

0

Lin Tan

@Lin0Tan

2 months

RT @ManlingLi_: [Long Tweet Ahead] Faculty Interview Tips & Common Questions: 🧘‍♀️0. Firstly, do not be nervous - Almost everything can…

0

78

0

Lin Tan

@Lin0Tan

2 months

@LingmingZhang Thank you, Lingming!

0

2

Lin Tan

@Lin0Tan

2 months

@maxxbw54 Bowen, Thank you so much for hosting Nan!

0

1