Jiao Sun Profile
Jiao Sun

@sunjiao123sun_

Followers
2,990
Following
410
Media
47
Statuses
345

Research Scientist at Google Gemini
NLP PhD @ USC, Amazon ML Fellow
ex-{Google Brain, Alexa AI} nlper, IIIS Tsinghua-Ren

Joined September 2019
Pinned Tweet
@sunjiao123sun_
Jiao Sun
8 months
Generated images not following your prompt? Introducing 𝔻𝕣𝕖𝕒𝕞𝕊𝕪𝕟𝕔 from @GoogleAI : improving alignment + aesthetics of image generation models with feedback from VLMs! ✅ Model Agnostic ✅ Plug and Play ❌ RL ❌ Human Annotation ❌ Real Image
Tweet media one
5
70
330
@sunjiao123sun_
Jiao Sun
26 days
Honored to receive the 🥇BEST PAPER AWARD🥇 from CVPR 2024, please consider using our collected fine grained feedback! Huge shout out to our work DreamSync, the key method that we use for using the fine grained feedback to improve the model, details in my pinned tweet! 🚀
@sunjiao123sun_
Jiao Sun
1 month
🌟Rich Human Feedback for Text-to-Image Generation selected as CVPR 2024 Best Paper Award Candidate (top 1%)🌟 Current text-to-image models are not perfect, but where exactly? They suffer from artifacts, alignment and aesthetics. We collect feedback on 18K images to capture all
Tweet media one
3
27
226
20
26
594
@sunjiao123sun_
Jiao Sun
8 months
Can LLMs generate exactly 5 words? No How about 5 sentences? No How about 5 paragraphs? No 🤷🏻‍♀️ In , we evaluate the performance of LLMs on various controlled generation tasks including numerical planning, story generation, paraphrase generation, etc. (1/n)
14
81
418
@sunjiao123sun_
Jiao Sun
6 months
Today I defended my thesis and became Dr. Sun! 🌞 Thank you my committee members @MaxMa1987 @VioletNPeng @jonathanmay @emilio__ferrara and Dan O’Leary! The slides of my presentation are here: . Ph.D done but research never ends! Fight on!
Tweet media one
Tweet media two
Tweet media three
39
3
316
@sunjiao123sun_
Jiao Sun
1 month
🌟Rich Human Feedback for Text-to-Image Generation selected as CVPR 2024 Best Paper Award Candidate (top 1%)🌟 Current text-to-image models are not perfect, but where exactly? They suffer from artifacts, alignment and aesthetics. We collect feedback on 18K images to capture all
Tweet media one
3
27
226
@sunjiao123sun_
Jiao Sun
2 months
Thanks @CSatUSC for capturing this one of the most important moments of my life! Thanks for my family and my dearest advisor @MaxMa1987 for making it happen! #PhD
Tweet media one
17
5
213
@sunjiao123sun_
Jiao Sun
3 years
A team of collaborators from ALL different institutes? 5 female researchers + 1 high school student? I am excited that our fairness work "Pretty Princess vs. Successful Leader: Gender Roles in Greeting Card Messages" is conditionally accepted by #CHI2022 ! Stay tuned for details!
Tweet media one
3
13
180
@sunjiao123sun_
Jiao Sun
1 year
After being four-year LinkedIn-less, I’m finally back! Let’s connect and chat if you: - are hiring — have an opening that I might be a fit! - are graduating, let’s go through the job searching together! - know me or my work! - just want to know me!
6
10
166
@sunjiao123sun_
Jiao Sun
2 years
Wouldn't it be a 🌩️DISASTER if evaluation metrics always rate American English 10 times better than Indian English? ⚠️We (🔗) study the dialect robustness systematically, find that current evaluation metrics are NOT robust to dialects🤯, and propose NANO🧵
Tweet media one
6
28
137
@sunjiao123sun_
Jiao Sun
2 years
#chi2022 Best paper honorable mention!! OMG?! thanks again my wonderful collaborators! esp. @tongshuangwu @YueJiang_nj @VictoriaLinML @Diyi_Yang ! See y’all at New Orleans!
Tweet media one
6
7
127
@sunjiao123sun_
Jiao Sun
3 years
Can we paraphrase sentences into desirable syntactic structures? How to select proper syntactic parses that can properly guide paraphrase generation? 🤔 Our #EMNLP2021 paper AESOP (w/ @MaxMa1987 @VioletNPeng ) proposes an adaptive way to retrieve compatible parses! 😎(1/6)
Tweet media one
1
14
84
@sunjiao123sun_
Jiao Sun
2 months
I’m working on @eccvconf rebuttal, and here’s one of the reviews: “The reliance on training data may raise concerns about the model's generalizability to unseen prompts and scenarios.” How should I rebut this? 🥲 I’m so speechless right now…
16
4
81
@sunjiao123sun_
Jiao Sun
3 years
While #Wikipedia has been a great resource for knowledge, implicit biases can be subtle and detrimental. In our new #ACL2021 paper (w/ @VioletNPeng ), we found that #Wikipedia pages intermingle professional career events with personal events in a systematically biased way. 1/5
Tweet media one
3
12
68
@sunjiao123sun_
Jiao Sun
2 years
🤶 Pretty Princess vs. Successful Leader? Have you ever sent someone greeting cards? People write greeting card messages out of goodwill, but gender stereotypes in these messages may be enforced without being noticed! Check out our #chi2022 work for a systematic analysis! (1/n)
Tweet media one
1
10
66
@sunjiao123sun_
Jiao Sun
11 months
One reviewer emotionally lowered their score after a round of discussion without saying anything technical 😢 What should we do? #emnlp2023
9
0
63
@sunjiao123sun_
Jiao Sun
10 months
Tweet media one
2
0
47
@sunjiao123sun_
Jiao Sun
2 years
CFP of ACL 2023 is out! Ddl of direct submission is **Jan 20th**! As always, bunch of other deadlines to keep an eye out. Check it out!
1
13
38
@sunjiao123sun_
Jiao Sun
1 year
I will be at ACL next week to present this work! Look forward to connecting with folks who work on evaluation, data and beyond! HMU if any of these sounds interesting to you! DMs are open
@sunjiao123sun_
Jiao Sun
2 years
Wouldn't it be a 🌩️DISASTER if evaluation metrics always rate American English 10 times better than Indian English? ⚠️We (🔗) study the dialect robustness systematically, find that current evaluation metrics are NOT robust to dialects🤯, and propose NANO🧵
Tweet media one
6
28
137
1
3
36
@sunjiao123sun_
Jiao Sun
2 years
I enjoyed the interview with Amazon a lot! It is not only a summary about my experience in natural language generation, but also a deep conversation about how my works connect and contribute to the community! Read to learn more about me, my Amazon internship and more! 👇
@AmazonScience
Amazon Science
2 years
Can AI help an aspiring author write a novel? Could machines learn how to make jokes? Inspired by these questions, Jiao Sun has been exploring the potential of AI-generated text. Now, as an Amazon ML Fellow, she's hoping to develop her research further. #ConvAI #NLProc
2
9
58
4
3
34
@sunjiao123sun_
Jiao Sun
2 years
Sebastian was my internship mentor for 6 months. He taught me everything including technicals, how to write a better paper and collaborate with others more efficiently! If you want to have a lifelong mentor and do great NLP research, I don’t see any reason why you wouldn’t apply!
@sebgehr
Sebastian Gehrmann
2 years
My group is hiring interns for summer 2023. If you are a current PhD student and interested, please email me. Info on internship topics: There are also multiple open full-time roles in AI Engineering - feel free to reach out :)
5
61
289
1
1
33
@sunjiao123sun_
Jiao Sun
7 months
My awesome co-first author Deqing is looking for a research internship opportunity this summer. He’s one of the most fast-paced researchers I’ve seen in these years! We would appreciate it if you can send him a DM if you are recruiting interns working on LLM/Large Vision Models!
@DeqingFu
Deqing Fu
8 months
🚨New paper alert🚨 With 𝔻𝕣𝕖𝕒𝕞𝕊𝕪𝕟𝕔, large language models (LLMs), vision-language models (VLMs), and text-to-image (T2I) models 𝕊𝕪𝕟𝕔 together! They interactively and iteratively improve alignments and aesthetics of T2I models. No RL needed. No human annotation
2
7
92
0
3
29
@sunjiao123sun_
Jiao Sun
2 years
Are you excited about pun generation? In #EMNLP2022 , we have two works accepted in the main conference: 1️⃣ Context-Situated Pun Generation 👉 a brand-new task! 2️⃣ ExPUNations: Augmenting Puns with Keywords and Explanations 👉 a new dataset! Learn more! 🧵👇
2
6
29
@sunjiao123sun_
Jiao Sun
2 years
What does it mean for a generative AI model for code to be explainable? My internship work at IBM Research investigated the XAI need under 3 scenarios: code translation, code autocompletion and natural language to code. to appear at #IUI2022 #HCI 😏 (1/n)
2
5
27
@sunjiao123sun_
Jiao Sun
2 years
Wow thanks for the nice words and I think EVERY modeling work from the creative generation community should really think about having context as the constraint for the generation!
@liyucheng_2
Yucheng Li
2 years
When I had the idea of Pun Generation one year ago, i told myself it is not going to be possible. Until i saw this in #EMNLP2022 from the incredible author @sunjiao123sun_ . So exciting to see creative language generation paper in our community!
Tweet media one
1
2
21
0
0
27
@sunjiao123sun_
Jiao Sun
3 years
Thanks @QVeraLiao for the warm welcome! A super late announcement: I will be doing a research internship @IBMResearch on code generation! The great combination of my beloved text generation and Human-AI collaboration! Saying I’m excited would be a massive understatement! 💪💯
@QVeraLiao
Vera Liao
3 years
same 🎉welcome @sunjiao123sun_ to the team!
0
0
5
3
0
27
@sunjiao123sun_
Jiao Sun
2 years
Would NLU models trained on EN-US generalize well to EN-IN (Indian English)/ EN-GB (British English)? I am thinking about exploring the transferability of models between dialects. Does anyone here know some good datasets for this task? 🙏
4
2
27
@sunjiao123sun_
Jiao Sun
2 years
We ⚠️Investigate the Benefits of Free-form Rationales in our #EMNLP2022 findings work, from both the human and the model perspectives. For humans, do rationales aid human interpretability? For models, do rationales boost the model performance? (0/n)
Tweet media one
2
8
25
@sunjiao123sun_
Jiao Sun
8 months
LLMs just cannot count and generate exactly the number of words that we are asking for! With 7 being the magic number that models start to struggle with! (3/n)
Tweet media one
2
1
21
@sunjiao123sun_
Jiao Sun
1 year
Tu has been my amazing Google internship mate, close friend and life mentor. Can’t wait to see what he will achieve at his Google & VT adventure! All the best Tu!
@tuvllms
Tu Vu
1 year
I successfully defended my Ph.D. thesis. A special thank you to the members of my thesis committee: my wonderful advisor @MohitIyyer , @MajiSubhransu , @HamedZamani , @lmthang , and @colinraffel for their insightful feedback and advice on my research and career plans.
19
6
137
1
0
20
@sunjiao123sun_
Jiao Sun
1 year
Honored to be part of the efforts. Check out the magic that LIMA can achieve with only 1000 prompts! Our human eval resonates with GPT-4 eval showing how humans prefer LIMA over/on-par with other LLMs!
@sriniiyer88
Srini Iyer
1 year
New paper! Fine-tuning on just 1000 carefully selected prompts and responses produces a surprisingly strong chatbot model!
0
0
15
0
0
18
@sunjiao123sun_
Jiao Sun
1 year
Presenting this from 11:00-12:30 today! Come chat with me at the poster session!
@sunjiao123sun_
Jiao Sun
2 years
Wouldn't it be a 🌩️DISASTER if evaluation metrics always rate American English 10 times better than Indian English? ⚠️We (🔗) study the dialect robustness systematically, find that current evaluation metrics are NOT robust to dialects🤯, and propose NANO🧵
Tweet media one
6
28
137
0
1
15
@sunjiao123sun_
Jiao Sun
2 years
I’m excited to see awesome things #chatGPT can do, but we need to make sure it’s not producing gobbledygook that seems to be right — it is misleading and can be harmful as knowledge query. What is needed to explain generative models? Re-sharing our work:
@Thom_Wolf
Thomas Wolf
2 years
@geetkhosla because of behaviors like this 👇: super convincing yet plain and fully wrong. almost went back to my trigonometry book to check...
Tweet media one
9
4
52
2
4
15
@sunjiao123sun_
Jiao Sun
3 years
I'm in #gradcohort2021 organized by amazing @CRA_WP ! I've been enjoying the event a lot as it provides a platform for us female PhD students to connect and support each other! If you are here as well, feel free to drop me an email and we should talk!
1
0
15
@sunjiao123sun_
Jiao Sun
8 months
The key recipes of DreamSync are: 1. Diverse text prompts from LLMs 2. VQA feedback (TIFA score) for alignment and VILA feedback for aesthetics 3. Rejection Sampling with feedback 4. LoRA Fine-tuning 5. Multiple Iterations (2/n)
1
2
15
@sunjiao123sun_
Jiao Sun
8 months
In total, we include five controlled generation tasks, we show a spectrum of abilities of LLMs. They are good at: constrained content generation (e.g., sentiment), story generation, rationale generation! Bad at: numerical planning and paraphrase generation! (4/n)
Tweet media one
1
0
14
@sunjiao123sun_
Jiao Sun
2 years
Thanks for featuring my work with @QVeraLiao and all other colleagues at IBM Research. There has been an increasing effort around generative AI, and our work outlines what explainability would benefit users who will be using the models!
@censius
Censius
2 years
Generative AI is taking the industry by storm & seeing how it has become a niche of its own, How can we make Generative AI Models Explainable?🤔 This paper by @sunjiao123sun_ attempts to make Code-based GenAI Models explainable, let's break it down. 🧵
1
1
7
0
5
14
@sunjiao123sun_
Jiao Sun
9 months
Great work by @DeqingFu that could potentially be used for data augmentation and challenging current models!
@DeqingFu
Deqing Fu
9 months
Excited to share our self-labeled counterfactual paper @emnlpmeeting #EMNLP2023 with @ameya_godbole1 and @robinomial : we develop an automated procedure that generates hard negative examples (e.g., subtle unanswerable questions) from positive examples (e.g. answerable examples).
Tweet media one
1
5
29
0
0
14
@sunjiao123sun_
Jiao Sun
3 months
Welcome Logan!
@OfficialLoganK
Logan Kilpatrick
3 months
Excited to share I’ve joined @Google to lead product for AI Studio and support the Gemini API. Lots of hard work ahead, but we are going to make Google the best home for developers building with AI. I’m not going to settle for anything less.
577
188
5K
0
0
13
@sunjiao123sun_
Jiao Sun
7 months
Congratulations! If you are interested in decoding methods for generation, please check out the paper: . The look back decoding method automatically removes potential failures, repetitions and topic drifting from the decoding steps!
@xunannancy
Nan Xu
7 months
🌟Thrilled to share that our paper "Look-back Decoding for Open-Ended Text Generation" won the Outstanding Paper Award at EMNLP2023! Immense gratitude to anonymous reviewers and to my incredible collaborators @violet_zct , @real_asli and @MaxMa1987 . #EMNLP2023
Tweet media one
Tweet media two
Tweet media three
2
5
33
0
0
12
@sunjiao123sun_
Jiao Sun
2 years
@ReviewAcl so will the April 15th review cycle be 4 weeks or 6 weeks? It is important as many of us want the reviews back before EMNLP’s May 24 decision deadline if submitting to softconf. Btw, not a big fan of “surprise” announcements 📣🥲
Tweet media one
3
2
12
@sunjiao123sun_
Jiao Sun
4 months
The deadline is around the corner, please consider voting for Kai-Wei! Please search for “sigdat elections” in your email inbox and it should take less than two minutes to vote! Your support is greatly appreciated! ❤️
@kaiwei_chang
Kai-Wei Chang
4 months
I am honored to be nominated by SIGDAT (the org that oversees EMNLP) to run for VP-elect with other awesome candidates who share the goal of improving our community. Please check your email to vote by 3/24.🗳️ See details:
Tweet media one
3
36
135
0
1
12
@sunjiao123sun_
Jiao Sun
2 years
Congrats! Need to read this! 📝
@sarahookr
Sara Hooker
2 years
Our work on "Intriguing Properties of Compression on Multilingual Models" has been accepted to EMNLP 2022. A collaboration led by Kelechi Ogueji w @orevaahia @lekeonilude , @sebgehr , @KreutzerJulia . 🎉🔥 Great news to hear at the end of a long two weeks of travel.
3
15
139
0
0
11
@sunjiao123sun_
Jiao Sun
1 year
It’s interesting to see how regional stereotypes got reflected in LLM just by adding the country tag in the prompts! Awesome work led by @esindurmusnlp !
@AnthropicAI
Anthropic
1 year
We develop a method to test global opinions represented in language models. We find the opinions represented by the models are most similar to those of the participants in USA, Canada, and some European countries. We also show the responses are steerable in separate experiments.
99
173
799
0
3
11
@sunjiao123sun_
Jiao Sun
3 years
Efficient transformer architecture from Max!! 🎉 Check it out!
@MaxMa1987
Xuezhe Ma (Max)
3 years
Thrilled to share our #NeurIPS2021 work! "Luna: Linear Unified Nested Attention". This is a new linear time transformer architecture that achieves competitive results across multiple benchmarks. co-authors: @XiangKong4 @sinongwang @violet_zct @jonathanmay @gabema @LukeZettlemoyer
Tweet media one
Tweet media two
1
8
50
0
2
11
@sunjiao123sun_
Jiao Sun
10 months
Mark is extremely nice, work with him! 📣
@mdredze
Mark Dredze
10 months
I'm looking for a postdoc! Topics: LLMs, text generation, QA, medical NLP. Join two amazing postdocs in my group: @hanjie_chen (phd @CS_UVA , incoming prof @RiceCompSci ) and @sharonlevy21 (phd @ucsbNLP , incoming prof @RutgersU ) Apply: 🙏retweet Questions?
Tweet media one
5
56
133
0
0
10
@sunjiao123sun_
Jiao Sun
8 months
This #EMNLP2023 work is co-lead by four students from USC, UCLA and ETH @xunannancy @yufei_t @wangchunshu Awesome collaborators @johnwieting2 Rahul and Qian and of course amazing advisors @MaxMa1987 and @VioletNPeng ! We welcome all kinds of feedback and discussions! (n/n)
0
0
10
@sunjiao123sun_
Jiao Sun
7 months
I sadly cannot make it to EMNLP, but please talk to @yufei_t about our work, especially about numerical planning! A lot of people have reached out about code release, we are sorry for the delay and are working on this. The first release of our input and output will come very soon! :)
@sunjiao123sun_
Jiao Sun
8 months
Can LLMs generate exactly 5 words? No How about 5 sentences? No How about 5 paragraphs? No 🤷🏻‍♀️ In , we evaluate the performance of LLMs on various controlled generation tasks including numerical planning, story generation, paraphrase generation, etc. (1/n)
14
81
418
0
1
10
@sunjiao123sun_
Jiao Sun
8 months
Finally, Human annotators also agree that DreamSync aligns to texts better than SDXL. (7/n)
Tweet media one
1
0
10
@sunjiao123sun_
Jiao Sun
2 years
@mark_riedl Well, I really want to self-recommend my two pun generation papers that are gonna appear at emnlp 2022, but I’m pretty sure they are not the “best” 😑 how about checking AmbiPun first! By @yufei_t
1
0
10
@sunjiao123sun_
Jiao Sun
2 years
As my EMNLP trip is coming close, I wonder if there is a list of people who will be attending in person so that I don’t need to stalk everyone’s twitter? @emnlpmeeting if not, I’m happy to start one that people who want to connect can put down their name and websites 👩🏻‍💻
2
1
10
@sunjiao123sun_
Jiao Sun
8 months
Among all the tasks, Numerical Planning Benchmark (NPB) is the most intuitive task where LLMs are asked to generate sentences matching exact numerical constraints, such as count of words/syllables. We got the motivation from the real-world scenarios such as creative writing. 2/n
Tweet media one
1
0
9
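The exact-count constraint described above can be checked mechanically. The following is a minimal sketch, not the benchmark's actual evaluation code: the function names, the whitespace-based word definition, and the sample outputs are all assumptions made for illustration.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated tokens that contain at least one word character.

    Assumption: a "word" is any such token; the real benchmark may tokenize differently.
    """
    return sum(1 for tok in text.split() if re.search(r"\w", tok))


def meets_word_constraint(text: str, target: int) -> bool:
    """Return True only when the text has exactly `target` words."""
    return count_words(text) == target


# Hypothetical model outputs for the instruction "write a sentence with exactly 6 words".
outputs = ["The cat sat on the mat.", "Too short."]
success_rate = sum(meets_word_constraint(o, 6) for o in outputs) / len(outputs)
```

Under these assumptions, the first sample satisfies the constraint and the second does not, so the success rate over the two samples is one half.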
@sunjiao123sun_
Jiao Sun
2 years
The work is done during my internship at Google Research with my hosts @sebgehr @jacobeisenstein , and my awesome collaborators @ThiboIbo @eaclark07 @tuvuumass @TDozat @dhgarrette @adisid01 ! Discussion is more than welcome! Happy Thanksgiving! 🦃🍁
0
1
9
@sunjiao123sun_
Jiao Sun
1 month
Finally, we show that the predicted rich human feedback can be leveraged to improve image generation quality. Following the same recipe as in DreamSync, we use the rich human feedback to select high-quality training data to finetune and improve the generative models! (4/n)
Tweet media one
1
0
8
@sunjiao123sun_
Jiao Sun
3 years
Thank you Nedjma so much for liking our work and such a wonderful summarization! 💯 We hope that you enjoyed our talk, and we would love to have/spike more discussion about event fairness in the community!
@nedjmaou
Nedjma Ousidhoum نجمة أوسيدهم
3 years
#ACL2021NLP Session:4E Ethics in NLP "Men Are Elected, Women Are Married: Events Gender Bias on Wikipedia" by @sunjiao123sun_ and @VioletNPeng Paper Presentation #ACL2021EN 1/n
1
2
7
1
0
8
@sunjiao123sun_
Jiao Sun
2 years
You should catch me at the conference if you are attending in person! 👇 1️⃣ Context-Situated Pun Generation (Dec 9th 16:00-17:30 @ Atrium) 2️⃣ ExPUNations: Augmenting Puns with Keywords and Explanations (Dec 11th 15:30-17:00 @ Atrium) Look forward to seeing many of you there!
@sunjiao123sun_
Jiao Sun
2 years
Are you excited about pun generation? In #EMNLP2022 , we have two works accepted in the main conference: 1️⃣ Context-Situated Pun Generation 👉 a brand-new task! 2️⃣ ExPUNations: Augmenting Puns with Keywords and Explanations 👉 a new dataset! Learn more! 🧵👇
2
6
29
0
0
8
@sunjiao123sun_
Jiao Sun
3 years
Looking for a high-quality QA dataset for event-centric reasoning? You definitely don’t want to miss out ESTER with **FIVE** event relation types! We are looking forward to seeing everyone’s great efforts on solving this challenging task! 💪💪
@HanRujun
Rujun Han
3 years
(1/5) Introducing our #EMNLP21 paper “ESTER: A Machine Reading Comprehension Dataset for Event Semantic Relation Reasoning.” We invite everyone interested in event-centric reasoning to test your models on ESTER and submit results to our leaderboard:
Tweet media one
Tweet media two
1
6
31
0
0
8
@sunjiao123sun_
Jiao Sun
2 years
A bit surprised, but this is important for folks who are having hard time deciding between ACL and EACL. Also, EACL anonymity deadline is October 13th, it sounds like a good combo of arxiv + EACL + ACL
@eaclmeeting
eaclmeeting
2 years
[1/3] Cross-submission policy with ACL 2023: As the #EACL2023 notification deadline and #ACL2023 submission deadline are unfortunately on the same day, you may submit your paper to ACL 2023 while it is still under review at EACL 2023. Keep reading...
2
14
69
2
3
7
@sunjiao123sun_
Jiao Sun
8 months
This work is co-led by @DeqingFu and @huyushi98 ! With great collaborators: Su Wang, @RoyiRassin , Da-Cheng Juan, Dana Alon, Charles Herrmann, @vansteenkiste_s @RanjayKrishna and @CyrusRashtchian ! Discussion and feedback are more than welcome!
0
0
6
@sunjiao123sun_
Jiao Sun
3 years
Congrats on the fine work @yufei_t ! Actually AESOP, my EMNLP work on paraphrasing contributes to converting the generated hyperboles to more natural expressions! This is a great use case showing how much paraphrasing can help! Please keep tuned with my new post about AESOP!
@yufei_t
Yufei Tian @ NAACL
3 years
Is generating hyperboles easy? Our machine says yes! Check our new #EMNLP2021 Findings paper "HypoGen: Hyperbole Generation with Commonsense and Counterfactual Knowledge" with Arvind and @VioletNPeng !🧾 Code and data coming soon!
Tweet media one
3
2
17
0
0
7
@sunjiao123sun_
Jiao Sun
2 years
Thanks Vera! Please swing by my talk — I look forward to talking to folks interested in Code Generation + explainability! It will happen on Weds March 23rd around 9:20am EDT. 🤓
@QVeraLiao
Vera Liao
2 years
Trying to attend as many #IUI2022 sessions I can this week. Looking forward to catching up! If you are at IUI, check out the XAI session on Wednesday and @sunjiao123sun_ 's talk on "Investigating Explainability of Generative Models for Code through Scenario-based Design"😇
0
1
20
0
0
7
@sunjiao123sun_
Jiao Sun
1 month
Awesome collaboration with our student intern lead Yowei Liang from UCSD, Junfeng He, Gang Li, Peizhao, Arseniy, @N_Carolan +all other Google folks who are not on X at all 🤣. Feedback and discussions are absolutely welcome! (n/n)
0
0
6
@sunjiao123sun_
Jiao Sun
8 months
First, where did we get the prompts for training? We utilize LLM’s creativity (i.e., PaLM 2 for us)! Check out the qualitative examples as a glimpse of the diverse prompts in our training, which sets the solid foundation of DreamSync’s performance.  (3/n)
Tweet media one
1
0
7
@sunjiao123sun_
Jiao Sun
3 years
Paper link: . Live demo at , and talk to us in Session 10F (Tue 18:20 PDT)! #NAACL2021 👀🙌
0
0
7
@sunjiao123sun_
Jiao Sun
8 months
See the qualitative examples below about how DreamSync iteratively improves text-image alignment after each iteration! (4/n)
Tweet media one
1
0
7
@sunjiao123sun_
Jiao Sun
8 months
We also evaluate the performance of DreamSync on two benchmarks for both the text faithfulness and visual appeal. DreamSync performs the best among all the methods for textual faithfulness! (5/n)
Tweet media one
1
0
6
@sunjiao123sun_
Jiao Sun
1 month
With the collected data, we train a multimodal transformer to predict the rich feedback (plausibility/ alignment/aesthetics scores) automatically. Our model greatly outperforms (w and wo finetuning) CLIP in terms of correlation coefficients on our test set. (3/n)
Tweet media one
1
0
6
@sunjiao123sun_
Jiao Sun
2 years
Awesome work from Thibault and team! If you are working on TTS, this metric would greatly help with auditing the quality! Check it out!
@ThiboIbo
Thibault Sellam
2 years
1/N Tired of listening to your multilingual TTS models? SQuId 🦑 is an automatic metric for multilingual speech synthesis: give it a waveform, it predicts how natural it sounds. To develop the model we gathered 1.9 Million listening tests in 65 locales.
Tweet media one
5
28
101
0
0
6
@sunjiao123sun_
Jiao Sun
1 month
From the annotation example, you can see that we not only 1) mark the image regions that are misaligned or implausible, but also 2) provide which words in the text prompts are misrepresented or missing! (2/n)
Tweet media one
1
0
6
@sunjiao123sun_
Jiao Sun
2 years
Both works are done during my internship @AmazonScience with awesome @VioletNPeng @anjalisaa @shrnrby @Ale_Cervone @iuaaui Yang Liu, Tagyoung Chung and Jing Huang!
1
0
6
@sunjiao123sun_
Jiao Sun
3 years
Come and say HI! 👋 🤩
@uclanlp
uclanlp
3 years
Join us for the 12:30-12:30 AST poster session on 11/8! @sunjiao123sun_ will present our work on adaptive syntactically controlled paraphrase generation. Joint work w/ @MaxMa1987 . She had a more interesting introduction 👇👇👇
1
0
1
0
0
5
@sunjiao123sun_
Jiao Sun
2 years
In summary, yes, rationales can help with both human interpretability and model performance, but with a lot of caveats that people should take care of before getting to any conclusion! I will present this poster tomorrow #BlackboxNLP (Dec 8th) 11:00-12:30 at Mezzanine and Hall!
1
0
5
@sunjiao123sun_
Jiao Sun
8 months
As an iterative approach, we also see the progressive improvement after each iteration quantitatively, both for text faithfulness and aesthetics. (6/n)
Tweet media one
1
0
5
@sunjiao123sun_
Jiao Sun
2 years
@MaxMa1987 Thanks Max!
0
0
5
@sunjiao123sun_
Jiao Sun
3 years
@WikiResearch @USC @WikiWomenInRed Thanks for tagging! We hope our work brings awareness of potential event gender biases in knowledge sources (e.g., Wikipedia! I personally use it everyday 🥸), and urges Wikipedia contributors to be cautious when contributing to the pages! Check out my pinned Tweet for more!
0
0
4
@sunjiao123sun_
Jiao Sun
2 years
You should catch me at the conference if you are attending in person! 👇 1️⃣ Context-Situated Pun Generation (Dec 9th 16:00-17:30 @ Atrium) 2️⃣ ExPUNations: Augmenting Puns with Keywords and Explanations (Dec 11th 15:30-17:00 @ Atrium) Look forward to seeing many of you there!
1
0
4
@sunjiao123sun_
Jiao Sun
1 year
@sebgehr My tears 🥹 thank you!!
0
0
4
@sunjiao123sun_
Jiao Sun
2 years
This work is conducted with my amazing mentor @QVeraLiao and @mayankagarwal__ , together with expert @michael_muller , Stephanie Houde, @kr_t and fabulous manager Justin Weisz ( @gratefulspam (🧐)) ! Please check out our paper for more details, and HMU if you want to discuss! ❤️
1
0
4
@sunjiao123sun_
Jiao Sun
2 years
In addition, our experiments show that NANO also helps improve the metric performance on the standard metric benchmarks!! You should use our metric if you are evaluating dialectal texts and want a more fair judgment! (6/n)
Tweet media one
1
0
4
@sunjiao123sun_
Jiao Sun
2 years
We investigate XAI needs for generative AI models for code through scenario design. More specifically, we conducted 9 workshops with 43 software engineers using **real examples** from state-of-the-art generative AI models to elicit users' explainability needs! (2/n)
Tweet media one
1
1
4
@sunjiao123sun_
Jiao Sun
2 years
We ask two questions: 1️⃣ HOW MUCH do dialect rewrites improve the metric value over semantic perturbations? 2️⃣ HOW OFTEN do dialect rewrites score higher than semantic perturbations? We find that existing metrics BLEURT, Prism, YiSi, BLEU, chrF struggle at both. (2/n)
Tweet media one
1
0
4
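The two questions above map onto two simple statistics over paired metric scores. This is a sketch under stated assumptions, not the paper's code: the function name and the example scores are hypothetical, and each pair is assumed to hold the metric's score for a dialect rewrite and for a semantic perturbation of the same source sentence.

```python
from statistics import mean


def dialect_robustness(dialect_scores, perturbed_scores):
    """Return (average margin, success rate) over paired metric scores.

    average margin: HOW MUCH dialect rewrites outscore semantic perturbations.
    success rate:   HOW OFTEN a dialect rewrite scores strictly higher.
    """
    if not dialect_scores or len(dialect_scores) != len(perturbed_scores):
        raise ValueError("score lists must be non-empty and of equal length")
    margins = [d - p for d, p in zip(dialect_scores, perturbed_scores)]
    return mean(margins), sum(m > 0 for m in margins) / len(margins)


# Hypothetical scores from some evaluation metric for three examples.
avg_margin, success_rate = dialect_robustness([0.75, 0.5, 0.25], [0.5, 0.75, 0.0])
```

A robust metric would show a positive average margin and a success rate close to 1; in the made-up example, the dialect rewrite wins in two of the three pairs.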
@sunjiao123sun_
Jiao Sun
2 years
Ideally, dialects that share the same semantics should get the exact same score! But this is too strict and can be easily violated. We introduce semantic perturbation, and define relaxed dialect robustness as: dialects should score higher than semantic perturbations! (1/n)
Tweet media one
1
1
4
@sunjiao123sun_
Jiao Sun
3 years
Always a pleasure to work together with these wonderful researchers! 😊
@VioletNPeng
Violet Peng
3 years
Just had a 1-hour meeting with such a diverse group of undergrad, graduate, and post-doctoral researchers in CS. Guess what topic we discussed? w/ @ewsheng @jieyuzhao11 @JiaosunT @sunipa17 @houyu0939 @ovalle_elia @mattiesansev @kaiwei_chang Jinn Kim
Tweet media one
4
3
48
0
0
4
@sunjiao123sun_
Jiao Sun
3 years
Thanks for tagging! We hope our work brings awareness of potential event gender biases in knowledge sources (e.g., Wikipedia! I personally use it everyday 🥸), and urges Wikipedia contributors to be cautious when contributing to the pages! Check out my pinned Tweet for more!
@WikiResearch
WikiResearch
3 years
"Men Are Elected, Women Are Married: Events Gender Bias on #Wikipedia " event-centric study of gender biases on a large English Wikipedia corpus shows that personal life related events are more likely to appear for females than males. (Sun et al, 2021)
Tweet media one
6
90
175
0
2
4
@sunjiao123sun_
Jiao Sun
2 years
We then propose NANO, a pretraining schema to distill the dialectal information into training. We take mC4, the pretraining corpus of mT5, distill a LangID model, and use the dialectal information from the LangID model for second-stage pretraining an evaluation metric. (3/n)
Tweet media one
1
0
3
@sunjiao123sun_
Jiao Sun
3 years
Dear @emnlpmeeting , could you please clarify if the deadline for putting an anonymous manuscript is April 17th 23:59 AOE or not? Thanks a lot!
0
0
3
@sunjiao123sun_
Jiao Sun
2 years
We find that people tend to talk about achievement and career for males but appearance and domestic topics for females. Using WEAT scores, we find AI (GPT-2) generated greeting card messages further amplify such stereotypes!! 🥲 Check out techniques below: (2/n)
Tweet media one
1
0
2
@sunjiao123sun_
Jiao Sun
2 years
@BlancheMinerva You can probably refer to what we did in our work. We took the mC4 corpus, got the region information from the URL (.in), combined it with the language identification model output (English), and used that text as en-IN, aka Inglish. This is a rough approximation but benefits at scale
0
1
3
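The filtering heuristic described in this reply can be sketched roughly as follows. This is an illustration only, not the authors' pipeline: the function name, the `"en"` label format, and the sample documents are assumptions, and the language label is taken as given rather than produced by a real language identification model.

```python
from urllib.parse import urlparse


def is_en_in_candidate(url: str, predicted_lang: str) -> bool:
    """Approximate en-IN text: the host ends in .in and the LangID label is English.

    Assumption: `predicted_lang` is a label such as "en" from a separate model.
    URLs without a scheme have no parsed hostname and are rejected.
    """
    host = urlparse(url).hostname or ""
    return host.endswith(".in") and predicted_lang == "en"


# Hypothetical (url, predicted language) pairs standing in for corpus documents.
docs = [
    ("https://example.co.in/news", "en"),
    ("https://example.co.in/samachar", "hi"),
    ("https://example.com/in", "en"),
]
en_in_docs = [url for url, lang in docs if is_en_in_candidate(url, lang)]
```

Only the first sample passes both conditions. As the reply itself notes, this is a rough approximation that relies on scale rather than per-document accuracy.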
@sunjiao123sun_
Jiao Sun
2 years
ExPUNations: Augmenting Puns with Keywords and Explanations 🧵 Humor understanding and generation are challenging even for humans! e.g., to get the funniness of the pun "the sushi said to the bee, wasabi!" it requires the commonsense that wasabi often goes with sushi! (0/2)
Tweet media one
1
0
3
@sunjiao123sun_
Jiao Sun
2 years
@BlancheMinerva It depends on if you want a very accurate or coarse level of Inglish. For the analysis part of our Inglish dataset — we use the dataset from and I think this is probably the best that you can refer to! If you just want an approximation, (to be continued)
0
0
3
@sunjiao123sun_
Jiao Sun
2 years
According to the value of {dialect} - {semantic perturb}, NANO helps improve the dialect robustness across different model sizes and languages (English, Portuguese, and Mandarin Chinese!) The success rates of {dialect} > {semantic perturb} also indicate that NANO helps! (5/n)
Tweet media one
1
0
3
@sunjiao123sun_
Jiao Sun
2 years
To facilitate this new setup, we collect a corpus that contains 4,551 tuples of context keywords and associated pun pairs, labeled with whether they are compatible for composing a pun, and a human-written pun for each compatible tuple. (1/3)
Tweet media one
1
0
3
@sunjiao123sun_
Jiao Sun
2 years
As a result, we identify 11 categories of explainability needs in the context of GenAI for code with definitions and examples! (3/n)
Tweet media one
1
0
3
@sunjiao123sun_
Jiao Sun
4 years
How to help Fraud Detection experts better tune the algorithms and evaluate the result? Check out our paper FDHelper! #chi2020 #AutoML
@lynn20434203
Yin Li
4 years
Although still in the mood of shattered Hawaii dream, I want to share a pre-print of our accepted #CHI2020 paper "FDHelper: Assist Unsupervised Fraud Detection Experts with Interactive Feature Selection and Evaluation". Find out more here: . 🧡 @JiaosunT
0
0
6
0
0
3
@sunjiao123sun_
Jiao Sun
2 years
We include 10 languages, 95 language variants in pretraining. Then, we adapt the metric to different use cases including within-language assessment and quality estimation with or without references. (4/n)
Tweet media one
1
0
3