Personal role update: From January 2023 I'll start leading the entire Google Brain team in Japan, including the research team previously led by David Ha (@hardmaru). I'm looking forward to working with the team members & learning new topics such as RL, ALIFE, & AI creativity!
I've been promoted to Principal Research Scientist at Google Brain (soon Google DeepMind). I'm grateful to my family, colleagues, collaborators & those who have supported me throughout my career. I'm excited to continue working w/ all of you to make a difference in the world.
Google Brain Tokyo members had a chance to meet and have lunch with Prof. Hinton today. He also gave a tech talk about his latest work on capsule networks in the Tokyo office this afternoon. Thanks a lot to @hardmaru for organizing these events!
Yet another neural vocoder from my teammates in Google Brain is out! The new model, "WaveGrad", is not autoregressive, flow-based, or GAN-based; it is based on score matching / diffusion probabilistic models. Please check it out!!
"WaveGrad: Estimating Gradients for Waveform Generation" (arXiv:2009.00713). Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, William Chan.
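To make the "score matching / diffusion" idea concrete, here is a toy sketch of one diffusion-style training step for a waveform model. This is not WaveGrad's actual code: the continuous noise-level sampling is simplified, and `predict_noise` is a hypothetical stand-in for the real neural network.

```python
import numpy as np

rng = np.random.default_rng(0)

def diffusion_training_step(waveform, predict_noise):
    """One denoising-diffusion training step, sketched: corrupt the
    waveform with Gaussian noise at a random level, then measure how
    well the model recovers the injected noise."""
    # Sample a continuous noise level in (0, 1), loosely following
    # WaveGrad's idea of conditioning on the noise level itself.
    sqrt_alpha_bar = rng.uniform(1e-4, 1.0 - 1e-4)
    noise = rng.standard_normal(waveform.shape)
    noisy = sqrt_alpha_bar * waveform + np.sqrt(1.0 - sqrt_alpha_bar**2) * noise
    predicted = predict_noise(noisy, sqrt_alpha_bar)
    # MSE between true and predicted noise; in a real system the
    # gradient of this loss trains the network.
    return float(np.mean((predicted - noise) ** 2))

# Toy "model" that always predicts zeros, so the loss is roughly
# the mean squared magnitude of the injected unit-variance noise.
waveform = rng.standard_normal(16000)  # 1 second of 16 kHz audio
loss = diffusion_training_step(waveform, lambda y, level: np.zeros_like(y))
print(loss)
```

A trained network would drive this loss toward zero by learning to predict the injected noise given the noisy waveform and the noise level.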
"Google is holding Google Developers ML Summit, an event dedicated to machine learning, on July 11. Jeff Dean and other members of the TensorFlow team will come to Japan to give sessions on the machine learning tools Google provides to developers, including TensorFlow, Cloud ML, and ML Kit."
I've been elected as a Fellow of the International Speech Communication Association (#ISCA).
Huge thanks to my colleagues & collaborators who have taught me so much, & to #Google, which has given me opportunities to work on real-world problems.
I was advised to make this tweet more informative :-)
My team (Google Brain Applied Research in Tokyo) is hiring a Research Software Engineer (Machine Learning) who has professional experience and/or publications in speech processing, NLP, or computer vision.
I will move to Google Brain this July. By the end of this year, I will return to Japan and be one of the founding members of the new Google Brain Tokyo team;
I'm looking forward to working with the Brain team members and people in the Tokyo office!
We are looking for colleagues to work on AI research in the Tokyo office! Happy to see our #GoogleAI efforts expanding w/ Google Brain now having a research presence in Tokyo. We’re hiring machine learning researchers there; if you’re interested in helping advance AI, apply here —>
Introducing Gemini 1.0, our most capable and general AI model yet. Built natively to be multimodal, it’s the first step in our Gemini-era of models. Gemini is optimized in three sizes - Ultra, Pro, and Nano
Gemini Ultra’s performance exceeds current state-of-the-art results on
Today I presented this paper (Best Student Paper at Interspeech2019) at Google Brain Tokyo's paper reading group. It was fun :-)
Adversarially Trained End-to-end Korean Singing Voice Synthesis System
Paper:
Slide & Demo:
Our team at @GoogleDeepMind Japan is hiring a Research Engineer to work on Neural Speech Understanding and Speech Generative Modeling in Tokyo!
If you are interested and have experience in these topics, please consider applying via the link below:
Google Cloud for Researchers
"Submit a proposal to receive up to $5,000 in free Google Cloud credits for academic research. Use Google's high performance computing capabilities. ..."
Google Research Japan is hiring a Research Scientist in AI for Social Good.
This is a great opportunity for researchers in Japan who are working on AI & Healthcare.
New paper from our team:
Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang, Ye Jia, RJ Skerry-Ryan, Yonghui Wu
"Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling"
Arxiv:
Samples:
New paper from our collaborators at Google Research.
"We present Translatotron 3, a novel unsupervised speech-to-speech translation architecture. In Translatotron 3, we show that it is possible to learn a speech-to-speech translation task from monolingual data alone."
Introducing Translatotron 3, an unsupervised approach to speech-to-speech translation that can learn from monolingual data, mitigating the challenges of requiring parallel speech data & opening the door to translation of the non-textual speech attributes →
Neural vocoder from Xiaomi. The representation is extracted by a neural encoder, rather than a knowledge-based fixed representation such as a mel-spectrogram or WORLD vocoder parameters. Conceptually similar to VQ-VAE.
"RawNet: Fast End-to-End Neural Vocoder"
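The VQ-VAE analogy above can be illustrated with the core quantization step: each continuous latent vector is snapped to its nearest codebook entry, yielding a learned discrete representation instead of a hand-designed one. A minimal sketch (the `vector_quantize` helper and codebook shapes are illustrative, not RawNet's actual architecture):

```python
import numpy as np

def vector_quantize(latents, codebook):
    """Map each continuous latent vector to its nearest codebook entry
    (the VQ-VAE-style bottleneck; sketch only)."""
    # Squared Euclidean distance between every latent and every code.
    d = ((latents[:, None, :] - codebook[None, :, :]) ** 2).sum(-1)
    indices = d.argmin(axis=1)           # discrete code ids
    return codebook[indices], indices    # quantized latents + their ids

rng = np.random.default_rng(0)
codebook = rng.standard_normal((8, 4))   # 8 codes of dimension 4
latents = codebook[[2, 5]] + 0.01        # latents lying near codes 2 and 5
quantized, ids = vector_quantize(latents, codebook)
print(ids.tolist())
```

In a full model the encoder, decoder, and codebook are trained jointly; here the point is only that the "representation" is a set of learned codes rather than fixed acoustic features.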
Google DeepMind's music generation model Lyria and Music AI tools under the partnership w/ YouTube.
I'm so excited to see this announcement, and looking forward to seeing how it will help creators!!
(reposting as there was a typo)
"25 Years of Evolution in Speech and Language Processing"
"In this article, we summarize the evolution of speech and language processing (SLP) in the past 25 years. We first provide a snapshot of popular research topics and the associated state of ..."
Today (25th July, 2021) is my 10th #Googleversary.
I am fortunate that I could work with so many talented people at @Google. A huge thank you to my friends and colleagues who have taught me so much.
New paper from our team:
Ye Jia, Heiga Zen, Jonathan Shen, Yu Zhang, Yonghui Wu
"PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS"
Arxiv:
Samples:
After 7 years in the Speech team @ Google, it’s time for me to take on a new adventure; I'll leave the team at the end of this month. I feel grateful for having had the opportunity to work as a part of the team. I learned a lot. I also feel proud of the team's incredible achievements.
Direct speech-to-speech translation with a sequence-to-sequence model. Ye Jia, Ron J. Weiss, Fadi Biadsy, Wolfgang Macherey, Melvin Johnson, Zhifeng Chen, and Yonghui Wu
Multimodal Web Navigation with Instruction-Finetuned Foundation Models
"We propose an instruction-following multimodal agent, WebGUM, that observes both webpage screenshots and HTML pages and outputs web navigation actions, such as click and type. WebGUM is trained by jointly ..."
A paper from my team:
Jonathan Shen, Ye Jia, Mike Chrzanowski, Yu Zhang, Isaac Elias, Heiga Zen, Yonghui Wu
Non-Attentive Tacotron: Robust & Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling
paper:
audio:
Translatotron 2: Robust direct speech-to-speech translation
pdf:
samples:
outperforms Translatotron by a large margin in terms of translation quality and predicted speech naturalness
Translatotron is our experimental model for direct end-to-end speech-to-speech translation, which demonstrates the potential for improved translation efficiency, fewer errors, and better handling of proper nouns. Learn all about it below!
SpecGrad is yet another denoising diffusion probabilistic model (DDPM)-based neural vocoder incorporating more ideas from signal processing to achieve better performance.
"Even if we do not have a vaccine, our plan is that we will be able to deliver the Games": @Tokyo2020 spokesman Masa Takaya tells #TheNine's @BBCchrismclaug that organisers are planning for the postponed Olympic Games to go ahead this summer with spectators present.
We are hiring for a wide variety of research roles within @GoogleAI.
See our web site, or if you're at @NeurIPSConf, stop by the Google booth!
(Someone wondered if we weren't hiring because I hadn't tweeted about this, so trying to fix this impression.)
It has been 8 months since I signed the offer letter. Today is my first day as a Senior Research Scientist on the Google Brain team based in Tokyo.
I am so grateful for all the opportunities and support I have received. I will do my best.
I look forward to working with you all.
I like the Hanawa Hokiichi Google Doodle.
WaveGrad 2 -- Iterative Refinement for Text-to-Speech Synthesis
"WaveGrad 2 is trained to estimate the gradient of the log conditional density of the waveform given a phoneme sequence. "
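The "iterative refinement" in the title refers to reverse-diffusion sampling: start from pure Gaussian noise and repeatedly denoise it using the learned gradient estimate. A schematic sketch of such a sampling loop, assuming a generic DDPM update (the real WaveGrad 2 network also conditions on the phoneme sequence, which the placeholder `predict_noise` here ignores):

```python
import numpy as np

rng = np.random.default_rng(0)

def iterative_refinement(predict_noise, length, betas):
    """Reverse-diffusion sampling loop, sketched: begin with Gaussian
    noise and refine it step by step toward a waveform."""
    alphas = 1.0 - betas
    alpha_bars = np.cumprod(alphas)
    y = rng.standard_normal(length)          # start from pure noise
    for t in reversed(range(len(betas))):
        eps = predict_noise(y, t)            # network's noise estimate
        # Standard DDPM posterior-mean update.
        y = (y - betas[t] / np.sqrt(1.0 - alpha_bars[t]) * eps) / np.sqrt(alphas[t])
        if t > 0:                            # add noise except on the last step
            y += np.sqrt(betas[t]) * rng.standard_normal(length)
    return y

betas = np.linspace(1e-4, 0.05, 6)  # a tiny 6-step schedule for illustration
audio = iterative_refinement(lambda y, t: np.zeros_like(y), 24000, betas)
print(audio.shape)
```

With a trained noise-prediction network and a longer schedule, the same loop turns random noise into speech; the number of refinement steps trades off quality against synthesis speed.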