koheiw7 Profile Banner
Kohei Watanabe Profile
Kohei Watanabe

@koheiw7

Followers
737
Following
34
Statuses
80

Text analyst, R package developer (quanteda, newsmap, proxyC, LSS), political scientist interested in international communication

Joined June 2013
Don't wanna be here? Send us removal request.
@koheiw7
Kohei Watanabe
2 months
A few days ago, I received an email from a researcher asking if text analysis is becoming irrelevant because of AI... please read my post "AI products and text analysis methods" #quanteda #rstats
0
3
10
@koheiw7
Kohei Watanabe
3 months
If you think the number of topics, k, is the only important parameter for topic models, you need to read this post and the research paper. I created a new model to optimize the Dirichlet priors to analyze imbalanced corpus more accurately.
0
2
2
@koheiw7
Kohei Watanabe
6 months
In topic modeling, it is important to adjust parameters for topic sizes. If you don't know how, read this post and try the new algorithm: Looking forward to your feedback. #LDA #rstats #quanteda
Tweet media one
0
8
19
@koheiw7
Kohei Watanabe
10 months
You can make your R scripts for large-scale text analysis 2x faster and 3x more memory efficient using the external pointer tokens object (tokens_xptr) in new quanteda #RStats #quanteda
1
6
22
@koheiw7
Kohei Watanabe
10 months
Does anyone know how to link to C++ header files (installed using homebrew) in R package on MacOS? #Rstat #quanteda #macOS
0
1
0
@koheiw7
Kohei Watanabe
11 months
LSX v1.4.0 has been released with a new visualization function. The plot highlights words in an LSS model about security threats with different colors depending on their association with China, North Korea, Iran or Russia #R #quanteda
Tweet media one
0
6
23
@koheiw7
Kohei Watanabe
1 year
@TBroekel @marius_saeltzer @Res_Pol Replication code is
0
0
1
@koheiw7
Kohei Watanabe
1 year
Our semantic temporality analysis recognises various temporal features beyond the tense of verbs such as adjectives, adverbs and lexical aspects. You don't need to use POS tagger or training data because it is based on semi-supervised algorithm!
@marius_saeltzer
Marius Sältzer
1 year
New #openaccess publication in @Res_Pol. @koheiw7 and I present a new method to estimate temporal focus in text without Training data that only uses a low number of commonly used verbs and their inflections.
1
3
11
@koheiw7
Kohei Watanabe
1 year
RT @marius_saeltzer: New #openaccess publication in @Res_Pol. @koheiw7 and I present a new method to estimate temporal focus in text with…
0
13
0
@koheiw7
Kohei Watanabe
2 years
@wiasnews いつも研究をサポートしてくれてありがとうございます!
0
0
1
@koheiw7
Kohei Watanabe
2 years
日曜日の日本メディア学会の研究会で約束したとおり、種語の選び方について説明したページを作りました。
0
2
4
@koheiw7
Kohei Watanabe
2 years
Having trouble identifying topics of sentences in large corpora? Use Distributed Sequential LDA implemented in the seededlda package. #textanalysis #nlp #rstats
Tweet media one
0
6
35
@koheiw7
Kohei Watanabe
2 years
@Dr_Machinavelli Very interesting. Thanks for sharing. Looking forward to reading the paper.
0
0
0
@koheiw7
Kohei Watanabe
2 years
We can estimate emotions of words and emojis accurately using #LSS and #quanteda. If you are interested, please read our article in the J of Medical Internet Research: This is my best shot in analysis of social media.
Tweet media one
0
5
20
@koheiw7
Kohei Watanabe
2 years
@BoniKutela I don't think I will add network analysis functions, because there are many other packages for that. You can open a feature request about collocations on Github.
1
0
0
@koheiw7
Kohei Watanabe
2 years
RT @NataliaUmansky: Very much looking forward to presenting part of my PhD research at the @CPH_SODAS Data Discussion next week! Thanks so…
0
4
0
@koheiw7
Kohei Watanabe
2 years
@ste_mueller Let's discuss openly:
0
1
1
@koheiw7
Kohei Watanabe
2 years
Users should actively participate in discussions on open source development. Otherwise, the project stall and eventually die!
0
2
5
@koheiw7
Kohei Watanabe
2 years
Belatedly, I published instructions on how to perform Latent Semantic Scaling (LSS) using the LSX package. I hope this helps:
0
1
8