Jun Goto


2020

pdf bib
NHK_STRL at WNUT-2020 Task 2: GATs with Syntactic Dependencies as Edges and CTC-based Loss for Text Classification
Yuki Yasuda | Taichi Ishiwatari | Taro Miyazaki | Jun Goto
Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020)

The outbreak of COVID-19 has greatly impacted our daily lives. In these circumstances, it is important to grasp the latest information to avoid causing too much fear and panic. To help grasp new information, extracting information from social networking sites is one of the effective ways. In this paper, we describe a method to identify whether a tweet related to COVID-19 is informative or not, which can help to grasp new information. The key features of our method are its use of graph attention networks to encode syntactic dependencies and word positions in the sentence, and a loss function based on connectionist temporal classification (CTC) that can learn a label for each token without reference data for each token. Experimental results show that the proposed method achieved an F1 score of 0.9175, out- performing baseline methods.

pdf bib
Relation-aware Graph Attention Networks with Relational Position Encodings for Emotion Recognition in Conversations
Taichi Ishiwatari | Yuki Yasuda | Taro Miyazaki | Jun Goto
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

Interest in emotion recognition in conversations (ERC) has been increasing in various fields, because it can be used to analyze user behaviors and detect fake news. Many recent ERC methods use graph-based neural networks to take the relationships between the utterances of the speakers into account. In particular, the state-of-the-art method considers self- and inter-speaker dependencies in conversations by using relational graph attention networks (RGAT). However, graph-based neural networks do not take sequential information into account. In this paper, we propose relational position encodings that provide RGAT with sequential information reflecting the relational graph structure. Accordingly, our RGAT model can capture both the speaker dependency and the sequential information. Experiments on four ERC datasets show that our model is beneficial to recognizing emotions expressed in conversations. In addition, our approach empirically outperforms the state-of-the-art on all of the benchmark datasets.

2019

pdf bib
Label Embedding using Hierarchical Structure of Labels for Twitter Classification
Taro Miyazaki | Kiminobu Makino | Yuka Takei | Hiroki Okamoto | Jun Goto
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)

Twitter is used for various applications such as disaster monitoring and news material gathering. In these applications, each Tweet is classified into pre-defined classes. These classes have a semantic relationship with each other and can be classified into a hierarchical structure, which is regarded as important information. Label texts of pre-defined classes themselves also include important clues for classification. Therefore, we propose a method that can consider the hierarchical structure of labels and label texts themselves. We conducted evaluation over the Text REtrieval Conference (TREC) 2018 Incident Streams (IS) track dataset, and we found that our method outperformed the methods of the conference participants.

2018

pdf bib
Classification of Tweets about Reported Events using Neural Networks
Kiminobu Makino | Yuka Takei | Taro Miyazaki | Jun Goto
Proceedings of the 2018 EMNLP Workshop W-NUT: The 4th Workshop on Noisy User-generated Text

We developed a system that automatically extracts “Event-describing Tweets” which include incidents or accidents information for creating news reports. Event-describing Tweets can be classified into “Reported-event Tweets” and “New-information Tweets.” Reported-event Tweets cite news agencies or user generated content sites, and New-information Tweets are other Event-describing Tweets. A system is needed to classify them so that creators of factual TV programs can use them in their productions. Proposing this Tweet classification task is one of the contributions of this paper, because no prior papers have used the same task even though program creators and other events information collectors have to do it to extract required information from social networking sites. To classify Tweets in this task, this paper proposes a method to input and concatenate character and word sequences in Japanese Tweets by using convolutional neural networks. This proposed method is another contribution of this paper. For comparison, character or word input methods and other neural networks are also used. Results show that a system using the proposed method and architectures can classify Tweets with an F1 score of 88 %.

2017

pdf bib
Extracting Important Tweets for News Writers using Recurrent Neural Network with Attention Mechanism and Multi-task Learning
Taro Miyazaki | Shin Toriumi | Yuka Takei | Ichiro Yamada | Jun Goto
Proceedings of the 31st Pacific Asia Conference on Language, Information and Computation

pdf bib
Tweet Extraction for News Production Considering Unreality
Yuka Takei | Taro Miyazaki | Ichiro Yamada | Jun Goto
Proceedings of the 31st Pacific Asia Conference on Language, Information and Computation

2013

pdf bib
NICT Disaster Information Analysis System
Kiyonori Ohtake | Jun Goto | Stijn De Saeger | Kentaro Torisawa | Junta Mizuno | Kentaro Inui
The Companion Volume of the Proceedings of IJCNLP 2013: System Demonstrations

2003

pdf bib
Question-Answering Based on Virtually Integrated Lexical Knowledge Base
Key-Sun Choi | Jae-Ho Kim | Masaru Miyazaki | Jun Goto | Yeun-Bae Kim
Proceedings of the Sixth International Workshop on Information Retrieval with Asian Languages

pdf bib
A Spoken Dialogue Interface for TV Operations based on Data Collected by using WOZ Method
Jun Goto | Yeun-Bae Kim | Masaru Miyazaki | Kazuteru Komine | Noriyoshi Uratani
The Companion Volume to the Proceedings of 41st Annual Meeting of the Association for Computational Linguistics