Spoken Language Division, Research Department, NINJAL
Adjunct Researcher, Spoken Language Division, Research Department, NINJAL
Adjunct Researcher, Spoken Language Division, Research Department, NINJAL
Adjunct Researcher, Spoken Language Division, Research Department, NINJAL
Spoken Language Division, Research Department, NINJAL
Adjunct Researcher, Spoken Language Division, Research Department, NINJAL
Adjunct Researcher, Spoken Language Division, Research Department, NINJAL
Chiba University
Adjunct Researcher, Center for Corpus Development, NINJAL
We have been constructing the Corpus of Everyday Japanese Conversation (CEJC) under the NINJAL collaborative research project since 2016. The CEJC is designed to contain various kinds of everyday conversations in a balanced manner to capture the diversity of everyday conversations and to observe natural conversational behavior. Prior to the publication of the whole corpus, which scheduled for 2022, we published the monitor version of the CEJC in December 2018. In this paper, we first outlined the design of the monitor version of the CEJC, including recording methods, the release policy of the corpus, corpus size, and annotations. Then, we examined whether the speakers and the conversations in the corpus vary in a balanced manner. Finally, we conducted a preliminary analysis on some linguistic aspects of the monitor version of the CEJC, revealing the possible implications of the corpus.