## ----------------------------------
## Basic information and license
## ----------------------------------
Title: NPCMJ (NINJAL Parsed Corpus of Modern Japanese)
Publisher: Research Division, National Institute for Japanese Language and Linguistics
Place of publication: Tokyo
Publication date: 1 July 2024
doi: https://doi.org/10.15084/0002000284
License: CC BY 4.0
Contact: Prashant Pardeshi (prashant@ninjal.ac.jp)
Version: Version as of 2024.07.01
Funding: Collaborative Research Project "Development of and Linguistic Research with a Parsed Corpus of Japanese" of the National Institute for Japanese Language and Linguistics (NINJAL), April 2016 to March 2022 (project leader: Prashant Pardeshi, NINJAL).

@misc{NPCMJ Development Team 24,
  address = {Tokyo},
  author = {NPCMJ(NINJAL Parsed Corpus of Modern Japanese)Development Team},
  publisher = {Research Department, National Institute for Japanese Language and Linguistics (NINJAL)},
  title = {NPCMJ(NINJAL Parsed Corpus of Modern Japanese)},
  year = {2024},
  yomi = {NPCMJ(NINJAL Parsed Corpus of Modern Japanese)Development Team},
  doi = {https://doi.org/10.15084/0002000284}}

## ----------------------------------
## File description
## ----------------------------------
NPCMJ (NINJAL Parsed Corpus of Modern Japanese) adds syntactic and semantic annotation to written and spoken texts of Contemporary Japanese. The annotation makes it possible to search for and extract a rich inventory of function words, phrase structures, clause types, and complex constructions, and to use the results in research. Approximately 90,000 sentences (90,000 trees) have been made publicly available.

The data is distributed as a compressed zip file containing all the sample files of the NPCMJ in bracketed tree format (.psd files). There are two versions: kana tree files and roman tree files.

## ----------------------------------
## File structure
## ----------------------------------
For details, please refer to the annotation manual (NPCMJ アノテーションマニュアル, "NPCMJ Annotation Manual").
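
As a rough illustration of what reading a bracketed tree file can look like, the following Python sketch parses parenthesized trees into nested lists and collects the (label, word) pairs at the leaves. It is a generic reader for parenthesized trees, not an official NPCMJ tool. The function names `parse_trees` and `terminals`, the sample tree, and its labels are made up for this example; the actual tag set and file layout are defined in the annotation manual.

```python
import re

# One token is "(", ")", or a run of characters with no whitespace/parentheses.
TOKEN = re.compile(r"\(|\)|[^\s()]+")


def parse_trees(text):
    """Parse bracketed trees into nested lists, e.g. (A (B c)) -> ['A', ['B', 'c']]."""
    stack = [[]]  # stack[0] collects the top-level trees
    for tok in TOKEN.findall(text):
        if tok == "(":
            stack.append([])
        elif tok == ")":
            if len(stack) == 1:
                raise ValueError("unbalanced ')'")
            node = stack.pop()
            stack[-1].append(node)
        else:
            stack[-1].append(tok)
    if len(stack) != 1:
        raise ValueError("unbalanced '('")
    return stack[0]


def terminals(node):
    """Yield (label, word) pairs from nodes of the form [label, word]."""
    if len(node) == 2 and all(isinstance(x, str) for x in node):
        yield tuple(node)
        return
    for child in node:
        if isinstance(child, list):
            yield from terminals(child)


if __name__ == "__main__":
    # Hypothetical sample; real .psd files use the labels from the manual.
    sample = "(S (NP (N Taro)) (VP (V runs)))"
    for tree in parse_trees(sample):
        print(list(terminals(tree)))
```

To apply this to a downloaded file, read its contents as text (for example with `open(path, encoding="utf-8").read()`, assuming UTF-8 encoding) and pass the string to `parse_trees`.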