全文検索システム『ひまわり』を利用した言語資料検索環境の構築手法

山口, 昌也; ヤマグチ, マサヤ; YAMAGUCHI, Masaya

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

全文検索システム『ひまわり』を利用した言語資料検索環境の構築手法

https://doi.org/10.15084/00002175

名前 / ファイル	ライセンス	アクション
kk_ngkgk_021_06.pdf (1.1 MB)

アイテムタイプ

紀要論文 / Departmental Bulletin Paper(1)

公開日

2019-03-25

タイトル

全文検索システム『ひまわり』を利用した言語資料検索環境の構築手法

タイトル

A method of constructing a retrieval environment for language resources using full-text retrieval system "Himawari"

言語

jpn

キーワード

主題Scheme

Other

主題

全文検索システム

キーワード

主題Scheme

Other

主題

言語資料

キーワード

主題Scheme

Other

主題

XML

キーワード

言語

主題Scheme

Other

主題

full-text retrieval system

キーワード

言語

主題Scheme

Other

主題

language resource

キーワード

言語

主題Scheme

Other

主題

XML

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_6501

資源タイプ

departmental bulletin paper

ID登録

10.15084/00002175

ID登録タイプ

JaLC

著者

山口, 昌也

WEKO 7385

	山口, 昌也
ja-Kana	ヤマグチ, マサヤ

Search repository

YAMAGUCHI, Masaya

著者所属

内容記述タイプ

Other

内容記述

国立国語研究所

著者所属(英)

内容記述タイプ

Other

内容記述

The National Institute for Japanese Language

抄録

内容記述タイプ

Abstract

内容記述

現在,新聞・小説などのテキストデータベースや言語研究用に構築されたコーパスなどの言語資料が利用できるようになっている。しかし,言語資料を検索・閲覧するための手段が提供されることは少なく,言語資料が有効に活用されていないという問題がある。本稿の目的は,言語資料を有効に活用するため,全文検索システム『ひまわり』を用いて,言語資料の検索環境を構築する方法を示すことである。特に,検索環境構築時の実際的な事柄(文字コードなど)にも配慮し,既存の言語資料をどのような形式に整形すれば,どのような検索環境が構築できるのかを,実例に基づいて説明する。本稿では,まず,『ひまわり』の機能概要,および,検索能力を説明したのち,それに基づいて,(1)生テキストに近い言語資料,(2)形態素情報が付与された言語資料,(3)画像データと関連づけられた言語資料,の3種類の言語資料に対する検索環境を構築する。

抄録(英)

内容記述タイプ

Other

内容記述

Recently, many language resources, for example, text database of newspapers and novels, have become available for language study. However, these resources often do not have their own retrieval systems. This situation makes it difficult for users to utilize these resources for language study. In this paper, I propose a method of constructing a retrieval and browsing environment for existing language resources, using the full-text retrieval system "Himawari" that can retrieve contents and arguments in XML document under a user-specified condition. This paper describes a practical procedure that converts existing language resources into the XML documents that "Himawari" is able to handle. In the introduction, the functions and the retrieval ability of "Himawari" will be expounded. Then, the retrieval environment will be constructed for three kinds of language resources: 1) a nearly raw text, 2) a text with morphological information, 3) a text related to images (a four-frame comic strip)-i.e., these resources are converted to XML documents. Finally, the usefulness of the environment is demonstrated by showing the efficiency of the tagged information appeared in the search results.

出版者

国書刊行会

書誌情報

日本語科学
en : Japanese Linguistics

巻 21, p. 111-123, 発行日 2007-04-25

フォーマット

内容記述タイプ

Other

内容記述

application/pdf

著者版フラグ

出版タイプ

VoR

出版タイプResource

http://purl.org/coar/version/c_970fb48d4fbd8a85

戻る

views

See details

	Views

Versions

Ver.1

2023-05-15 15:09:31.023518

Show All versions

Cite as

Other

エクスポート

OAI-PMH

JPCOAR 2.0
JPCOAR 1.0
DublinCore
DDI

Other Formats

インデックスリンク

インデックスツリー

アイテム

全文検索システム『ひまわり』を利用した言語資料検索環境の構築手法

× 山口, 昌也

× YAMAGUCHI, Masaya

Versions

Share

Cite as

Other

エクスポート

コミュニティ

メニューを最小化

インデックスリンク

インデックスツリー

アイテム

全文検索システム『ひまわり』を利用した言語資料検索環境の構築手法

× 山口, 昌也

× YAMAGUCHI, Masaya

Versions

Share

Cite as

Other

エクスポート

コミュニティ