JAIST Repository >
a. 知識科学研究科・知識科学系 >
a10. 学術雑誌論文等 >
a10-1. 雑誌掲載論文 >

このアイテムの引用には次の識別子を使用してください: http://hdl.handle.net/10119/9212

完全登録情報レコード

ダブリン・コア・フィールド言語
contributor.authorZhang, Wenen_US
contributor.authorYoshida, Taketoshien_US
contributor.authorTang, Xijinen_US
contributor.authorHo, Tu Baoen_US
date.accessioned2010-10-25T06:56:04Z-
date.available2010-10-25T06:56:04Z-
date.issued2009-02-20en_US
identifier.urihttp://hdl.handle.net/10119/9212-
description.abstractOne of the deficiencies of mutual information is its poor capacity to measure association of words with unsymmetrical co-occurrence, which has large amounts for multi-word expression in texts. Moreover, threshold setting, which is decisive for success of practical implementation of mutual information for multi-word extraction, brings about many parameters to be predefined manually in the process of extracting multiword expressions with different number of individual words. In this paper, we propose a new method as EMICO (Enhanced Mutual Information and Collocation Optimization) to extract substantival multiword expression from text. Specifically, enhanced mutual information is proposed to measure the association of words and collocation optimization is proposed to automatically determine the number of individual words contained in a multiword expression when the multiword expression occurs in a candidate set. Our experiments showed that EMICO significantly improves the performance of substantival multiword expression extraction in comparison with a classic extraction method based on mutual information.en_US
format.extent558947 bytes-
format.mimetypeapplication/pdf-
language.isoenen_US
publisherElsevieren_US
rightsNOTICE: This is the author's version of a work accepted for publication by Elsevier. Wen Zhang, Taketoshi Yoshida, Xijin Tang, and Tu-Bao Ho, Expert Systems with Applications, 36(8), 2009, 10919-10930, http://dx.doi.org/10.1016/j.eswa.2009.02.026en_US
subjectSubstantival multiword expressionen_US
subjectMutual informationen_US
subjectEnhanced mutual informationen_US
subjectCollocation optimizationen_US
subjectEMICOen_US
titleImproving effectiveness of mutual information for substantival multiword expression extractionen_US
type.niiJournal Articleen_US
identifier.niiissn0957-4174en_US
identifier.jtitleExpert Systems with Applicationsen_US
identifier.volume36en_US
identifier.issue8en_US
identifier.spage10919en_US
identifier.epage10930en_US
relation.doi10.1016/j.eswa.2009.02.026en_US
rights.textversionauthoren_US
language.iso639-2engen_US
出現コレクション:a10-1. 雑誌掲載論文 (Journal Articles)

このアイテムのファイル:

ファイル 記述 サイズ形式
13876.pdf545KbAdobe PDF見る/開く

当システムに保管されているアイテムはすべて著作権により保護されています。

 


お問合せ先 : 北陸先端科学技術大学院大学 研究推進課図書館情報係 (ir-sys[at]ml.jaist.ac.jp)