JAIST Repository >
e. 情報社会基盤研究センター >
e10. 学術雑誌論文等 >
e10-1. 雑誌掲載論文 >

このアイテムの引用には次の識別子を使用してください: http://hdl.handle.net/10119/7867

タイトル: Improving discriminative sequential learning by discovering important association of statistics
著者: Phan, Xuan-Hieu
Nguyen, Le-Minh
Inoguchi, Yasushi
Ho, Tu-Bao
Horiguchi, Susumu
キーワード: Discriminative sequential learning
feature selection
association rule mining
information extraction
text segmentation
発行日: 2006-12
出版者: Association for Computing Machinery
誌名: ACM Transactions on Asian Language Information Processing
巻: 5
号: 4
開始ページ: 413
終了ページ: 438
DOI: 10.1145/1236181.1236187
抄録: Discriminative sequential learning models like Conditional Random Fields (CRFs) have achieved significant success in several areas such as natural language processing or information extraction. Their key advantage is the ability to capture various nonindependent and overlapping features of inputs. However, several unexpected pitfalls have a negative influence on the model's performance; these mainly come from a high imbalance among classes, irregular phenomena, and potential ambiguity in the training data. This article presents a data-driven approach that can deal with such difficult data instances by discovering and emphasizing important conjunctions or associations of statistics hidden in the training data. Discovered associations are then incorporated into these models to deal with difficult data instances. Experimental results of phrase-chunking and named entity recognition using CRFs show a positive improvement in accuracy. In addition to the technical perspective, our approach also highlights a potential connection between association mining and statistical learning by offering an alternative strategy to enhance learning performance with interesting and useful patterns discovered from large datasets.
Rights: (c) ACM, 2006. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in ACM Transactions on Asian Language Information Processing, 5(4), 2006, 413-438. http://doi.acm.org/10.1145/1236181.1236187
URI: http://hdl.handle.net/10119/7867
資料タイプ: author
出現コレクション:e10-1. 雑誌掲載論文 (Journal Articles)


ファイル 記述 サイズ形式
C11446.pdf257KbAdobe PDF見る/開く



お問い合わせ先 : 北陸先端科学技術大学院大学 研究推進課図書館情報係