Pushpa Publishing House

Journal Menu

Content

Volume 24 (2021)

Volume 23 (2020)

Volume 20 (2019)

Volume 19 (2019)

Volume 18 (2018)

Volume 17 (2017)

Special Volume 3 (2016)

Special Volume 2 (2016)

Volume 16 (2016)

	Volume 16, Issue 4 Pg 745 - 1017 (December 2016)
	Volume 16, Issue 3 Pg 471 - 744 (September 2016)
	Volume 16, Issue 2 Pg 203 - 469 (June 2016)
	Volume 16, Issue 1 Pg 1 - 201 (March 2016)

Special Volume 1 (2016)

Volume 15 (2015)

Volume 14 (2015)

Volume 13 (2014)

Volume 12 (2014)

Volume 11 (2013)

Volume 10 (2013)

Volume 9 (2012)

Volume 8 (2012)

Volume 7 (2011)

Volume 6 (2011)

Volume 5 (2010)

Volume 4 (2010)

Volume 3 (2009)

Volume 2 (2008)

Volume 1 (2007)

Important: All future articles and volumes will be published only on our new website: pphmjopenaccess.com. Authors are requested to submit their papers through the new website only. Visit now: pphmjopenaccess.com

Far East Journal of Electronics and Communications

Far East Journal of Electronics and Communications
Volume 16, Issue 4, Pages 763 - 774 (December 2016)
http://dx.doi.org/10.17654/EC016040763

FEATURE ENGINEERING FOR TOPICAL CLUSTERING BASED ON NAMED ENTITY

Hyo-Jung Oh and Bo-Hyun Yun

Abstract:

Conventional clustering researches are focused on the extraction of keywords for word similarity grouping. However, high complexity, low speed, and low accuracy are incurred owing to the computation of too many candidates. To overcome these weaknesses, this paper presents a topical web document clustering model using not only keywords but also named entities such as a person’s name, organization, and location. We compare our proposed model with traditional models experimentally and analyze how different the effects of named entities are according to the characteristics of the document collection. For feature engineering, we adopt word embedding techniques as the collective name for a set of language modeling in natural language processing. In particular, we examine the correlation among topic words of clustered sets according to the concept level of the named entities.

Keywords and phrases:

web document clustering, named entity, feature engineering, word embedding.

Number of Downloads: 521 | Number of Views: 1925