2020

Research / 2020

Research

A Named Entity Topic Model for News Popularity Prediction

2020.09.19

Yang Yang, Yang Liu, Xiaoling Lu, Jin Xu, Feifei Wang 


Publication Time2020.09.19

Lead Author】Yang Yang

Corresponding Author】Feifei Wang 

Journal】 KNOWLEDGE-BASED SYSTEMS

Abstract

Predicting the popularity of web content is widely regarded as an important but challenging task. Online news articles are typical examples of this. In particular, owing to their time-sensitive nature, it is preferable to predict the popularity of news articles before their publication. To achieve this, this study proposes a named entity topic model (NETM) to extract the textual factors that can drive popularity growth. Here, each named entity is assumed to have a popularity-gain distribution over all semantic topics. The popularity of a news article is considered as the accumulation of popularity gains generated by its named entities (NEs) over all the topics. By learning the popularity-gain matrix for each named entity, the popularity of any news article can be predicted. Experiments on two collections of news articles demonstrate that the proposed NETM can outperform existing models in terms of accuracy. Additionally, the popularity-gain matrix learned by the NETM can be used to effectively explain the popularity of specific news articles.

Keywords

Named entity,Web content,Popularity prediction,Topic model