Publications

International Journal

Associative Topic Models with Numerical Time Series
categorize
Machine Learning
Author
Park, S.R., Lee, W.S., & Moon, I.C.
Year
2015
Month
Sep
Journal Name
Information Processing & Management
Volume
51
Issue
5
Page
737 - 755
File
1-s2.0-S0306457315000825-main.pdf (3.6M) 44회 다운로드 DATE : 2023-11-09 22:13:47

Park, S.R., Lee, W.S., & Moon, I.C. (2015). Associative Topic Models with Numerical Time Series. Information Processing and Management, 51(5), 737–755 


Abstract

A series of events generates multiple types of time series data, such as numeric and text data over time, and the variations of the data types capture the events from different angles. This paper aims to integrate the analyses on such numerical and text time-series data influenced by common events with a single model to better understand the events. Specifically, we present a topic model, called an associative topic model (ATM), which finds the soft cluster of time-series text data guided by time-series numerical value. The identified clusters are represented as word distributions per clusters, and these word distributions indicate what the corresponding events were. We applied ATM to financial indexes and president approval rates. First, ATM identifies topics associated with the characteristics of time-series data from the multiple types of data. Second, ATM predicts numerical time-series data with a higher level of accuracy than does the iterative model, which is supported by lower mean squared errors.


@article{PARK2015737, 

title = {Associative topic models with numerical time series}, 

journal = {Information Processing & Management}, 

volume = {51}, 

number = {5},

pages = {737-755}, 

year = {2015}, 

issn = {0306-4573}, 

doi = {https://doi.org/10.1016/j.ipm.2015.06.007}, 

url = {https://www.sciencedirect.com/science/article/pii/S0306457315000825}, 

author = {Sungrae Park and Wonsung Lee and Il-Chul Moon}, 

keywords = {Time series analysis, Topic models, Text mining}, 

abstract = {A series of events generates multiple types of time series data, such as numeric and text data over time, and the variations of the data types capture the events from different angles. This paper aims to integrate the analyses on such numerical and text time-series data influenced by common events with a single model to better understand the events. Specifically, we present a topic model, called an associative topic model (ATM), which finds the soft cluster of time-series text data guided by time-series numerical value. The identified clusters are represented as word distributions per clusters, and these word distributions indicate what the corresponding events were. We applied ATM to financial indexes and president approval rates. First, ATM identifies topics associated with the characteristics of time-series data from the multiple types of data. Second, ATM predicts numerical time-series data with a higher level of accuracy than does the iterative model, which is supported by lower mean squared errors.} 

}


Source Website:

http://www.sciencedirect.com/science/article/pii/S0306457315000825