Start Date

16-8-2018 12:00 AM

Description

A great deal of research has been done on the prediction of stock prices, using various techniques to analyze data from several sources. Among the many papers in this area, Schumaker and Chen (2009) use a predictive machine learning approach to analyze financial news articles to predict stock prices. Kim et al. (2014) analyze Korean Language news articles to predict rises and falls of the KOSPI (Korean Composite Stock Price Index). Finally, Bollen and Mao (2011) and Mittal and Goel (2012) analyze collective mood states derived from large-scale Twitter feeds to see whether these are correlated with the value of the Dow Jones Industrial Average (DJIA) over time. \ \ Among the many factors that may cause stock prices to change over time, we are interested in finding out whether or not customers’ opinions about the products they purchased from those companies can be used to predict the companies’ stock performance. For example, if certain products are very successful and customers like them a lot, do their comments help to predict whether the stock prices rise for the companies that produce those products? We use sentiment analysis techniques, consisting of several machine learning algorithms, such as linear regressions, Naïve Bayes, support vector machines (SVM), and neural networks, to analyze a targeted portion of Amazon’s product reviews. This corpus consists of 34,686,770 reviews between June 1995 and March 2013, available at the Stanford Large Network Dataset Collection (SNAP) web site. A company’s stock performance is predicted by measuring customer opinions about its products, whether they are satisfied with those products or not. The satisfaction values returned by the algorithms are tested to see whether they can be used to predict the companies’ stock performance by computing p-values using time series techniques, in particular, Granger Causality tests. \ \ References \ \ Bollen, J. and Mao, H. 2011. Twitter mood as a stock market predictor. IEEE Computer, 44(10):91–94, 2011. \ \ Kim, Y., Jeong, S.R., and Ghani, I. 2014. Text Opinion Mining to Analyze News for Stock Market Prediction , Int. J. Advance. Soft Comput. Appl., Vol. 6, No. 1, 1-12. \ \ Mittal, A. and Goel, A. 2012. Stock prediction using twitter sentiment analysis. Stanford University Working Paper. \ \ Schumaker, R. P. and Chen, H. 2009. “Textual Analysis of Stock Market Prediction Using Breaking Financial News: The AZFinText System,” ACM Transactions on Information Systems, Vol.27, No.2, Article 12. \ \

Share

COinS
 
Aug 16th, 12:00 AM

Big Data Sentiment Analysis of Customers' Reviews, and the Prediction of Companies' Stock Performance

A great deal of research has been done on the prediction of stock prices, using various techniques to analyze data from several sources. Among the many papers in this area, Schumaker and Chen (2009) use a predictive machine learning approach to analyze financial news articles to predict stock prices. Kim et al. (2014) analyze Korean Language news articles to predict rises and falls of the KOSPI (Korean Composite Stock Price Index). Finally, Bollen and Mao (2011) and Mittal and Goel (2012) analyze collective mood states derived from large-scale Twitter feeds to see whether these are correlated with the value of the Dow Jones Industrial Average (DJIA) over time. \ \ Among the many factors that may cause stock prices to change over time, we are interested in finding out whether or not customers’ opinions about the products they purchased from those companies can be used to predict the companies’ stock performance. For example, if certain products are very successful and customers like them a lot, do their comments help to predict whether the stock prices rise for the companies that produce those products? We use sentiment analysis techniques, consisting of several machine learning algorithms, such as linear regressions, Naïve Bayes, support vector machines (SVM), and neural networks, to analyze a targeted portion of Amazon’s product reviews. This corpus consists of 34,686,770 reviews between June 1995 and March 2013, available at the Stanford Large Network Dataset Collection (SNAP) web site. A company’s stock performance is predicted by measuring customer opinions about its products, whether they are satisfied with those products or not. The satisfaction values returned by the algorithms are tested to see whether they can be used to predict the companies’ stock performance by computing p-values using time series techniques, in particular, Granger Causality tests. \ \ References \ \ Bollen, J. and Mao, H. 2011. Twitter mood as a stock market predictor. IEEE Computer, 44(10):91–94, 2011. \ \ Kim, Y., Jeong, S.R., and Ghani, I. 2014. Text Opinion Mining to Analyze News for Stock Market Prediction , Int. J. Advance. Soft Comput. Appl., Vol. 6, No. 1, 1-12. \ \ Mittal, A. and Goel, A. 2012. Stock prediction using twitter sentiment analysis. Stanford University Working Paper. \ \ Schumaker, R. P. and Chen, H. 2009. “Textual Analysis of Stock Market Prediction Using Breaking Financial News: The AZFinText System,” ACM Transactions on Information Systems, Vol.27, No.2, Article 12. \ \