Start Date

10-12-2017 12:00 AM

Description

Companies pay high prices for detailed customer information (e.g., income, household type) to gain insights and run targeted marketing campaigns. We argue that companies can use predictive analytics artifacts to derive such information from existing customer data in combination with freely available data sources, such as open government data. In this study, we use a machine learning artifact, trained on data from 7,504 energy customers, for a specific yet highly relevant case from the utility industry and investigate two important aspects of predictive business analytics. First, we identified the sparsely available open government statistics and found that even this limited amount of open data can increase our artifact’s performance. Second, we applied the predictive models, trained on a regional customer dataset, to households in other geographic regions with an acceptable loss in performance. The results support the development of systems that aid managerial decision-making and predictive marketing, and they showcase the value of open data.


Predictive Customer Data Analytics – The Value of Public Statistical Data and the Geographic Model Transferability
