Abstract

Despite the importance of data quality in business analytics, most degree programs do not offer a course on the topic. Where and how much to cover it is usually left to the discretion of individual instructors. In this case study, we describe a course on data quality in a Master of Science in Business Analytics program. The course is organized in eight modules. The first part covers data preparation and preprocessing, which equips students to tackle real datasets in other analytics courses. The second part covers analytics for data quality, including algorithms for detecting and resolving data quality issues. The third part addresses large-scale and engineering issues of analytics practice, where data collection needs to be managed and data quality tasks must be part of the pipeline.
