Track 6: Current Topics in IS Development

Towards a Conceptual Framework for the Specification of Reproducible and Replicable Data Analysis Projects

Roberto Rodriguez-Echeverria, Universidad de ExtremaduraFollow
Jose Maria Conejero, Universidad de ExtremaduraFollow
Francisco Javier Melchor González, Universidad de ExtremaduraFollow
Juan Diego Gutiérrez Gallardo, Universidad de ExtremaduraFollow
Álvaro E. Prieto, Universidad de ExtremaduraFollow

Abstract

It is becoming increasingly common to exploit the data collected by Information Systems in order to carry out an analysis of them and obtain conclusions that give rise to a series of decisions in the different research fields. The fact that in most cases these conclusions cannot be properly backed up has given rise to a reproducibility crisis in Data Science, the discipline that makes it possible to convert such data into knowledge, and it research fields that apply it. In this paper we envision a conceptual framework to foster reproducible and replicable Data Science projects. The framework proposes the definition of systematic pipelines that may be (semi)automatically executed in terms of concrete implementation platforms. Proprietary or third party tools are also considered so that flexibility may be ensured without hindering reproducibility and replicability.

Recommended Citation

Rodriguez-Echeverria, R., Conejero, J. M., Melchor González, F. J., Gutiérrez Gallardo, J. D., & Prieto, A. E. (2021). Towards a Conceptual Framework for the Specification of Reproducible and Replicable Data Analysis Projects. In E. Insfran, F. González, S. Abrahão, M. Fernández, C. Barry, H. Linger, M. Lang, & C. Schneider (Eds.), Information Systems Development: Crossing Boundaries between Development and Operations (DevOps) in Information Systems (ISD2021 Proceedings). Valencia, Spain: Universitat Politècnica de València.

Paper Type

Short Paper

Download

COinS

Towards a Conceptual Framework for the Specification of Reproducible and Replicable Data Analysis Projects

Track 6: Current Topics in IS Development

Towards a Conceptual Framework for the Specification of Reproducible and Replicable Data Analysis Projects

Abstract

Recommended Citation

Paper Type

Search

Browse

Author Corner

Links

Track 6: Current Topics in IS Development

Towards a Conceptual Framework for the Specification of Reproducible and Replicable Data Analysis Projects

Presenter Information

Abstract

Recommended Citation

Paper Type

Share

Search

Browse

Author Corner

Links