Abstract

This paper presents an end-to-end approach to processing XML files in the Hadoop ecosystem. It demonstrates how to handle the problems faced when analyzing large amounts of XML files. The paper describes a complete Extract, Load, and Transform (ELT) cycle built on Apache Hadoop, the open source software stack that has become a standard for processing huge amounts of data. The work shows that applying open source solutions to a particular set of problems may not be enough: most open source big data processing tools were implemented to address only a limited number of use cases. The paper explains why specific use cases may require significant extension with multiple self-developed software components. The use case described here deals with huge amounts of semi-structured XML files that must be persisted and processed daily.
