Journal of the Midwest Association for Information Systems (JMWAIS)

Web Scraping in the R Language: A Tutorial

Abstract

Information Systems researchers can now more easily access vast amounts of data on the World Wide Web to answer both familiar and new questions with more rigor, precision, and timeliness. The main goal of this tutorial is to explain how Information Systems researchers can automatically “scrape” data from the web using the R programming language. The article first provides a conceptual overview of the web scraping process. It then discusses two R packages useful for web scraping, “rvest” and “xml2”, and provides simple examples of web scraping with these two packages. The tutorial concludes with an example of a complex web scraping task: retrieving data from Bayt.com, a leading employment website in the Middle East.

Recommended Citation

Krotov, Vlad and Tennyson, Matthew (2021) "Web Scraping in the R Language: A Tutorial," Journal of the Midwest Association for Information Systems (JMWAIS): Vol. 2021: Iss. 1, Article 5.
DOI: 10.17705/3jmwa.000066
Available at: https://aisel.aisnet.org/jmwais/vol2021/iss1/5
