Web scraping to build social research data

Introduction

There is an unprecedented amount of information on the internet that could usefully be harvested in order to build social science research datasets.

This one-day online course will showcase suitable techniques for web scraping.

The value, logic and process of capturing data stored on websites will be described in detail, and practical examples and exercises will be demonstrated using the Python programming language.

It is most suited to empirical social science researchers.

Course materials

This repository houses the materials underpinning a one-day NCRM course on web scraping run by Dr Diarmuid McDonnell, University of the West of Scotland. The course was first run on 2021-05-17.

Programme

The course programme can be viewed here.

Materials

The training materials can be found in the following folders:

code - Jupyter Notebooks containing executable Python code for the web scraping lessons.
presentations - PDF versions of the course lectures.
reading - lists of interesting and relevant web scraping online articles.

Acknowledgements

I am grateful to the National Centre for Research Methods (NCRM) for funding this course and an associated set of online learning resources.

Further information

Please do not hesitate to get in contact if you have queries, criticisms or ideas regarding these materials: Dr Diarmuid McDonnell

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Web scraping to build social research data

Introduction

Course materials

Programme

Materials

Acknowledgements

Further information

Files

README.md

Latest commit

History

README.md

File metadata and controls

Web scraping to build social research data

Introduction

Course materials

Programme

Materials

Acknowledgements

Further information