Introduction to Web Scraping with Python

Loading...
Thumbnail Image
Can’t use the file because of accessibility barriers? Contact us with the title of the item, permanent link, and specifics of your accommodation need.

Date

2016-02-05

Journal Title

Journal ISSN

Volume Title

Publisher

Indiana University Workshop in Methods

Abstract

Web scraping is a method of extracting and restructuring information from web pages. This workshop will introduce basic techniques for web scraping using popular open-source tools. The first part of the workshop will provide an overview of basic HTML elements and Python tools for developing a custom web scraper. The second part will enable participants to practice accessing websites, parsing information, and storing data in a CSV file. This workshop is intended for social scientists who are new to web scraping. No programming experience is required, but basic familiarity with HTML and Python is helpful.

Description

NaLette Brodnax is a data scientist and fourth-year doctoral student in the Joint Public Policy program administered by the School of Public and Environmental Affairs and the Department of Political Science at Indiana University. Her research interests include education policy, policy analysis and program evaluation, and quantitative research methodology. As a graduate assistant for the Center of Excellence for Women in Technology, she is working on a number of projects intended to expose women to technology and to support women using technology in their studies and careers. Prior to entering the doctoral program, NaLette spent nine years in corporate finance roles, managing large data sets and developing financial models for large companies such as Abbott Laboratories and Nokia. She holds a BSBA from The Ohio State University with a concentration in Finance and a Master's in Public Policy from Loyola University Chicago.

Keywords

web scraping, Python, Workshop in Methods

Citation

Journal

DOI

Link(s) to data and video for this item

Click on the PURL link below in the "Link(s) to data and video for this item" section to play this video.

Rights

© NaLette Brodnax

Type

Presentation