Introduction to Web Scraping with Python

dc.contributor.author: Brodnax, NaLette
dc.date.accessioned: 2016-02-05T22:46:10Z
dc.date.available: 2016-02-05T22:46:10Z
dc.date.issued: 2016-02-05
dc.description: NaLette Brodnax is a data scientist and fourth-year doctoral student in the Joint Public Policy program administered by the School of Public and Environmental Affairs and the Department of Political Science at Indiana University. Her research interests include education policy, policy analysis and program evaluation, and quantitative research methodology. As a graduate assistant for the Center of Excellence for Women in Technology, she is working on a number of projects intended to expose women to technology and to support women using technology in their studies and careers. Prior to entering the doctoral program, NaLette spent nine years in corporate finance roles, managing large data sets and developing financial models for large companies such as Abbott Laboratories and Nokia. She holds a BSBA from The Ohio State University with a concentration in Finance and a Master's in Public Policy from Loyola University Chicago.
dc.description.abstract: Web scraping is a method of extracting and restructuring information from web pages. This workshop will introduce basic techniques for web scraping using popular open-source tools. The first part of the workshop will provide an overview of basic HTML elements and Python tools for developing a custom web scraper. The second part will enable participants to practice accessing websites, parsing information, and storing data in a CSV file. This workshop is intended for social scientists who are new to web scraping. No programming experience is required, but basic familiarity with HTML and Python is helpful.
dc.identifier.uri: https://hdl.handle.net/2022/20635
dc.language.iso: en_US
dc.publisher: Indiana University Workshop in Methods
dc.relation.isversionof: Click on the PURL link below in the "Link(s) to data and video for this item" section to play this video.
dc.relation.uri: http://purl.dlib.indiana.edu/iudl/media/257979w09h
dc.rights: © NaLette Brodnax
dc.subject: web scraping
dc.subject: Python
dc.subject: Workshop in Methods
dc.title: Introduction to Web Scraping with Python
dc.type: Presentation
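
The abstract describes a three-step workflow: access a page, parse out the information of interest, and store it in a CSV file. The following is a minimal sketch of that workflow, not the workshop's own code (the handouts listed under Files contain that). It assumes only the Python standard library; the sample HTML, the TableParser class, and the scrape_table and rows_to_csv helpers are hypothetical names introduced for illustration. To keep the example runnable offline, the "access" step is replaced by an inline HTML string.

```python
import csv
import io
from html.parser import HTMLParser

# Hypothetical sample page standing in for a downloaded web page.
# A real scraper would fetch the HTML first, for example with
# urllib.request.urlopen(url).read().decode("utf-8").
SAMPLE_HTML = """
<html><body>
  <table>
    <tr><th>Name</th><th>Score</th></tr>
    <tr><td>Ada</td><td>90</td></tr>
    <tr><td>Grace</td><td>85</td></tr>
  </table>
</body></html>
"""


class TableParser(HTMLParser):
    """Collect the text of every <th>/<td> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []      # completed rows, each a list of cell strings
        self._row = None    # cells of the row being read, or None
        self._cell = None   # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that appears inside a table cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def scrape_table(html):
    """Parse step: return the table rows found in an HTML string."""
    parser = TableParser()
    parser.feed(html)
    parser.close()
    return parser.rows


def rows_to_csv(rows):
    """Store step: serialize rows as CSV text."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    print(rows_to_csv(scrape_table(SAMPLE_HTML)), end="")
```

To write a file instead of a string, the same csv.writer call can be pointed at a file opened with open("output.csv", "w", newline=""), where output.csv is a placeholder name.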

Files

Original bundle

Name: 2016-02-05_wim_brodnax_python_slides.pdf
Size: 2.67 MB
Format: Adobe Portable Document Format
Description: Presentation slides

Name: 2016-02-05_wim_brodnax_python_flyer.pdf
Size: 179.4 KB
Format: Adobe Portable Document Format
Description: Event flyer

Name: 2016-02-05_wim_brodnax_python_handout_hello_py.pdf
Size: 27.66 KB
Format: Adobe Portable Document Format
Description: Handout: hello.py code

Name: 2016-02-05_wim_brodnax_python_handout_wim_web_scraper_py.pdf
Size: 66.78 KB
Format: Adobe Portable Document Format
Description: Handout: wim_web_scraper.py code