Text Mining in Python for Social Scientists
| dc.contributor.author | Marahrens, Helge-Johannes | |
| dc.date.accessioned | 2020-11-16T15:05:15Z | |
| dc.date.available | 2020-11-16T15:05:15Z | |
| dc.date.issued | 2020-10-23 | |
| dc.description | Helge-Johannes Marahrens is a doctoral student in the department of Sociology at Indiana University. He recently earned an MS in Applied Statistics and is currently working toward a PhD in Sociology. His research interests include cultural consumption, stratification, and computational social science with a particular focus on Natural Language Processing (NLP). | |
| dc.description.abstract | Textual data are central to the social sciences. However, they often require several pre-processing steps before they can be utilized for statistical analyses. This workshop introduces a range of Python tools to clean, organize, and analyze textual data. It is intended for researchers who are new to working with textual data, but are familiar with Python or have completed the Introduction to Python workshop. Python is best learned hands-on. We therefore strongly encourage that users install Python 3 (https://www.python.org/downloads/) and the packages listed below. Helge will be on Zoom at 1pm to help participants with installation issues. Of course, anyone is welcome to join and watch without coding themselves. Python packages: nltk, fuzzywuzzy, re, glob, sklearn, pandas, numpy, matplotlib | |
| dc.identifier.uri | https://hdl.handle.net/2022/25949 | |
| dc.language.iso | en | |
| dc.publisher | Indiana University Workshop in Methods | |
| dc.relation.uri | https://iu.mediaspace.kaltura.com/media/1_7f6mxubf | |
| dc.rights | This work may be protected by copyright unless otherwise stated. | |
| dc.title | Text Mining in Python for Social Scientists | |
| dc.type | Presentation |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- 2020-10-23_wim_marahrens_text-mining_files.zip
- Size:
- 43.24 KB
- Format:
- Unknown data format
- Description:
- Hands-on exercise files
Collections
Can’t use the file because of accessibility barriers? Contact us