Polilnformatics Text as Data Workshop
Wednesday, September 2, 9:00 a.m.-5:00 p.m.
Hilton Union Square 16
What information can we get from text? How do we get it? What are the different options for analyzing words as data? How do we know whether a method is doing a good job of capturing what’s in a document? This workshop introduces computational approaches to collecting, organizing, and analyzing text as data. Computational methods can assist the process of forming impressions and formulating hypotheses where lots of data is involved, and in applying classification schemes at scale. Python is the go to language for large scale text processing but available R packages are also growing. We will learn the basics of the Python programming language and apply some leading text analytic methods. By the end of the day you will have a good sense of the possibilities and some working examples of code to build on should you choose to do so. Prerequisites: This course is primarily intended for graduate students interested who have some experience with a programming language (e.g. STATA, R) but little experience with text as data. Participants will need to register in order to receive preliminary instructions about software to be installed on their device prior to the workshop.