Sometimes you dream big, but you just don’t have the data…
I had this plan for text analysis by character from the Parks & Recreation series to celebrate Galentine’s Day but getting the data was a struggle. Subtitles are available for pretty much every episode but those don’t contain data about the character who said the lines. I needed scripts, which are a bit harder to come by. I found 6 scripts of episodes on the web from the first 3 seasons in pdf format that were usable.
Continue reading