Text Analysis

(Lab 10):

Goals

  • Explore ways to visualize Text
  • Explore ways to visualize metadata about a text

We have a few questions:

  1. Have there been any trends in the length, lexical density, or significance of State of the Union Addresses?
  2. Did George W. bush or Barack Obama have an overall different tone to their addresses?

Getting Started

There are really 2 datasets here: the metrics and the first 500 words of each address. These are on 2 sheets. The metrics give an overall, distant description of the datasets. All of these metrics could reasonably be created in Excel or a text editor. 

We merged two datasets and created a line chart, stacked bar charts, and Word Clouds. We depict the analysis as a story.