What follows is an analysis of Internet news headlines. To create the dataset, a script ran daily to capture the headlines from the news sites that follow:
NBC and CNN headlines come from RSS feeds. The rest are taken from each site's home page.
This analysis currently includes headlines from July 5 to August 2, 2021. The exception is OAN, which was brought into the analysis on July 10.
This analysis looks at the overall compound sentiment for the period of review. Sentiment values range from -1 (negative sentiment) to 1 (positive sentiment) with 0 as neutral sentiment.
The table and graph below order the sources from most to least positive sentiment.
    source_name       compound
0   Reuters           0.012618
1   CNN              -0.007112
2   Washington Post  -0.017253
3   NPR              -0.031112
4   Fox              -0.043821
5   CBS              -0.050776
6   OAN              -0.051326
7   New York Post    -0.067105
8   AP               -0.076693
9   ABC              -0.090861
10  Breitbart        -0.092640
11  NBC              -0.120787
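A ranking like the one above can be produced by averaging the per-headline compound scores for each source and sorting the result. The sketch below is only an illustration of that idea, not the script used for this analysis: the function name `mean_compound_by_source`, the row layout, and the sample scores are all assumptions made up for the example.

```python
from collections import defaultdict
from statistics import mean


def mean_compound_by_source(rows):
    """Average per-headline compound scores per source, most positive first.

    `rows` is an iterable of (source_name, compound) pairs. This layout is
    an assumption for illustration, not the analysis's actual data format.
    """
    scores = defaultdict(list)
    for source_name, compound in rows:
        scores[source_name].append(compound)
    averages = {name: mean(values) for name, values in scores.items()}
    # Sort by average score, highest (most positive) first.
    return sorted(averages.items(), key=lambda item: item[1], reverse=True)


# Made-up scores for hypothetical sites; not taken from the dataset.
sample_rows = [
    ("Site A", 0.5), ("Site A", -0.25),
    ("Site B", -0.5), ("Site B", 0.0),
    ("Site C", 0.25),
]
ranking = mean_compound_by_source(sample_rows)
for source_name, compound in ranking:
    print(f"{source_name:<10} {compound: .6f}")
```

Averaging hides how widely each site's scores are spread, which is why a ranking like this is best read alongside a view of the full distribution.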
The boxplots that follow show compound sentiment over the same period. They give more nuance about the range of values than an average alone provides.
For example, most of NPR's values fall in a tight range, with many outliers. This is in contrast to Fox's wide range of values. CNN's relatively even distribution is a point of curiosity as well.
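To make "tight range with many outliers" concrete, the numbers behind a boxplot can be computed directly. The sketch below assumes the common convention that points more than 1.5 times the interquartile range beyond the quartiles count as outliers. The plotting settings used for this analysis are not stated, so treat the `whisker` value, the function name `boxplot_summary`, and the sample scores as assumptions.

```python
from statistics import quantiles


def boxplot_summary(values, whisker=1.5):
    """Return the quartiles, fences, and outliers a boxplot would show.

    Assumes the conventional 1.5 * IQR rule for outliers; the actual
    plots in this analysis may use a different setting.
    """
    q1, median, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence = q1 - whisker * iqr
    high_fence = q3 + whisker * iqr
    outliers = [v for v in values if v < low_fence or v > high_fence]
    return {
        "q1": q1,
        "median": median,
        "q3": q3,
        "low_fence": low_fence,
        "high_fence": high_fence,
        "outliers": outliers,
    }


# Made-up scores: a tight cluster near zero plus one strongly positive headline.
sample_scores = [-0.1, -0.05, 0.0, 0.0, 0.05, 0.1, 0.9]
summary = boxplot_summary(sample_scores)
print(summary["median"], summary["outliers"])
```

A site with a small interquartile range has narrow fences, so even moderately emotional headlines show up as outliers. A site with a wide range absorbs the same headlines inside its box and whiskers.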
The Compound Sentiment by Date graph shows how headline sentiment trends over time on each site or feed. It also includes an Overall average for comparison. Select or deselect sites to show or hide them in the graph below.
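The series behind a by-date graph like this can be sketched as a daily mean per source plus an overall daily mean. How the Overall line is actually computed is not stated here. The sketch assumes it is the mean across all headlines captured that day, not a mean of the site averages. The function name `daily_sentiment`, the row layout, and the sample values are hypothetical.

```python
from collections import defaultdict
from statistics import mean


def daily_sentiment(rows):
    """Map (date, source_name) to that day's mean compound score.

    Also adds a (date, "Overall") entry per day, assumed here to be the
    mean over every headline captured on that date.
    `rows` is an iterable of (date, source_name, compound) tuples.
    """
    per_source = defaultdict(list)
    per_day = defaultdict(list)
    for date, source_name, compound in rows:
        per_source[(date, source_name)].append(compound)
        per_day[date].append(compound)
    trend = {key: mean(values) for key, values in per_source.items()}
    for date, values in per_day.items():
        trend[(date, "Overall")] = mean(values)
    return trend


# Made-up rows for two hypothetical sites over two days.
sample_rows = [
    ("2021-07-05", "Site A", 0.5),
    ("2021-07-05", "Site A", -0.5),
    ("2021-07-05", "Site B", 0.25),
    ("2021-07-05", "Site B", 0.75),
    ("2021-07-06", "Site A", 0.25),
    ("2021-07-06", "Site B", -0.75),
]
trend = daily_sentiment(sample_rows)
for (date, source_name), compound in sorted(trend.items()):
    print(date, source_name, compound)
```

Under this assumption, sites that publish more headlines carry more weight in the Overall line. Averaging the per-site means instead would weight every site equally.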
Points of interest include: