Both challenges are due by noon via Sakai on Wednesday May 2. For the first tidy challenge, you’ll want to refer back to our slides. For the second challenge, you’ll want to refer to the reference lab.
Tidy challenge: Tidy the “Lord of the Rings” data (Advice: read in each csv file using read_csv()
, then use dplyr::bind_rows
to stack them together pre-tidying.)
male
and female
are number of words spokenGapminder challenge: Read on…
Install and load the gapminder
data package. This is the same data that was used in your Datacamp “Introduction to the Tidyverse” course!
install.packages("gapminder")
library(gapminder)
?gapminder
Pick at least two of the tasks below from the task menu and approach each with a table and figure.
dplyr
should be your main data manipulation toolggplot2
should be your main visualization toolMake observations about what your tables/figures show and about the process. If you want to do something comparable but different, i.e. swap one quantitative variable for another- go for it!
You do not have to use tidyr
or otherwise worry about reshaping your tables. Many of your tables may not be formatted perfectly in the report. Simply printing dplyr
tabular output is fine. For all things, graphical and tabular, if you’re dissatisfied with a result, discuss the problem, what you tried to do to fix it, and move on.
For each table, make sure to include a relevant figure. One tip for starting is to draw out on paper what you want your x- and y-axis to be first and what your geom
is; that is, start by drawing the plot you want ggplot
to give you. Your figure does not have to depict every single number present in the table. Use your judgement. It just needs to complement the table, add context, and allow for some sanity checking.
Notice which figures are easy/hard to make, and whether the visualization adds clarity, detracts from, or is completely redundant (and therefore probably unnecessary) with respect to the tabular display.
You’re encouraged to reflect on what was hard/easy, problems you solved, helpful tutorials you read, etc.
Gapminder EDA ideas from Jenny Bryan, author and creator of the Gapminder package.