There are a number of other great programming tools out there that can also be used to improve the reproducibility of your analysis
The key is to use some type of language that will allow you to automate and document your analysis
Once you master one language you'll probably find it easier to learn another
You could just type into the command prompt…
… but that doesn't help much with documentation
… but that doesn't help much with automation
With RStudio you can combine your programming and your documentation
Packages are the fundamental units of reproducible R code. They include reusable R functions, the documentation that describes how to use them, and (often) sample data. (From: http://r-pkgs.had.co.nz)
We will use the ggplot2
package for plots and dplyr
for data wrangling in this session.
If you have not yet done so, install these packages by running the following in the Console:
install.packages("ggplot2") install.packages("dplyr")
NOT about understanding all the R commands, but rather getting the big picture of how using R in this way facilitates reproducible analyses
Open intro-template.Rmd
Click on Knit HTML to compile the document
Great news!? You just received some more data, in bits and pieces of course:
gapminder-7080.csv
gapminder-90plus.csv
Add the data from 1970s and 1980s and 1990s plus, and recreate the same plots as before.