and build the foundation for a successful scripting career.
What is statistical programming?
Broadly speaking:
Computer programming is more focused on software development.
Statistical programming is more focused on data analysis and the communication of the results.
In this course we focus more on the HOW of doing data analysis in R. It is not primarily a course in statistics. This course serves as a preparation for e.g. the Summerschool Utrecht course “Data Analysis”.
R, RStudio, R Markdown, and RStudio Projects
R is a language and environment for statistical computing and for graphics
GNU project (100% free software)
Managed by the R Foundation for Statistical Computing, Vienna, Austria.
Community-driven
Based on the object-oriented language S (1975)
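As a small illustration of what statistical computing and graphics look like in R, the sketch below computes a summary statistic, fits a simple linear model, and saves a scatter plot. The built-in mtcars data set and the variable names are arbitrary choices made for this example, not part of the course material.

```r
# Minimal sketch using the built-in mtcars data (chosen only for illustration)
avg_mpg <- mean(mtcars$mpg)          # average fuel efficiency in miles per gallon
fit <- lm(mpg ~ wt, data = mtcars)   # simple linear regression of mpg on weight

# Save a scatter plot with the fitted regression line to a temporary PDF file
plot_file <- tempfile(fileext = ".pdf")
pdf(plot_file)
plot(mpg ~ wt, data = mtcars,
     xlab = "Weight (1000 lbs)", ylab = "Miles per gallon")
abline(fit)
dev.off()

print(avg_mpg)
print(coef(fit))
```

In an interactive RStudio session you would usually leave out the pdf() and dev.off() calls, so that the plot appears in the Plots pane instead of a file.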
RStudio
Integrated Development Environment (IDE)
Source figure: R for Data Science
R Markdown
R Markdown provides an authoring framework for data analysis. You can use a single R Markdown file to both save and execute code and generate high-quality reports that can be shared with an audience.
R Markdown documents are fully reproducible and can generate many output formats, such as HTML, PDF, and Word documents.
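To make the idea of a single file that holds both text and code concrete, here is a minimal sketch that writes a tiny R Markdown file from R and renders it when the rmarkdown package and pandoc are available. The file name example.Rmd and its contents are hypothetical. The code fence is built with strrep() only to keep this example self-contained. In practice you would create the file via File -> New File -> R Markdown in RStudio.

```r
# Minimal sketch: create a small R Markdown file (hypothetical name and content)
fence <- strrep("`", 3)  # the three backticks that open and close a code chunk

rmd_lines <- c(
  "---",
  "title: \"My first R Markdown document\"",
  "output: html_document",
  "---",
  "",
  "Some text that explains the analysis.",
  "",
  paste0(fence, "{r}"),   # start of an R code chunk
  "summary(cars)",        # code that is executed when the document is rendered
  fence                   # end of the code chunk
)

rmd_file <- file.path(tempdir(), "example.Rmd")
writeLines(rmd_lines, rmd_file)

# Render to HTML only if the rmarkdown package and pandoc are installed
if (requireNamespace("rmarkdown", quietly = TRUE) &&
    rmarkdown::pandoc_available()) {
  out_file <- rmarkdown::render(rmd_file, quiet = TRUE)
  print(out_file)
}
```

Inside RStudio, the Knit button performs the rendering step for the file that is currently open.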
RStudio Projects
Every time you start a new (data analysis) project, make it a habit to create a new RStudio Project.
Because you want your project to work:
RStudio Projects create a convention that guarantees that the project can be moved around on your computer or onto other computers and will still “just work”.
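One reason a project keeps working after it is moved is that file paths can be written relative to the project folder. The sketch below imitates this with a temporary folder. The folder name my_project, the data subfolder, and the file name are hypothetical, and setwd() only stands in for what RStudio does when you open a project.

```r
# Minimal sketch: a stand-in project folder (hypothetical name "my_project")
project_dir <- file.path(tempdir(), "my_project")
dir.create(file.path(project_dir, "data"), recursive = TRUE, showWarnings = FALSE)

# Opening an .Rproj file makes the project folder the working directory;
# setwd() imitates that here and returns the previous working directory.
old_wd <- setwd(project_dir)

# Relative paths are resolved against the project folder, wherever it lives
write.csv(head(cars), file.path("data", "cars_head.csv"), row.names = FALSE)
cars_head <- read.csv(file.path("data", "cars_head.csv"))
print(cars_head)

setwd(old_wd)  # restore the original working directory
```

Because the code never mentions an absolute path such as a user's home folder, the same lines would run unchanged after copying the project folder to another computer.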
RStudio project
Every time you want to work on this project, open the project by clicking the .Rproj file.
Time for your first practical in R!
An R Markdown file with the exercises and empty code chunks where you can type your answers.
Open RStudio and choose File -> New Project. Choose the option “Existing Directory”. Find the Project Working Directory, the folder “RSummer2023” you just created. Click on “Create project”.
Aim to complete the exercises without looking at the answers, and use the solutions file to evaluate your work.
But do not ‘struggle’ for too long, because you will run out of time.
In any case, ask for help when you feel help is needed.