set.seed(2024)
# randomly selects a (real) number between -10 and 20
<- runif(1, min = -10, max=20)
x # randomly selects a (integer) number between 1 and 6
<- sample(1:6, size = 1)
y print(c(x,y))
[1] 15.10828 5.00000
Creating your first reproducible document
Dr. Irene Vrbik
University of British Columbia Okanagan
April 4, 2025
Download and install the latest release of R | 🔗 Download R |
Download and install the latest release of RStudio | 🔗 Download RStudio |
Download and install the latest release of Quarto | 🔗 Download Quarto |
Reproducibility here refers to the ability to:
Benefits of reproducible documentation:
In STAT 205 I will be expecting that your assignments be created in Quarto and that you submit a rendered HTML file to Canvas (see my example on Canvas).
In an example below, there is a code chunk that relies on random numbers. You will notice that every time you render the HTML document, a new set of random numbers will appear. If you would like your document to be reproducible1 you will need to set a seed. A “seed” refers to a starting point for the random number generator (RNG). The same seed will always produce the same sequence of random numbers, making your results predictable and reproducible. Setting a seed guarantees that the random numbers generated in your document will be the same every time someone runs it. The function for doing this in R is set.seed()
.
set.seed(2024)
# randomly selects a (real) number between -10 and 20
x <- runif(1, min = -10, max=20)
# randomly selects a (integer) number between 1 and 6
y <- sample(1:6, size = 1)
print(c(x,y))
[1] 15.10828 5.00000
The seed we used above is the current year, 2024. Picking the seed is somewhat arbitrary (I often just pound the keyboard randomly or sample a number from 1 to a million2 but you should try to avoid reusing the same seed over and over. Now every time we render the HTML document or someone else runs the same code, x
will be 15.1082762 and y
will be 5. So, if you copy and paste this code into your quarto document, you should get the identical numbers. Note that changing the seed value will result in a different pair of numbers (but we should still get the same ones each time).
There are a number tools that support quarto (e.g. VS code, Jupyter), but we will be using RStudio.
Navigating RStudio involves moving between different panes, tabs, and tools within the IDE. Here are the key elements and actions to help you navigate RStudio:
RStudio is divided into several panels, including the Source Editor, Console, Environment, and Files. You can navigate between these panels by clicking on the corresponding tabs located at the top-left corner of the IDE.
RStudio uses tabs for open documents and other views. You can switch between open files and views by clicking on the corresponding tabs at the top of the Source Editor or other panes. Note: you can change the appearance and layout of your panels in the RStudio Preferences.
Source Editor:
Console:
Environment and History:
File, Plots, Help and Viewer: (tabbed)
The Files pane, allows you to navigate through your project files and directories. You can click on files to open them in the Source Editor or use the Files pane to manage your project files.
The Plots pane displays plots generated by your R code.
The Viewer pane is used for displaying rendered content (default is an HMTL file).
Quarto (qmd
files) are a versatile document format that allows you to integrate narrative text, R code, and the results of code execution into a single document. In Rstudio navigate to File > New File > Quarto Document. Give your document a title select and you will get a newly create Quarto document Untitled.qmd
that looks something like this:
This is called a YAML (which stands for “yet another markup language”). Quarto allows for different format types (e.g. HTML, PDF, or word document). For this course we will be sticking with HTML. The example belown demonstrates how you can edit the YAML to include a table of contents (toc
) YAML below highlights a few of the options available for HTML output.
---
title: "My document"
subtitle: "Demo for STAT 205"
date: today
format:
html:
toc: true
html-math-method: mathjax
---
A full list of HTML format options can be found at the HTML Options page on the quarto website.
By default this is will open in Visual editor mode. This is a user-friend visual interface that will feel similar to writing in something like Word or Google docs in that we have options clickable buttons Bold (or keyboard shortcut (⌘B)), Italicize (or keyboard shortcut (⌘I)), add bulleted lists, etc.
Alternatively, if you move into Source editor mode, you will have a text-based editor and loose those buttons. This mode will require knowledge of the Markdown syntax.
see cheatsheet here: https://www.markdownguide.org/cheat-sheet/) or in RStudio Help > Markdown Quick Reference. Here are some common elements:
Element | Markdown Syntax |
---|---|
Heading | # H1 (level 1 header)## H2 (level 2 header)### H3 (level 3 header) |
Bold | **bold text** |
Italic | *italicized text* or_italicized text_ |
Blockquote | > blockquote |
Ordered List | 1. First item2. Second item3. Third item |
Unordered List | - First item- Second item- Third item |
Code | `code` |
Horizontal Rule | --- |
Link | [title](https://www.example.com) |
Image |  |
I find both modes useful and find myself switching between the two.
Now you can begin writing text and format with Markdown syntax or the RStudio Visual Markdown Editor.
You can add R codes chunks in the Source Editor by using three backticks followed by the language name (in this case, {r}
for R). End the code chunk with three more backticks.
You can quickly insert chunks into your R Markdown file with the keyboard shortcut Cmd + Option + I (Windows Ctrl + Alt + I) or by clicking the button.
Here is an example of a simple R chunk:
There are a variety of code chunk options; see more here: RStudio lesson and Yihui’s website. I have summarized the ones I most commonly use below:
Option | Description |
---|---|
echo |
Whether to display the source code in the output document. Besides TRUE /FALSE , which shows/hides the source code, we can also use a numeric vector to choose which R expression(s) to echo in a chunk, e.g., echo = 2:3 means to echo only the 2nd and 3rd expressions, and echo = -4 means to exclude the 4th expression. |
eval |
Whether to evaluate the code chunk. It can also be a numeric vector to choose which R expression(s) to evaluate, e.g., eval = c(1, 3, 4) will evaluate the first, third, and fourth expressions, and eval = -(4:5) will evaluate all expressions except the fourth and fifth. |
include |
Whether to include the chunk output in the output document. If FALSE , nothing will be written into the output document, but the code is still evaluated and plot files are generated if there are any plots in the chunk, so you can manually insert figures later. |
label |
used to assign a label or identifier to the code chunk |
fig-width |
Used to control the width of figures (plots, images, etc.) generated by code chunks. This (coupled with fig.height ) allows you to adjust the size of the figures in your document. |
fig-height |
Used to control the height of figures (plots, images, etc.) generated by code chunks. This (coupled with fig.height ) allows you to adjust the size of the figures in your document. |
fig-cap |
A character string to be used as a figure caption. |
out-width |
used to control the width of output figures, such as plots or images, generated by code chunks. This option allows you to adjust the display width of the figures in your document. |
warning |
Specifies whether the warning messages generated during the execution of the code chunk should be displayed in the document output. |
message |
Whether the messages produced by the code chunk are displayed in the document output (e.g. helpful to suppress verbose package libraries) |
Here is a simple example:
```{r}
#| echo: false
#| label: iris-scatterplot
#| fig-cap: "A simple scatter plot of the iris data set."
plot(iris[,1:2])
```
which gets rendered to (notice how the source code is hidden):
In Quarto, you can use the inline code syntax to embed code within the text of your document. The syntax for inline code, is to enclose within the expression . For instance
renders to: The mean of Sepal length of in the iris
data is: 5.8433333.
You can create tables in your Quarto document very easily in the Visual editor. Simply navigate to the Editor panel and select the Table < Insert Table. Fill in the desired rows and columns (along with optional caption) in the pop-up window.
Col1 | Col2 | Col3 |
---|---|---|
one | 2 | 3 |
four | 5 | 6 |
7 | eight | nine |
While you can create tables in the Source mode using Markdown syntax, I find the Visual mode to be much easier to work with.
To align the cells in a different way right click on the table and select the appropriate choice from the Align column menu.
Col1 | Col2 | Col3 |
---|---|---|
one | 2 | 3 |
four | 5 | 6 |
7 | eight | nine |
Quarto supports the use of LaTeX equations. Equations must be writing in “math mode”. In Quarto documents, you enter matho mode with an opening and closing dollar sign $
(used for in-line equations), or by opening and closing with two dollar signs the $$...$$
. The later will display the centered mathematical equation on it’s own line.
For example $x + y = z$
renders to \(x + y = z\). And $$x + y = z$$
renders to \[x + y = z\]
Notably, you can using inline R code within latex equations. For instance, let’s assign some random numbers to \(x\) and \(y\).
Based on the random numbers above what is \(x + y\) = ?
which renders to:
Answer: \(10.4108998 + 4 = 14.4108998\)
If you don’t want to keep all of the decimal places, you can round the numbers using round()
or utilize the format()
function. For example
renders to: \(10.41 + 4 = 14.41\)
In Quarto documents, paths are used to reference files (e.g., datasets, images, scripts) relative to the location of your .qmd
file or project structure. Proper path management is crucial for making your document portable, reproducible, and organized.
There are two types of Paths in Quarto:
Relative paths (Recommended for reproducibility) Relative paths reference files based on the location of your .qmd
document. Example structure:
In your .qmd
file, you might have something like:
Important Note: Quarto uses the directory where the .qmd
is located as the working directory by default. So there is no need to use setwd()
as you would have done with an R script.
Absolute Paths (Not Recommended) Absolute paths reference files using their full path on your computer. Example:
Notice how this method is tied to your system, so they break if the document is moved or shared (i.e. they are not portable). For this reason using relative paths (which is easily portable between machines) is the recommended workflow.
To maintain a clean and reproducible structure, organize your project files consistently:
stat205/
├── assignments/
│ ├── assignment1-studentID.qmd
│ ├── assignment2-studentID.qmd
│ ├── data/
│ │ ├── dataset1.csv
│ │ └── dataset2.csv
│ ├── figures/
│ │ ├── figure1.png
│ │ └── figure2.png
├── notes/
│ ├── lecture1.qmd
│ ├── lecture2.qmd
│ └── resources/
│ ├── reference1.pdf
│ └── reference2.pdf
Use the render button ( Render) located in the Source Editor panel or the or use the keyboard shortcut (⇧⌘K).
I like to click the “Render on Save” (located to the left of the ( Render) button, so that my document automatically render whenever I save the file (which is constantly).
In addition to rendering the complete document to view the results of code chunks, you can also run each code chunk interactively in the RStudio editor by clicking the icon or keyboard shortcut (⇧⌘⏎). RStudio executes the code and displays the results either inline within your file or in the Console, depending on your preference
In RStudio, the “Outline” pane in the top provides a convenient way to to view the document’s structure, navigate between sections, and quickly jump to different parts of the document. If the “Outline” pane is not visible, you can open it by going to the top-right corner of the Source Editor pane (this button looks like a small document icon with lines) or by using the keyboard shortcut Ctrl + Shift + O
(Windows/Linux) or Cmd + Shift + O
(Mac).
In addition, you can quickly navigate to different sections and code chunks using the “Chunk Navigation” feature. This feature is available as a clickable navigation menu located at the bottom-left corn of your Source Editor.
Open a new .Rmd file in the RStudio IDE by going to File > New File > R Markdown.
Embed code in chunks. Run code by line, by chunk, or all at once.
Write text and add tables, figures, images, and citations. Format with Markdown syntax or the RStudio Visual Markdown Editor.
Set output format(s) and options in the YAML header. Customize themes or add parameters to execute or add interactivity with Shiny.
Save and render the whole document. Knit periodically to preview your work as you write.
Share your work!
Tutorial: Hello, Quarto: https://rstudio.github.io/cheatsheets/html/quarto.html#overview
Get started with Quarto | Mine Çetinkaya-Rundel Posit PBC YouTube Channel https://www.youtube.com/watch?v=_f3latmOhew&t=711s
Quarto Cheatsheet: https://rstudio.github.io/cheatsheets/html/quarto.html
RMarkdown cheatsheet: https://rstudio.github.io/cheatsheets/html/rmarkdown.html
Markdown cheatsheet: https://www.markdownguide.org/cheat-sheet/
Markdown basics: https://quarto.org/docs/authoring/markdown-basics.html
Code chunk options: https://rmarkdown.rstudio.com/lesson-3.html, https://yihui.org/knitr/options/