- Read/ annotate: Recipe #3. You can refer back to this document to help you at any point during this lab activity.
- Review using Markdown syntax for formatting, including tables, numbering sections, and citations and references
- Review reading, inspecting, and writing data using R functions and Quarto code blocks
- Learn to describe data using prose and code in a Quarto document
- Practice interpreting datasets through statistical, tabular, and visual summaries.
- Create a Quarto document using the RStudio toolbar
- Provide the title: "Lab 03: Trace the datascape"
- Provide the author: <Your Name>
- Render the Quarto document (without changes)
- Click the 'Render' button on the RStudio toolbar
- Save the Quarto file with the name
lab-03.qmd
.
In the repository for this lab, you will find three data files corresponding to the data origin, data dictionary, and the data itself.
data_origin.csv
data_dictionary.csv
data.csv
These are the same files you used in Lab 02 which correspond to a transformed version of the Manually-Annotated Subcorpus (MASC). I suggest you review the data dictionary and data origin files and/ or your work from Lab 02 to help refesh your memory about the data.
- Create the following sections and subsections in your
.qmd
document:
- Overview
- About the data
- Assessment aims
- Setup
- Load packages
- Read data
- Inspect data
- Assessment
- Statistical overview
- Tabular summaries
- Visual summaries
- In the section "Overview", summarize the data and describe the aims of your assessment. Include the following information:
- What is the name of the data source?
- Where did it come from?
- What is the sampling frame?
- What are you trying to learn about the data?
- In the section "Setup", use strategies from Recipes 2 and 3 to load the necessary packages to read and inspect rectangular data. You may choose to subset your data to include only the variables and/ or observations you need for your assessment. Then, write a paragraph describing the data. Include the following information:
- How many variables are included?
- What are the variable types?
- How many observations are included?
-
In the section "Assessment", use strategies from Recipe 3 to assess the data. You may choose to include a statistical overview, tabular summaries, and/ or visual summaries as part of your assessment. Include at least one table and one figure in your assessment. These should be cross-referenced in your prose description of the dataset assessment.
-
Render the
.qmd
as a PDF or a Word document -
(optional) Explore adding the following table and plot enhancements to your Quarto document:
kableExtra
packagekable_styling()
:font_size
,striped_rows
,full_width
, etc.
ggplot2
packagetheme_*()
:theme_bw()
,theme_classic()
,theme_minimal()
, etc.labs()
:labs(title = "Title", subtitle = "Subtitle", x = "X-axis", y = "Y-axis")
- Add a section which describes your learning in this lab.
Some questions to consider:
- What did you learn?
- What did you find most/ least challenging?
- What resources did you consult?
- Instructor? R or Quarto documentation, Websites (provide links)?
- What more would you like to know about Trace the datascape in R and/ or Quarto?
- Find potential resources you might consult to continue your learning. Provide links and a brief description of the resource.
- To prepare your lab report for submission you will need to render your Quarto document to PDF or Word.
- Download this file to your local computer.
- Submit your report as described by your instructor.
This work is licensed under a Creative Commons Attribution 4.0 International License.