Image by WikiImages from Pixabay
In this lab, we will learn how to better understand data by conducting Exploratory Data Analysis (EDA). EDA involves the use of graphical and numerical techniques to uncover the structure of the data. We use EDA to help us see what is contained in the data, what the variables are and their observations – how they were measured or counted. We want to begin to understand if there are interesting patterns or relationships between variables, and if there are unusual observations.
To learn more about how to conduct an exploratory data analysis, we will have two tutorials in the Rehearse portion of this lab for you to work through.
The first, Graphing Data, will be on visualizing data and the second, Describing Data will be on the numerical exploration which we call “descriptive statistics.” Again, don’t be afraid of the “s” word. We provide all you need in the form of our friendly code chunks.
Important
Before you begin the two rehearse sessions, make sure you have followed the link below to set up the Lab 02 project in your RStudio account.
To get started in Lab 02, follow this link to set up Lab 02 Exploring Data in your RStudio Cloud workspace. After it has deployed in your workspace, remember to save the temporary project to your workspace.
When you open the project in your workspace, you should see three R markdown (.Rmd) files. There is a worksheet for the Graphing Data rehearse and another worksheet for the Describing Data rehearse. And a template, Lab-02-Student-Name.Rmd, to be used for your Remix and Report where you apply a bit of what you learned and answer some questions. You will knit each document as you finish it and submit all three in your Lab 02 Assignment area in Canvas.
When you work through each Rehearse session, you will use a worksheet to capture your progress as you follow the instructions in the Rehearse web pages. You can keep the Rehearse page open on one browser tab and the appropriate worksheet in your RStudio account in another.
Let’s get started with the first tutorial – Graphing Data.
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Last Compiled 2022-01-07