-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathindex.Rmd
More file actions
71 lines (45 loc) · 3.44 KB
/
index.Rmd
File metadata and controls
71 lines (45 loc) · 3.44 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
---
title: "A3: Incarceration"
output: html_document
---
### Introduction
An introduction of the problem domain and a description of the variable(s) you are choosing to analyze (and why!)
### Summary Information
Write a summary paragraph of findings that includes the 5 values calculated from your summary information R script
These will likely be calculated using your DPLYR skills, answering questions such as:
- What is the average value of my variable across all the counties (in the current year)?
- Where is my variable the highest / lowest?
- How much has my variable change over the last N years?
Feel free to calculate and report values that you find relevant. Again, remember that the purpose is to think about how these measure of incarceration vary by race.
### The Dataset
Who collected the data?
How was the data collected or generated?
Why was the data collected?
How many observations (rows) are in your data?
How many features (columns) are in the data?
What, if any, ethical questions or questions of power do you need to consider when working with this data?
What are possible limitations or problems with this data? (at least 200 words)
### Trends Over Time Chart
Include a chart. Make sure to describe why you included the chart, and what patterns emerged
The first chart that you will create and include will show the trend over time of your variable/topic. Think carefully about what you want to communicate to your user (you may have to find relevant trends in the dataset first!). Here are some requirements to help guide your design:
- Show more than one, but fewer than ~10 trends
- This may mean showing the same measure for different locations or different racial groups. Think carefully about a meaningful comparison of locations (e.g., the top 10 counties in a state, top 10 states, etc.)
- You must have clear x and y axis labels
- The chart needs a clear title
- You need a legend for your different line colors and a clear legend title
When we say "clear" or "human readable" titles and labels, that means that you should not just display the variable name.
Here's an example of how to run an R script inside an RMarkdown file:
``````{r, echo = FALSE, code = readLines("chart1_example.R")}
```
### Variable Comparison Chart
Include a chart. Make sure to describe why you included the chart, and what patterns emerged
The second chart that you will create and include will show how two different (continuous) variables are related to one another. Again, think carefully about what such a comparison means and what you want to communicate to your user (you may have to find relevant trends in the dataset first!). Here are some requirements to help guide your design:
- You must have clear x and y axis labels
- The chart needs a clear title
- If you choose to add a color encoding (not required), you need a legend for your different color and a clear legend title
### Map
Include a chart. Make sure to describe why you included the chart, and what patterns emerged
The last chart that you will create and include will show how a variable is distributed geographically. Again, think carefully about what such a comparison means and what you want to communicate to your user (you may have to find relevant trends in the dataset first!). Here are some requirements to help guide your design:
- Your map needs a title
- Your color scale needs a legend with a clear label
- Use a map based coordinate system to set the aspect ratio of your map