This is a Quarto file that downloads a dataset using opendatatoronto, cleans it, and makes a graph.
Plan
The dataset I am interested in would need to have the date, and the water temperature. A quick sketch of a dataset that would work is Figure 2 (a), I am interested in the water temperature each month, the table would be like Figure 2 (b):
Them I will draw a geom_point graph like Figure 2 (a):
(a) Quick sketch of a graph
Figure 2: Sketches of a potential dataset and graph related to water temperature.
Simulate
This document uses R Core Team (2024) and Wickham (2016)
After examining the raw data, I found that there is only data between May and September. So, I am only generating simulated data between May and September
I can now make a graph of how water temperature change over time.
summary_data |>ggplot(aes(x = temp_year, y = number_temp)) +geom_point() +labs(x ="Year", y ="Water Temperature") +scale_color_brewer(palette ="Set1") +theme(legend.position ="bottom")
Bibliography
R Core Team. 2024. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. https://www.R-project.org/.
Wickham, Hadley. 2016. Ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York. https://ggplot2.tidyverse.org.