Cornell College
DSC 223 - Spring 2024 Block 7
If you have not yet completed the intro survey, please do so asap!
If you have not yet accepted the invite to join the course GitHub Organization, please do so asap!
Office hours + locations linked at https://dsc223-sb7-2024.github.io/DSC223_S24_website/course-instructor.html, come say hi to me and get help!
Let’s take a tour!
Only work that is clearly assigned as team work should be completed collaboratively.
Labs will be completed in groups. You should work with each other during, and sometimes outside of class, to complete the labs. You will be forced to work through each person in your team during class via mounted TVs.
Homework must be submitted individually. You may not directly share answers / code with others, however you are welcome to discuss the problems in general and ask for advice.
Exams must be completed individually. You may not discuss any aspect of the exam with peers. If you have questions, email me, especially if you get stuck on an usual problem (not a coding error).
I are aware that a huge volume of code is available on the web, and many tasks may have solutions posted
Unless explicitly stated otherwise, this course’s policy is that you may make use of any online resources (e.g. RStudio Community, StackOverflow, etc.) but you must explicitly cite where you obtained any code you directly use or use as inspiration in your solution(s).
Any recycled code that is discovered and is not explicitly cited will be treated as plagiarism, regardless of source
Treat generative AI, such as ChatGPT, the same as other online resources.
Guiding principles:
(1) Cognitive dimension: Working with AI should not reduce your ability to think clearly. We will practice using AI to facilitate—rather than hinder—learning.
(2) Ethical dimension: Students using AI should be transparent about their use and make sure it aligns with academic integrity.
✅ AI tools for code: You may make use of the technology for coding examples on assignments; if you do so, you must explicitly cite where you obtained the code.
❌ AI tools for narrative: Unless instructed otherwise, you may not use generative AI to write narrative on assignments. In general, you may use generative AI as a resource as you complete assignments but not to answer the exercises for you.
Ask if you’re not sure if something violates a policy!
Complete all the preparation work before class.
Ask questions.
Do the readings.
Do the lab.
Do the Homework
Don’t procrastinate! There is no time for falling behind on the block!
Course operation
Doing data science
By the end of the course, you will be able to…
What does it mean for a data analysis to be “reproducible”?
Short-term goals:
Long-term goals:
Packages: Fundamental units of reproducible R code, including reusable R functions, the documentation that describes how to use them, and sample data1
As of March 14th, 2024, there are 20,582 R packages available on CRAN (the Comprehensive R Archive Network)2
We’re going to work with a small (but important) subset of these!
Option 1:
Sit back and enjoy the show!
Option 2:
Clone the corresponding application exercise repo and follow along.
ae-01-meet-the-penguins
Go to the course GitHub organization and clone ae-01-meet-the-penguins-YOUR_USERNAME in RStudio on the server. http://turing.cornellcollege.edu:8787/
$
:Option 1:
Sit back and enjoy the show!
Option 2:
ae-01-meet-the-penguins
Go to the course GitHub organization and clone ae-01-meet-the-penguins-YOUR_USERNAME in RStudio on the server. http://turing.cornellcollege.edu:8787/
Important
The environment of your Quarto document is separate from the Console!
Remember this, and expect it to bite you a few times as you’re learning to work with Quarto!