Private Sub Form_Load() MsgBox "Hello, World!" End Sub
नमस्ते & السلام عليكم
<html> Hello world</html>
In groups three or four meet and greet each other.
I live …
My favorite thing about UCI is …
I am awesome because …
Find something in common between all of you by expanding the conversation.
Find a difference.
The most important thing about this course
Think 💭 - Pair 👫🏽 - Share 💬
What do you think data science is about and what will we learn in this course? There is no right or wrong answer.
Data science is an interdisciplinary academic field that uses statistics, scientific computing, scientific methods, processes, algorithms and systems to extract or extrapolate knowledge and insights from noisy, structured, and unstructured data. Wikipedia
Data science also integrates domain knowledge from the underlying application domain (e.g., natural sciences, information technology, and medicine). Data science is multifaceted and can be described as a science, a research paradigm, a research method, a discipline, a workflow, and a profession. Wikipedia
What types of data will we use in this course
We will use a variety of datasets from biological studies to business answering questions serving different purposes in life. Data will come different size, shape, and form and will include numbers, categories, text etc.
Is this a statistics course or a computing course?
A little bit of both.
Do I need prior programming/statistics experience?
|Shortcut||Alt + -||Option + -|
If something comes in quotes, it is not defined in R.
do() is a function;
something is the argument of the function.
In order to get any help we can use
? followed by function (or object) name.
You should not copy paste code from my slides or from the internet. Part of learning to code is building up your muscle memory.
After function names do not leave any spaces.
Before and after operators (e.g. <-, =) leave spaces.
Put a space after a comma, not before.
Object names are all lower case, with words separated by an underscore.
You can let RStudio do the indentation for your code.
_Hello world_ __Hello world__ ~~Hello world~~
Slides that you are currently looking at are also written in Quarto. You can take a look at them on our course’s GitHub organization in the slides repo.
Does this look familiar?
What if we tracked our file with a better names for each version and have only 1 file hw1?
hw1 added questions 1 through 5
hw1 changed question 1 image
hw1 fixed typos
We will call the descriptions in italic commit messages.
git allows us to keep track of different versions of a file(s).
GitHub is a website where we can store (and share) different versions of the files.
We have actually done something similar to this demo before by cloning the test repo and committing, and pushing.
.Rproj file to open projects. Then open the appropriate
.R file from the Files pane. If you don’t open
.Rproj file you will not be able to see the Git pane.
repo is a short form of repository. Repositories contain all of your project’s files as well as each file’s revision history.
For this course our daily repos (lecture code, activity etc.) are hosted on Github.
To clone a GitHub repo to our computer, we first copy the cloning link as shown in screencast then start an RStudio project using that link.
Cloning a repo pulls (downloads) all the elements of a repo available at that specific time.
Once you make changes to your repo (e.g. take notes during lecture, answer an activity question) you can take a snapshot of your changes with a commit.
This way if you ever have to go back in version history you have your older commits to get back to.
This is especially useful, for instance, if you want to go back to an earlier solution you have committed.
All the commits you make will initially be local (i.e. on your own computer).
In order for us to see your commits and your final submission on any file, you have to push your commits. In other words upload your files at the stage in that specific time.
Git: is software for tracking changes in any set of files
GitHub: is an internet host for Git projects.
repo: is a short form of repository. Repositories contain all of your project’s files as well as each file’s revision history.
clone: Cloning a repo pulls (downloads) all the elements of a repo available at that specific time.
commit: A snapshot of your repo at a specific point in time. We distinguish each commit with a commit message.
push: Uploads the latest “committed” state of your repo to GitHub.
Do you git it?