title | cover title | description | learning objectives | estimated time | prerequisites | before getting started | readings | projects | ethical considerations | resources | ||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Introduction to Git and GitHub |
Git and GitHub |
Git and GitHub are powerful tools for collaborative and individual projects. Git is a version control software that aids with tracking changes made to a set of files over time. GitHub is a web-based platform for storing and sharing project files online. This session begins with a conceptual overview of both tools, including an introduction to fundamental concepts such as version control and practical applications like developing a syllabus or collaborative writing. This session then covers initializing Git repositories, committing changes, pushing to GitHub, cloning repositories to your local machine, and forking repositories from other accounts on GitHub. |
|
|
|
|
|
|
|
|
Git is software used for version control—that is, tracking the state of files and changes you make to them over time. Git can be enabled in a folder, and then used to save the state of the contents in that folder at different points in the future, as designated by you. In the language of Git, a folder is called a repository. In the context of this workshop, it refers to a folder that is being tracked by Git. Using Git, you can view a log of the changes you've made to the files in a repository and compare changes over time. We will explore these features in the current workshop. You can also revert back to previous versions, and create "branches" of a project to explore different futures. These are advanced features, which we will provide resources for you to explore later. Git is also useful for collaboration, as a repository can be shared across computers, and its contents can be asynchronously developed and eventually merged with the main project.
GitHub is an online platform for hosting Git repositories. It functions for some, predominantly programmers, as a social network for sharing and collaborating on code-based projects. Users can share their own projects, as well as search for others, which they can then often work on and contribute to. Digital Humanists, librarians, and other academics are also finding ways Git and GitHub are useful in writing projects and teaching. GitHub also serves as a web-hosting platform, allowing users to create websites from their repositories.
As we move forward it's important to make sure we're firm on the distinctions between the two different tools outlined above.
Git is a software that you use on your laptop, or your local computer/machine. The repository with your project's files is stored on your hard drive. You also edit the text files on your local machine using a plain text editor, which is another software on your local computer like Visual Studio Code.
GitHub is a cloud-based platform that you access through your internet browser. Even though you physically are still in the same place, working on your laptop, you are no longer working on your local machine, you are on the Internet. This is a fundamentally different location than when you're working with your Git repository and editing and creating files in your plain text editor. With GitHub, you are uploading your repository—as described above—from your local machine to this platform on the Internet to be shared more broadly. You can also create private repositories if you want to use GitHub to backup a project.
Which of the following best describes version control: (select one)
- a software installed on my local machine - the practice of tracking and organizing the state of a file over time, as it changes* - a language that can be read and rendered by some web-based platforms - a cloud-based softwareWhich of the following best describe Git: (select all that apply)
- a software installed on my local machine* - the practice of tracking and organizing the state of a file over time, as it changes - a web-based platform for storing and sharing files - a version control software* - a cloud-based software - refers to project folders as "repositories"*Which best describes GitHub: (select all that apply)
- a cloud-based software* - a software installed on my local machine - a web-based platform for storing and sharing files* - a version control software - refers to project folders as "repositories"* - the practice of tracking and organizing the state of a file over time, as it changesDo you remember the glossary terms from this section?
A study of how Digital Humanists use GitHub, conducted by Lisa Spiro and Sean Morey Smith, found that a wide range of users, including professors, research staff, graduate students, IT staff, and librarians commonly used the site in their DH work. They used GitHub for a diverse range of activities, such as:
- Developing software
- Sharing data sets
- Creating websites
- Writing articles and books
- Collating online resources
- Keeping research notes
- Hosting syllabi and course materials
Participants in the study said they found GitHub useful in their Digital Humanities work for several reasons. In particular, it facilitated:
- Sharing and backing up files on multiple computers
- Monitoring changes effectively
- Recovering from bugs or errors by going back in time before the error arose
- Using different branches for experiments and new directions
- Sharing and managing files with others—seeing who added what content and when
As you can see across these sessions, we use GitHub to host workshop curricula. Hosting sessions on GitHub allows you (and anyone else interested in these topics!) to follow our repositories, and create your own version of the workshop based on our materials. This fosters open scholarship and knowledge sharing. It also facilitates attribution and citation by clearly tracking which content was created by whom, when it was added, and which projects or materials are derived from others.
Case One: This Session
If you go this to workshop on GitHub and look at the top of the page just under the workshop title, DHRI-Curriculum/git
, you can see it is forked from pswee001/Git_DRI_Jan_2018
(next to the red star). That line shows that this particular repository is copied from ("forked from") the curriculum for a session presented at our January 2018 Institute by "pswee" (former Graduate Center Digital Fellow Patrick Sweeney). If you then look at that repository, you will see that it is in turn forked from previous sessions that were developed by other GC Digital Fellows for workshops in past years. We'll expand on forking in the final section of this workshop.
Git is also used in writing projects! Version control makes tracking changes tractable, especially when there are multiple authors working asynchronously. It can be an alternative to using track changes in Microsoft Word, or comments and edits in a Google Doc.
Case Two: Coauthored Publications
Git and GitHub—together or independently—support multi-author publishing. Like we have done with the DHRI curriculum, you can have a shared project folder that multiple people are working from asynchronously, even on the same parts if they wanted, and then those different offshoots can be carefully folded back into the main project. A singular author can also create different offshoots on their own, allowing the writer to explore different ways forward. This, combined with version control, allows authors to easily return to and compare older drafts or retrieve sections previously discarded.
We encourage you to explore these features after you gain a grasp on the fundamentals in the current workshop. You can find resources to do this on the Theory to Practice page!
How did you initially come by the syllabus you use for your class(es), and did you develop it over time? Many professors borrow and adapt from each other, and most of us probably update our syllabi each semester, even if only a little bit.
Through this process, many of us end up with a set of files that looks something like this:
While I probably can tell which version is the "final" one, I can not see what was changed along the way or how the different versions vary from each other.
With Git, you would save these multiple versions over time as one file, and each version you save includes a note about what has changed so you can easily revert back to an older version if needed.
By looking at the file list, you also can not tell who the syllabus originally came from, or if there were contributions from many individuals. Git and GitHub can help make attribution clear, and maintain it over time as the syllabus travels between hands.
Case Three: Syllabi
Increasingly we see that faculty are sharing their syllabi on GitHub (example: DLCL 204: Digital Humanities Across Borders). Some are even using GitPages that apply a user-friendly interface to their repository to make it easier to access and navigate for their students (example: Digital History).
GitHub offers a way of making a course publicly available on the web, and sometimes easier or more intuitive to users than some learning management systems. Git helps track the changes over time.
When the softwares are used together, Git and GitHub also support a collaborative approach to syllabi development. Copying another's project and modifying and remixing the content to meet your needs is a seamless and transparent process. Attribution of specific changes over time is a foundational function of how Git operates; GitHub explicitly renders attribution, making it easy to see who(s) did what. This is one of the attractions of using the platform.
In a practical sense, you could search other syllabi on GitHub, and share yours so it could be searched by others. If someone finds a syllabus that includes parts they want to use, they could fork that syllabi to their GitHub account, and download the files to your local machine and edit them there. Any changes could then be added back to the repository on GitHub, thereby sharing your amendments publicly. On GitHub, attribution of who contributed what are transparent. Meanwhile, your amended version would be available for others to modify and re-share.
Even if you were only working with your own self-created syllabus, like we'll do later in this workshop, Git and GitHub can be useful for tracking your changes without the hassle of multiple files. From one file, you can use Git to compare your current version with older versions; you can also compare and share these different versions on GitHub if you wanted.
We will cover these features in-depth in a later lesson in this workshop.
What tasks could Git and/or GitHub offer support to? (select all that apply)
- Developing software* - Creating and sharing data sets* - Creating websites* - Writing articles and books* - Collating online resources* - Keeping research notes* - Hosting syllabi and course materials*Do you remember the glossary terms from this section?
During this workshop, you'll be communicating with GitHub from your local computer via the command line (the Terminal or the Git Bash on Windows). This section reviews some of the basic commands that will also be used in this workshop.
In addition to the command line, you'll be using your text editor and your browser. Before continuing, its important that we clearly distinguish between these three different spaces or environments:
- Your plain text editor where you'll be writing your syllabus is on your local computer.
- That syllabus is initially saved in a git-enabled repository on your local computer.
- Your browser is where you'll be uploading your repository to GitHub, a cloud service.
- Your terminal is where you'll be communicating with GitHub to send the repository and project files back and forth between the cloud (which you can view through the GitHub website) and your hard drive.
Because you'll be moving between these three spaces throughout the workshop, you may want to use (command (⌘) + tab) or (control + tab) to move quickly between the three windows on your desktop.
macOS
Hold the command (⌘) key and press the space bar at the same time to bring up the "Spotlight Search" window. Type terminal
, followed by enter to quickly open the Terminal.
Windows
Press the windows button on your keyboard. When the search menu pops up, type git bash
and press enter.
If you don't feel comfortable navigating your hard drive through the command line, here is a short section catching you up. If you feel fairly comfortable using the command line, you can skip this section, and go straight to the next one.
You can create the folder anywhere on your hard drive by typing the following into your terminal and hitting enter.
$ cd <directory-name>
Let's practice this command by using it to take us to our Desktop. Type the following command into your terminal and hit enter.
$ cd Desktop
This will change your current working directory from /Users/<your-name>
to /Users/<your-name>/Desktop
.
Check your current directory by typing the following command into your terminal and hit enter:
$ pwd
Now, use the following command to go up one directory:
$ cd ..
Check your current directory again using the following command. You should be back in your "home" directory:
$ pwd
Practice going back and forth between your Desktop and your home directory.
When finished, go to your Desktop folder and check that you're there with pwd
.
In this session, we will be making a syllabus and using Git to keep track of our revisions. Let's create a Git project folder.
If you've worked through the command line session, you should already have a projects
folder on your desktop. If you don't have a projects folder on your desktop, create one using the following command:
$ mkdir projects
From Desktop
, Navigate into your projects
folder using the following command:
$ cd projects
Then create a git-practice
folder with the following command:
$ mkdir git-practice
Navigate into the new git-practice
folder using the following command:
$ cd git-practice
At this point, when you type pwd
, your folder structure should look like this:
$ pwd
/home/<username>/Desktop/projects/git-practice
Which best describes where you are working when you're writing in your plain text editor:
- on my local machine* - on the internetWhich best describes where you are working when you're using your terminal to communicate with GitHub and share the files:
- on my local machine* - on the internetWhich best describes where your files are when you are viewing them in GitHub:
- on my local machine - on the internet*Git-enabled repository means: (select one)
- none of the files on my local machine are being tracked - a specific file on my local machine is being tracked - a specific folder on my local machine is being tracked* - all the files on my local machine are being trackedWhich command do you use to make a new folder?
- `pwd` - `cd` - `mkdir`*Which command do you use to enter into a folder?
- `pwd` - `cd`* - `mkdir`Which command do you use to check where you are?
- `pwd`* - `cd` - `mkdir`Through this section, you'll be checking your installation and configuring Git with your own name and information.
First, let's make sure Git has been successfully installed. In your terminal, type the following command:
$ git --version
If you see a version number, you're all set. If not, follow the installation instructions here.
Our first step in working with Git is letting the software know who we are so it can track our work and attribute our contributions. This information is useful because it connects identifying information with the changes you make in your repository.
Type the following two commands into your command line, replacing the "John Doe" and "johndoe@example.com" with your name and email (use quotations where you see them). These do not necessarily need to be the name and email you used to sign up for GitHub. Remember, these are different spaces and different softwares.
$ git config --global user.name "John Doe"
$ git config --global user.email johndoe@example.com
To check your set-up, type the following command into your terminal:
$ git config --list
You should get something that looks like this except with whatever information you entered previously:
user.name=Superstar Git User
user.email=gitsuperstar@gmail.com
What are you doing when you set up git? (select one)
- You are creating a new version of the software on your local machine. - You are sending files from your local machine to GitHub? - You are introducing yourself to the software, so it knows who you are.* - You are creating a new version of a project folder on your local machine.The next step is to initialize the project folder that we want Git to track. Even though we configured Git for our computer, Git doesn't start tracking every single file on our computer. That would turn into a headache quickly. We only want Git to track changes for files within specific folders/projects.
When we initialize a folder, we are telling Git to pay attention to it. This only needs to happen once because what is actually happening through this process is Git is adding a hidden subfolder within your folder that houses the internal data structure required for version control. After initialization, Git is ready to track the files within the folder. The folder is now considered a Git repository.
First, use cd
, navigate to the git-practice
folder (inside projects
). From your home directory, you can do all of them in one command by typing the following into your terminal:
$ cd Desktop/projects/git-practice
Next we're going to initialize our repository using the git init
command, which should generate the following output:
$ git init
Initialized empty Git repository in /home/<your-username>/projects/git/.git/
Now Git is tracking our directory. However, it has not done any versioning yet. This is because 1) we haven't told Git to take a snapshot yet, and 2) there are no files in the folder to take a snapshot of. For now, Git knows this folder exists and is prepared to take a snapshot of the files when you tell it to.
Before version control is useful, we'll have to create a text file for Git to track. For this session, the file we will track will be a course syllabus—we'll create that next.
To create a plain text file, we're going to switch to our text editor, Visual Studio Code, to create and edit a file named syllabus.md
and save it to our git-practice
folder. The .md
extension indicates that it is a Markdown file, which is a special file format we will dive into in the next section.
If you have not installed Visual Studio Code, review the installation instructions here.
In terminal, check to make sure you are in your git-practice
folder. (Hint: use pwd
to see what directory you are currently in.)
Next, open the syllabus.md
file in Visual Studio Code using:
$ code syllabus.md
You should see a window appear that looks similar to this:
If Visual Studio Code does not open when you use the code
command in your terminal, open it using the Start Menu on Windows or Spotlight Search on macOS as you would any other software. Then click File > Open File
and use the dialog to navigate to the /Users/<your-name>/Desktop/projects/git
folder and create a syllabus.md
file there.
We'll be typing our markdown into this file in the Visual Studio Code window. At any time, you can save your file by hitting control + s on Windows or ⌘ + s on macOS. Alternatively, you can click the File
menu on the top right, then select Save
from the dropdown menu.
Saving frequently is advised. When we get to the version control functionality of Git, only changes that are saved will be preserved when a version is created.
We'll be using Markdown to write a syllabus, and then using Git to track any changes we make to it. Markdown allows us to format textual features like headings, emphasis, links, and lists in a plain text file using a streamlined set of notations that humans can interpret without much training. Markdown files usually have a .md
extension.
Markdown is a markup language for formatting text. Like HTML, you add markers to plain text to style and organize the text of a document.
Whereas you use HTML and CSS with WordPress, you use Markdown to render legible documents on GitHub. Markdown has fewer options for marking text than HTML. It was designed to be easier to write and edit.
For comparison, you learned to create headers in HTML like this:
<h1>My Syllabus Heading</h1>
In Markdown, we insert headings with a single hash mark like this:
# My Syllabus Heading
A sub-heading (H2) heading uses two hash marks like this:
## Readings
The lessons of this workshop were originally written in markdown. You can see here what they look like in their raw, unrendered form.
Compare that with this—the source code for this lesson's web page, written in HTML here.
Markdown is also arguably more sustainable and accessible than formats like .docx
because of its simplicity and related ability to be read across multiple platforms. Use of Markdown is also supported by document-conversion tools like Pandoc that can change a markdown file to an .epub
with one command entered into your terminal.
Here are a few more key elements to get you ready to make your own syllabus in Markdown.
To provide emphasis, place asterisks around some text:
*This text will appear italicized.*
**This text will appear bold.**
For emphasis, you need to mark where it should start and where it should end, so you need asterisks at the beginning and end of whatever text is being emphasized.
To create a bulleted list, put a hyphen at the beginning of each list item:
- Reading one
- Reading two
- Reading three
To create a link, put the anchor text (the text you will see) in square brackets and the URL in parentheses, directly following the anchor text in brackets. Don't put a space between them:
I teach at [The Graduate Center, CUNY](https://www.gc.cuny.edu).
Paragraphs of text are denoted by putting a blank line between them:
This is a paragraph in markdown. It's separated from the paragraph below with a blank line. If you know HTML, it's kind of like the <p> tag. That means that there is a little space before and after the paragraph when it is rendered.
This is a second paragraph in markdown, which I'll use to tell you what I like about markdown. I like markdown because it looks pretty good, if minimal, whether you're looking at the rendered or unrendered version. It's like tidy HTML.
Use these five elements—headings, emphasis, lists, links, and paragraphs—to create a syllabus. Have a main heading that gives the course title (one #
), then subheadings for, at least, course info and readings. Use emphasis (*
) for book titles and try to get a list in there somewhere.
If you want an a more advanced challenge, you can review some additional markdown elements on this page and add some extra features like images, blockquotes, or horizontal rules.
You can look at an example syllabus in raw text form here. You can see what it looks like when it's rendered by GitHub on GitHub. When editing the markdown file in Visual Studio Code, it might look like this:
-
Visual Studio Code also has a preview feature for your markdown. Hit the preview button on the top right while editing your markdown file:
You'll get two side-by-side panels. Your markdown file will be on the left, and your rendered preview will be on the right:
-
Remember to save your work—regularly!—with control + s on Windows or ⌘ + s on macOS.
Which best describes what you're doing when you initialize your project folder: (select all that apply)
- You created a new version of your project folder - You told Git to pay attention to your project folder* - You told Git to set up its file structure within your project folder so it can track changes to your files.* - You use the command `mkdir` in your terminal - You use the command `git init` in your terminal*Which best describes Markdown: (select all that apply)
- a software installed on my local machine - a language for formatting plain text files* - a language that can be read and rendered by some web-based platforms* - a version control software - a cloud-based software - refers to project folders as "repositories"Do you remember the glossary terms from this section?
Git's primary function is version control, or to track a project as it exists at particular points in time. Now that we have a file to track—our syllabus.md
—let's use Git to save the current state of the repository as it exists now.
In Git, a commit is a snapshot of a repository that is entered into its permanent history. To commit a change to a repository, we take two steps:
- Adding files to a "staging area," meaning that we intend to commit them.
- Finalizing the commit.
Staging a file or files is you telling Git, "Hey! Pay attention these files and the changes in them".
Making a commit is a lot like taking a photo. First, you have to decide who will be in the photo and arrange your friends or family in front of the camera (the staging process). Once everyone is present and ready, you take the picture, entering that moment into the permanent record (the commit process).
Why do you need both steps? Sometimes when you're working on a project you don't want to pay attention to all the files you changed. Perhaps you fixed a bug in some code, but also did some work on your manuscript document. You may want to only commit the changes you made to the code because you still haven't finished your thoughts on the manuscript. You can stage, or add
, the code file so Git knows to only commit the changes made to that file. Later, you can stage and then commit the manuscript changes on their own once you've finished your thought.
First, let's see what state Git is currently in. We do that with the git status
command. It's a good idea to use this command before and after doing anything in a Git repository so you can always be on the same page as the computer.
Make sure you're in your /home/<your-name>/Desktop/projects/git-practice
directory using the pwd
command in the terminal. Once you're there, enter git status
and you should see the following output:
$ git status
On branch main
No commits yet
Untracked files:
(use "git add <file>..." to include in what will be committed)
syllabus.md
nothing added to commit but untracked files present (use "git add" to track)
"Nothing added to commit" means that we have initialized our repository, but haven't made any commits yet. If you're instead getting a message that begins with the word fatal
when you use git status
, you may be in the wrong directory or perhaps you haven't run the git init
command on your directory yet.
Let's follow the recommendation in the status message above and use the add
command to stage files, making them ready to be committed.
We will go ahead and add syllabus.md
by writing the following in the terminal:
$ git add syllabus.md
You should see no output from the command line, which should be interpreted as a the above command succeeded. It is what we call "succeeding silently." Let's run git status
again to have a "sanity check"—to make sure that things have changed. You should see output like this:
$ git status
On branch main
No commits yet
Changes to be committed:
(use "git rm --cached <file>..." to unstage)
new file: syllabus.md
The new file: syllabus.md
should be highlighted in green to show that it's ready for commit.
This is Git telling you, "Ok, I see the file(s) you're talking about."
Now that our files have been staged, let's commit them, making them part of the permanent record of the repository. In the terminal, type:
$ git commit -m "Initial commit of syllabus file"
The -m
flag provides that the message following the flag (in quotation marks) along with the commit. The message will tell others—or remind a future version of yourself—what the commit was all about. Try not to type git commit
without the -m
flag—there's a note about this below.
After running the command, you should see output like this:
[main (root-commit) 8bb8306] Initial commit of syllabus file
1 file changed, 0 insertions(+), 0 deletions(-)
create mode 100644 syllabus.md
This means you have successfully made your first commit in the repository—congratulations! There are a few things going on in this message. The relevant information for you for now is the second line, which tells you that one file was changed, and there were no insertions or deletions. You have a fresh new file!
Let's check the state of our repository after the commit by running git status
:
$ git status
On branch main
nothing to commit, working tree clean
This means that everything in the repository is successfully committed and up-to-date. If you edit your syllabus file or create a new file in the repository, the message you get with git status
will instead list files that have uncommitted changes.
Let's run one other command to see the effect our commit has had. Enter this command:
$ git log
You should see output similar to this:
commit 8bb8306c1392eed52d4407eb16867a49b49a46ac (HEAD -> main)
Author: Your Name <your-email-here@gmail.com>
Date: Sun May 20 16:03:39 2018 -0400
Initial commit of syllabus file
This is the log of commits, comprising a history of your repository. There's only one commit here now, though. If you don't see a prompt (the $
) after running git log
, you may need to press the q key (just the q key by itself) to return to the command line.
The -m
flag is useful for human purposes and technical purposes. For human purposes, the -m
flag helps you keep track of the changes you're making. Version control is most useful when you can confidently return to a specific version. It can also help you be more structured in your approach to making changes—your notes to self are limited, so to make them clear, you might make commits after specific tasks are completed. If you update readings for the first week of classes or if you add another reading, you will want to make a commit. This can also make it easier to reverse a specific change in the future.
Also, if you type git commit
by itself, git will open the command line's default text editor to allow you to enter the commit message in a file-like environment. It looks something like this:
This unfamiliar screen is the default text editor, vi
, and it requires some knowledge to use. We don't teach it as part of our sessions, but if you find yourself stuck in this screen, you can try this trick to leave that environment and return to your usual command prompt. Type :q
and then press enter. You should be back to the command line with a message saying:
Aborting commit due to empty commit message.
If you make a mistake where you include an opening quotation mark but forget a closing one, you might accidentally end up inside a "quote prompt." You will know you're there when your command prompt changes to quote>
. If this happens, you can just keep writing as much of your commit message as you want, and then end it with the same quotation mark that you opened the commit message with.
Another option is to press control + c on your keyboard, which will exit the quote prompt and cancel any commits you were trying to perform.
If you're ever stuck or "trapped" on the command line, try running through these common exit commands to return to the prompt:
- control + c
- control + d
q
followed by enter:q
followed by enter
control + c attempts to abort the current task and restore user control. control + d escapes the current shell environment—if you use it at the normal $
prompt, it will end the current command line session. q
is often used as a command (followed by enter) to escape from specific programs like less
. :q
is the command used in vi
that changes the mode of interaction (:
), allowing you to enter the q
, a one-letter command to quit, which must be followed by enter. Thus, it's a command specific to vi
.
Which best describe the process of staging: (select one)
- you telling Git to take a snapshot of changes made to a file. - you telling Git which files with changes you want it to pay attention to.* - you telling git to pay attention to a folder storing files you want to make changes to. - the second part of a two-step process.Which best describes the process of committing: (select all that apply)
- you telling Git to take a snapshot of changes made to a file.* - you telling Git which files with changes you want it to pay attention to. - you telling git to pay attention to a folder storing files you want to make changes to. - the second part of a two-step process.*What happens if you stage the files, but don't commit them? (select all that apply)
- Git won't know what files you want to take a snapshot of - Git won't take a snapshot of the files.* - Git will take the snapshot of the files - You will have told Git what files you would like it to take a snapshot of.*What happens if you commit the files, but don't stage them? (select all that apply)
- Git won't know what files you want to take a snapshot of* - Git won't take a snapshot of the files.* - You will have told Git what files you would like it to take a snapshot of. - Git will take the snapshot of the files*Which best describes the -m
flag used when committing changes to a file? (select all that apply)
Do you remember the glossary terms from this section?
Now, you may want to backup or share and collaborate around a file on the Internet. Let's connect the directory you created on your local computer to GitHub's cloud service, which you can access through the web.
Remember, GitHub is a service that allows you to host files, collaborate, and find the work of others. Once our syllabus is on GitHub, it will be publicly visible. (Repositories on GitHub can also be private but are public by default.)
Go to GitHub in your browser and click the plus sign in the upper right hand corner.
After clicking the plus button, select New repository
from the dropdown menu.
After clicking New repository
, you'll have to enter some information, including a name and description for your repository.
- Choose a name, such as
git-practice
. (This does not need to match your folder name although it may be less confusing if you choose the same name here.) - Enter a description, such as
Test syllabus for learning Git and GitHub
. - Keep the
Public — Anyone can see this repository
selector checked. (If you choose the Private option, you will need additional steps, not covered in this workshop, to synchronize your GitHub repository with the folder on your computer.) - Do not select
Initialize this repository with a README
since you will be importing an existing repository from your computer. - Click
Create repository
.
You should end up inside your newly created repository. It will look like a set of instructions that you might want to use to connect your GitHub repository to a local repository.
The instructions we want consist of two lines underneath the heading ...or push an existing repository from the command line
. The hand in this screenshot points to where these directions are on the page:
Use the copy button in the top right corner of the code box to copy all three lines of code. They will look something like this:
$ git remote add origin https://github.com/<username>/<repository-name>.git
$ git branch -M main
$ git push -u origin main
You'll need the command copied from your new repository, since it will contain the correct URL.
Paste them into your command line and press enter. You may need to press enter multiple times to run all three lines of code.
Total 3 (delta 0), reused 0 (delta 0)
To https://github.com/<username>/<repository-name>.git
* [new branch] main -> main
Branch 'main' set up to track remote branch 'main' from 'origin'.
If you see output like this, go back to your new repository page in the browser and click the Refresh
button. You should see your syllabus.md
file on GitHub!
We have covered the basic steps of creating a file and tracking changes within a file on your local machine and on GitHub.
This has involved coordinating across three different environments, so let's go through that one more time. Note that this process is very slightly different. I'll highlight it when it comes up.
To start, let's add something to our syllabus. Another week of materials or a new reading.
Save that file.
Use git add
via the command line to stage the file—tell Git what document you want it to pay attention to.
Use git commit
via the command line to save the changes you've just made as a snapshot or new version of you file. Remember to use the -m
flag and include a message about the change you just made.
So far, we have not done anything with GitHub or on the Internet. We have used Git, installed on our local machine, to save a version of file as it stands now. We could stop here if we only had an interest in using Git for version control. But if we also wanted to use GitHub to back up our files, or to share our project with a team or publicly, we want to upload, or push, that repository to GitHub on the Internet.
Use git push origin main
to upload, or push, that file to your repository on GitHub. After refreshing the webpage, your file should appear online. The difference I noted above appears here. Note the absence of the -u
flag from the command. That flag was used the first time to establish the connection between the repository on your local machine and on GitHub. Now that that connection has been established, that flag is not needed.
-
Go through the process a few more times by adding additional readings and weeks of course material. Remember to commit changes intentionally so your commit messages make sense. Use
git log
to review your changes. -
Also try creating a new file and adding an assignment. Rewrite the assignment using Markdown, or edit and add in the markers. Go through the process of staging and committing that file, and pushing it to your repository on GitHub.
-
Test your understanding by thinking through the following questions:
- Do you need to push the file to GitHub each time you commit changes to the file, or can you make several commits to a file and push them all to GitHub at once?
- Do you need to use
git init
after after adding a new assignment file to your folder? - What about the
-u
flag in thegit push origin main
command? Does this flag need to be used to add the assignment to your repository on GitHub?
Which best describes what you're doing when you use the command git push
? (select one)
How does the process of pushing differ from the processes of staging and committing discussed in the previous lesson? (select one)
- There is no fundamental difference between these processes. - Staging and Committing set up the files whereas pushing is the act of taking the snapshot. - Staging and committing the files is to communicate with GitHub on the Internet, pushing the changes happens on your local machine. - Staging and committing the files happens on your local machine, pushing the changes is to communicate with GitHub on the Internet.*What happens if you use git push
without staging and committing files? (select all that apply)
What happens if you stage and commit files, but not push the changes? (select all that apply)
- Git won't know what files you want to take a snapshot of. - Git won't take a snapshot of the files. - Your computer won't know what changes to share with GitHub. - Git will take the snapshot of the files* - You will have successfully created a new version of the file.* - You will have communicated with GitHub and shared a copy of the updated files.Do you remember the glossary terms from this section?
GitHub was built for sharing and collaborating on projects. A key advantage of the platform is that you can find lots of bits of software that do many different things—such as code for plugins for WordPress or Leaflet. Increasingly, you might find syllabi or open writing projects. If a project is public, you can save a copy of it to your local machine, work on it, save your amendations and share it on your own GitHub account. Like we've already mentioned, GitHub usefully helps track attribution along the way.
Cloning and forking are the basic functions of this capability. Each are explained in the following sections, followed by an example and activity to further explain.
Cloning a repository means making a copy of a repository on GitHub, to download and work on locally—on your local machine. By entering the following code into your terminal, you can clone any public directory on GitHub:
$ git clone <repository-url>
When you clone a repository from GitHub, the folder that shows up on your local machine comes built-in with a few things. First, Git is already present, so you don't need to initialize the folder. Also, the connection between your local copy and the online repository is already made, so git push origin main
will work (no -u
flag needed).
For practice, let's clone the repository for this workshop about Git and GitHub, which lives on GitHub.
First, let's navigate back to your Desktop folder.
$ cd ~/Desktop
Remember that the ~
refers to your home directory. Now let's find the URL we need to clone the lesson.
First, visit this workshop's page on GitHub.
On the main page, there should be a green Code
button on the right side:
Click the green button and you will see a box with highlighted text under a heading that says Clone with HTTPS
. If you instead see Cloning with SSH
, click the small link that says Use HTTPS
.
Now copy out the text in the box:
Now that you have the text copied, go back to your terminal. Remember, you should be on the Desktop
. (Hint: Use pwd
to find out what your current working directory is.)
Once you are in the Desktop
, type:
$ git clone <copied-url>
If the command is successful, the full Git and GitHub workshop's text will be replicated on your local machine. To navigate into the folder, its name is git
and you can use the cd
command to access it:
$ cd git
Use the ls
command to take a look at the various files in the lesson folder.
Cloning can be especially useful when you're joining a group project that is hosted on GitHub, and you want your changes to eventually be pushed and shared with that same repository.
But maybe that is not possible or ideal. Maybe you don't want to contribute your changes to someone else's repository. Maybe you want to make a derivative of their folder for yourself, on your GitHub account, and make changes there.
Forking is the step you could take to do this.
Forking a repository means making a copy of someone else's repository on GitHub, and saving it to your account on GitHub. This function happens within GitHub, and has nothing to do with what is happening on your local machine. Note that forking will not automatically make the repository appear as a folder on your computer; that's the role of cloning.
In order to "fork" the git
repository into your own GitHub account, follow these steps.
First, go to the repository for this workshop on GitHub. Note the Fork
button in the upper right hand corner. By clicking that button, you can copy, or fork, this repository to your account.
Doing so would also adjust the attribution information in the upper left hand corner. Your username would replace DHRI-Curriculum
, showing that you are looking at a copy of the repository on your account now. Additionally, it will reference the origin account, in this case, DHRI-Curriculum
below after forked from
, since this was the origin point of your fork.
Your local machine would come into play when you want to clone that repository so you can work on it locally. This also means that when you push those changes to GitHub, you would be pushing them to a forked repository associated with your own account.
You might use this method if you were going to teach your own Git & GitHub workshop. You could use our repository as a base for getting started, and add more examples or change some language, clarify something further, or create a connection to another workshop you are giving, etc. This allows us to continue to use the workshop as we have it as well. Also, maybe at a later time, we want to merge some of your changes with ours. We can do that too by revisiting your version history.
- Fork and clone the repository for this workshop on GitHub. Note not only what you are doing, but also where you are working when completing these two different tasks.
- Make changes to the files on your local machine. Remember to save them!
- Use the 3-step process of stage, commit and push to return the amended files to the repository on GitHub.
You'll know you've completed step one when the project folder (called git
) shows up on your local machine.
After you've made and saved the changes, you'll know you've completed step three when your changes appear in the project folder on your GitHub account.
Which best describes cloning? (select one)
- Copying a repository from GitHub to your local machine.* - Copying a repository from your local machine to GitHub. - Copying a repository from someone else's GitHub account to your own. - Copying a repository from your account to someone else's account.Which best describes forking? (select one)
- Copying a repository from GitHub to your local machine. - Copying a repository from your local machine to GitHub. - Copying a repository from someone else's GitHub account to your own.* - Copying a repository from your account to someone else's account.Do you remember the glossary terms from this section?
You made it to the end of this workshop—congratulations! You now know a little more about why using Git and GitHub in your scholarly practice might be advantageous, and what the challenges are. Below the quiz section you'll find a set of readings and tutorials to supplement the lessons outlined herein, and offer additional ways of advancing your use beyond the basics outlined here. There are also additional challenges to test your skills, as well as discussion questions to test your conceptual understanding of these tools. Enjoy!
TODO: Fix relative linking when switching between display formats
1. What tasks could Git and/or GitHub offer support to? (Select all that apply)
- Developing software* - Creating and sharing data sets* - Creating websites* - Writing articles and books* - Collating online resources* - Keeping research notes* - Hosting syllabi and course materials*Revisit lesson What You Can Do with Git and GitHub to learn more.
2. Which best describes what you're doing when you initialize your project folder: (select all that apply)
- You created a new version of your project folder - You told Git to pay attention to your project folder* - You told Git to set up its file structure within your project folder so it can track changes to your files.* - You use the command `mkdir` in your terminal - You use the command `git init` in your terminal*Revisit lesson Creating Syllabus Content Using Markdown to learn more.
3. What are you doing when you set up git? (select one)
- You are creating a new version of the software on your local machine. - You are sending files from your local machine to GitHub? - You are introducing yourself to the software, so it knows who you are.* - You are creating a new version of a project folder on your local machine.Revisit lesson [Setting Up Git](/workshops/git/?page=5 to learn more.
4. Which best describe the process of staging: (select one)
- you telling Git to take a snapshot of changes made to a file. - you telling Git which files with changes you want it to pay attention to.* - you telling git to pay attention to a folder storing files you want to make changes to. - the second part of a two-step process.Revisit lesson Staging and Committing Changes to learn more.
5. Which of the following best describes version control: (select one)
- a software installed on my local machine - the practice of tracking and organizing the state of a file over time, as it changes* - a language that can be read and rendered by some web-based platforms - a cloud-based softwareRevisit lesson What are Git and GitHub? to learn more.
6. Which best describes cloning? (select one)
- Copying a repository from GitHub to your local machine.* - Copying a repository from your local machine to GitHub. - Copying a repository from someone else's GitHub account to your own. - Copying a repository from your account to someone else's account.Revisit lesson Cloning and Forking to learn more.
7. Which best describes where you are working when you're writing in your plain text editor:
- on my local machine* - on the internetRevisit lesson Review of the Command Line to learn more.
8. Which best describes what you're doing when you use the command git push
? (select one)
Revisit lesson Pushing to GitHub to learn more.
- Yasset Perez-Riverol et.al.'s "Ten Simple Rules for Taking Advantage of Git and GitHub" from PLOS Computational Biology is an academic introduction to GitHub, with some rudimentary commands that we cover in our workshop as well. The article also details why and how GitHub works as a collaborative platform.
- Jenny Bryan and Jim Hester's Happy Git and GitHub for the UseR is an online, open-access book on using Git and GitHub within the R programming environment. While most the book emphasizes this use case, the sections "Let's Git started", "IV Git fundamentals", "V Remote Setups", and "VI Daily Workflows" are great general resources for expanding your Git knowledge without facing a mountain of material to wade through.
- Scott Chacon and Ben Straub's whole book Pro Git is available in open-access format on Git's official website. It is a foundational (albeit long) text that details everything you may want or need to know about working with git on your computer. It also has a section on GitHub for those interested.
- Ei Pa Pa Pe-Than, Laura Dabbish, and James D. Herbsleb has written the article "Collaborative Writing at Scale: A Case Study of Two Open-Text Projects Done on GitHub", which details how and why git's pull-based model can be used for collaborative writing at scale. In conclusion, they argue that the model helps contributors either converge and work on perfecting one single project, or adopt and tailor an original project to their own needs.
- Keith Miyake, former Digital Fellow at The Graduate Center, has written an introduction on how to Create Your (FREE) Website Using Github and Jekyll on the Digital Fellow's blog Tagging the Tower, detailing GitHub's "pages" feature that allows you to publish your own advanced website. Using Jekyll, a specific command-line application, you can create essentially unhackable rudimentary websites that are free to host on GitHub.
- On Programming Historian, Amanda Visconti has written "Building a static website with Jekyll and GitHub Pages" which is an introduction on using GitHub "pages" to create a website. Using Jekyll, a specific command-line application, you can create essentially unhackable rudimentary websites that are free to host on GitHub.
- Simon Coll has provided some introductory remarks on why markdown is a good choice for academics in his blog post, Markdown for Students and Academics.
- Steven Ovadia's "Internet Connection: Markdown for Librarians and Academics" is a short article from Behavioral & Social Sciences Librarian that details the use of markdown as a "method of divorcing content from formatting."
- For those who want an introduction to Creative Commons, a concept introduced in our Git and GitHub introductory workshop, the Official website for Creative Commons is a good place to start. They also have a Creative Commons Wiki which may be of interest.
- The University of Rhode Island has created "Open Licensing with Creative Commons: The Creative Commons Licenses", a good introduction to creative commons as a concept with many links to other websites.
- J.R. Dingwall's open-access book Creative Commons: An Educator's Course Guide to Creative Commons is a good place to start for anyone interested in how the Creative Commons can empower the open education movement with "tools that help create better, more flexible and sustainable open educational resources (OER), practices, and policies," as he writes in Part V of the book, "Creative Commons for Educators."
- FreeCodeCamp has an excellent introductory tutorial for how to use git on your computer, called "An intro to Git: What it is and how to use it".
- Don't miss Github's official documentation around Getting Started with GitHub. It is detailed and provides more in-depth examples of how to interact with GitHub using your command line.
- Fill in your syllabus repository further, adding not only to your syllabus.md file, but adding additional content such as assignments.
- Create a website from your syllabus files using GitHub Pages or Jekyll.
- Create an independent or collaborative reading group that tracks readings and notes using Git and GitHub.
- Use Git and GitHub to track, store, and share an independent or collaborative project folder.
- Use Git to track a project on your local machine.
- Find and/or modify an existing public project on GitHub.
- What does your current version control workflow look like and what are the challenges it poses; or how could it be improved?
- How can git support the work you are already doing?
- What additional opportunities does git and/or GitHub and/or Markdown create for your teaching, research or other scholarly work?
- What are the potential benefits and pitfalls of working in the open on the web via a platform like GitHub?