Introduction to Git

Why is Version Control Essential?

$ git log --pretty=format:"%h (%an) %s" app.py
9e7c2f5 (Ethan) Add logging to greet()
a3d1cbe (Deepa) Fix typo in greeting
6b2e9a1 (Charlie) Refactor: move greeting to main()
f1a72bb (Bob) Add greeting function
d0a1b22 (Alice) Initial version of app.py

What is Version Control?

Version Control Systems (VCS): Tools that track changes to files over time, allowing developers to manage code efficiently and collaborate without silently losing each other's changes
Purpose: Ensures that code history is maintained, and developers can work together without overwriting each other's work

Why is Version Control Essential?

Collaboration: Multiple developers can work on the same project without overwriting each other's work
History Tracking: Easily view, compare, and restore previous versions of code
Branching: Developers can experiment with new features in separate branches without affecting the main code base
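
The history tracking and branching points above can be tried in a throwaway repository. This is a minimal sketch, not a recommended workflow: the file name app.txt, the branch name experiment, and the identity values are all made up for illustration.

```shell
# Throwaway repository in a temporary directory (all names here are made up)
repo="$(mktemp -d)"
cd "$repo"
git init -q
git config user.name "Demo User"          # placeholder identity
git config user.email "demo@example.com"  # placeholder identity

# Two versions of the same file
echo "Hello" > app.txt
git add app.txt
git commit -q -m "Initial version"
echo "Hello, world" > app.txt
git commit -q -am "Expand greeting"

# History tracking: view and compare versions
git --no-pager log --oneline
git --no-pager diff HEAD~1 HEAD -- app.txt

# Branching: experiment without touching the starting branch
start_branch="$(git symbolic-ref --short HEAD)"  # 'main' or 'master', depending on configuration
git checkout -q -b experiment
echo "Experimental line" >> app.txt
git commit -q -am "Try an idea"
git checkout -q "$start_branch"

# History tracking: restore the first version of the file
git checkout -q HEAD~1 -- app.txt
cat app.txt
```

The experiment branch keeps its extra commit, while the starting branch still has only the two original commits.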

How Does Git’s Distributed Version Control Improve Collaboration and Efficiency?

Types of VCS:

Centralized Version Control Systems (CVCS): A single central server stores all versions of the code (e.g., SVN, CVS)
Distributed Version Control Systems (DVCS): Every developer has a complete copy of the repository (e.g., Git, Mercurial)

Comparison

Aspect	Centralized VCS (CVCS)	Distributed VCS (DVCS)
Repository Location	Single central server	Each developer has a complete copy
Offline Work (e.g., viewing history)	Limited	Fully supported
Single Point of Failure	Yes (central server outage)	No (any copy can restore the project)
Backup	Central server must be backed up	Each clone serves as a backup
Performance	Network-dependent	Local operations are faster

How Does Git’s Distributed Version Control Improve Collaboration and Efficiency?

Complete History for Everyone: Every developer’s local copy includes the full project history, making offline work possible
Includes All Branches: Local clones contain all remote branches that were fetched, allowing developers to switch, create, or merge branches offline
Faster Operations: Most Git commands (like commit, diff) are local, providing fast performance
No Single Point of Failure: If the main server goes down, any local copy can be used to restore the project
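
The points above can be simulated on one machine. In this sketch a local bare repository stands in for the central server (an assumption made so the example runs without a network), and the names alice, bob, and restored.git are hypothetical.

```shell
work="$(mktemp -d)"
cd "$work"

# A bare repository plays the role of the shared server (assumption for this demo)
git init -q --bare server.git

# First developer clones, commits, and pushes
git clone -q server.git alice
cd alice
git config user.name "Alice"                # placeholder identity
git config user.email "alice@example.com"   # placeholder identity
echo "v1" > notes.txt
git add notes.txt
git commit -q -m "Add notes"
git push -q origin HEAD
cd "$work"

# Second developer clones and receives the full history
git clone -q server.git bob

# Simulate losing the server
rm -rf server.git

# Everything below works without the server
cd bob
git --no-pager log --oneline        # full history is local
git checkout -q -b offline-work     # branches can be created offline

# Any clone can seed a replacement server
git clone -q --bare . "$work/restored.git"
git --git-dir="$work/restored.git" --no-pager log --oneline
```

The restored repository contains the same commit that was pushed before the simulated outage.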

Why is Git’s Snapshot System a Game-Changer?

What is a Snapshot-Based System?

Snapshot Storage: Git captures the entire state of your project at each commit, like a photograph
Complete Versions: Instead of storing just the changes (deltas), Git stores a complete version of changed project files
Efficient Storage: Unchanged files are linked to previous versions rather than duplicated, saving space
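
One way to observe the storage sharing described above is to compare the object IDs Git assigns to each file in two consecutive commits. This is a sketch under assumed names (unchanged.txt and changing.txt are hypothetical): an unchanged file should resolve to the same object in both commits, while a changed file gets a new one.

```shell
repo="$(mktemp -d)"
cd "$repo"
git init -q
git config user.name "Demo User"          # placeholder identity
git config user.email "demo@example.com"  # placeholder identity

echo "stable content" > unchanged.txt
echo "Version 1" > changing.txt
git add unchanged.txt changing.txt
git commit -q -m "Commit 1"

# Only one of the two files changes in the second commit
echo "Version 2" > changing.txt
git commit -q -am "Commit 2"

# Object IDs of each file as recorded by each commit
old_unchanged="$(git rev-parse HEAD~1:unchanged.txt)"
new_unchanged="$(git rev-parse HEAD:unchanged.txt)"
old_changing="$(git rev-parse HEAD~1:changing.txt)"
new_changing="$(git rev-parse HEAD:changing.txt)"

echo "unchanged.txt: $old_unchanged -> $new_unchanged"
echo "changing.txt:  $old_changing -> $new_changing"
```

Both commits point at the same stored object for unchanged.txt, so its content is kept only once.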

How is it different from Delta-Based Systems (SVN)?

Delta-Based Storage: Only stores the differences between file versions
More Processing: Requires more computing to reconstruct previous versions
Complex Branching: Changes are applied sequentially, making branching more complex

An Example of a Snapshot-Based System

COMMIT 1
- File Content:
```
Welcome
Version 1
```
- Git Storage: Stores a full snapshot of the file as-is
COMMIT 2
- File Content:
```
Welcome
Version 2
```
- Git Storage: Stores a new full snapshot of the modified file
COMMIT 3
- File Content:
```
Hello
Version 3
```
- Git Storage: Stores yet another full snapshot of the file (Git optimizes behind the scenes using compression and shared objects)
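
The three commits above can be recreated to see that each one hands back its complete file directly. This is a minimal sketch; the file name welcome.txt is an assumption, since the example does not name the file.

```shell
repo="$(mktemp -d)"
cd "$repo"
git init -q
git config user.name "Demo User"          # placeholder identity
git config user.email "demo@example.com"  # placeholder identity

printf 'Welcome\nVersion 1\n' > welcome.txt
git add welcome.txt
git commit -q -m "Commit 1"

printf 'Welcome\nVersion 2\n' > welcome.txt
git commit -q -am "Commit 2"

printf 'Hello\nVersion 3\n' > welcome.txt
git commit -q -am "Commit 3"

# Ask each commit for its full copy of the file
git --no-pager show HEAD~2:welcome.txt   # content as of Commit 1
git --no-pager show HEAD~1:welcome.txt   # content as of Commit 2
git --no-pager show HEAD:welcome.txt     # content as of Commit 3
```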

An Example of a Delta-Based System

COMMIT 1
- File Content:
```
Welcome
Version 1
```
- SVN Storage: Stores the complete file for the initial version
COMMIT 2
- Delta Stored: Change line 2 → Version 2
- SVN Storage: Only the difference from version 1 is stored
COMMIT 3
- Delta Stored: Change line 1 → Hello; change line 2 → Version 3
- SVN Storage: Stores only the differences (deltas) from previous version

Why is Git’s Snapshot System a Game-Changer?

Speed: Quickly access any commit without calculating differences
Reliability: Full file versions are always available
Simplified Merging: Merging is faster because complete versions are available

A Short History of Git

History of Git

The Need: In 2005, the Linux kernel development team required a powerful, distributed version control system
The Problem: They had been using BitKeeper, a proprietary tool, and in 2005 its free-of-charge license for kernel developers was withdrawn
The Solution: Linus Torvalds, the creator of Linux, developed Git as a free, open-source replacement
Git is Popular Because of its Key Design Principles:
- Speed: Git was designed to be fast, even for large projects
- Security: Every object is identified by a hash of its content, so history is verifiable and tampering is detectable
- Decentralization: Developers can work independently, without relying on a central server
2005: Git was released as an open-source project
2008: GitHub launched, making Git widely accessible for collaborative projects (Git is a version control tool; GitHub is a cloud-based platform for hosting and managing Git repositories)
2010s: Git adoption surged as open-source communities and companies shifted away from Subversion (SVN) and other centralized tools
2018: Microsoft acquired GitHub, further boosting Git’s presence in the enterprise market
Today: Git is the standard for version control in software development

Key Commercial Products Built Around Git

GitHub: Hosted Git repositories with collaboration, issue tracking, CI/CD, and security tools
GitLab: Git repository hosting with integrated DevOps lifecycle tools (CI/CD, container registry, monitoring)
Bitbucket (by Atlassian): Git hosting with Jira and Trello integration
Azure Repos: Microsoft's Git repository service integrated with Azure DevOps
AWS CodeCommit: Git-based repository service hosted by Amazon Web Services
Cloud Source Repositories (CSR): Google Cloud's Git-compatible source control service for storing and managing code in private Git repositories

What Were Some Popular Version Control Systems Before Git?

Important Version Control Systems Before Git

Tool	Type	Strengths	Weaknesses	Typical Use Case
CVS (Concurrent Versions System)	Centralized (CVCS)	Simple and lightweight	Limited merge support, prone to errors in large projects	Used by early open-source projects
Subversion (SVN)	Centralized (CVCS)	User-friendly with basic features	Network-dependent for most operations, performance degrades with large repos or many branches	Formerly popular for corporate projects
Perforce	Centralized (CVCS)	Excellent performance with large codebases and binary files (like game art and assets)	Complex setup, resource-intensive	Common in game development

Comparing Git vs SVN

Feature	Git	Subversion (SVN)
Type	Distributed (DVCS)	Centralized (CVCS)
Commit Model	Snapshots	Deltas
Offline Work	Fully supported	Limited
Merging	Fast and efficient	Can be slow
Branching	Lightweight and fast	Branching is slower
Used By	Most enterprise and open-source projects	Some legacy enterprise projects

What is GitHub?

Platform on Top of Git: GitHub is a web-based platform built on top of Git that enhances what Git can do by adding collaboration and project management features
Collaboration & Hosting Made Easy: Designed to host Git repositories and provide powerful tools for team collaboration, code review, and project tracking — think of it as Git++
Open Source Friendly: Widely used for hosting open-source projects and connecting with the global developer community
Public and Private Repositories: Choose between public repos for sharing with the world or private repos for secure, team-only collaboration
User-Friendly Interface for Beginners: Offers a simple and intuitive UI for those who are new to Git, making common tasks easier to perform
Pull Requests and Code Reviews: Enables developers to propose changes and collaborate through peer reviews before merging to main branches
Built-In Issue Tracking: Manage bugs, feature requests, and team tasks using GitHub’s issue system
Team Discussions and Knowledge Sharing: Use the Discussions tab as a dedicated space for questions, ideas, and decisions — all in one place
GitHub Actions for Automation: Set up workflows to automatically build, test, and deploy your code whenever changes are pushed
GitHub Pages for Web Hosting: Host static websites straight from your repository — perfect for portfolios, documentation, or demos

Git vs GitHub

Feature	Git	GitHub
Version Control	Distributed version control for local version management	Cloud-based platform for hosting Git repositories that you can make accessible to all team members
Project Management	No built-in project management tools	Issue tracking, milestones, and project boards
CI/CD Integration	Requires manual setup of external CI/CD tools	Integrated support for GitHub Actions to automate workflows
Community	No direct community interaction	Community engagement through stars, forks, and discussions

Can You Give a Practical Example of Creating a New Git Repo and Pushing it to GitHub?

What is a Git Repository?

Git repository: A Git repository stores all your code and its history — like a time machine for your project
- You typically use a separate Git repository for each of your projects
Tracks Every Change with Context: Git records who made each change, when they made it, and why — enabling complete change management
Enables Safe Experimentation: Try new ideas locally without fear — if anything breaks, you can always go back to a previous version
Local Repository for Private Development: You work on your own machine in a fully functional Git environment, even without internet (git init)
Remote Repository for Team Collaboration: A shared repository helps your team work together on the same codebase (create a repository on GitHub and git clone it to your local machine)
Simple Process for Seamless Collaboration: Work locally, commit changes, and push to the remote repo — teammates pull updates to stay in sync

Practical Example: Create a New Git Repo and Push to GitHub

The goal of this workflow is to set up version control for your project using Git and connect it to GitHub

# STEP 1: Create a new local Git repository
git init
# WHAT: Initializes a new Git repo in current folder
# WHEN: You are starting a project from scratch

# STEP 2: Create a new file or modify files
echo "Hello Git" > hello.txt
# WHAT: Creates a file with sample content

# STEP 3: Check the status of the repo
git status
# WHAT: Shows untracked/modified/staged files
# WHY: Helps verify what will be committed

# STEP 4: Add file(s) to the staging area
git add hello.txt
# VARIATION:
# git add .   → stages all changes under the current folder
# git add -A  → stages all changes in the whole repo (incl. deleted files)

# STEP 5: Commit staged files
git commit -m "Initial commit"
# WHAT: Creates a snapshot of your staged changes
# BEST PRACTICE: Use short, clear commit messages

# STEP 6: Create a new GitHub repository
# Visit https://github.com/new
# NAME: my-first-git-repo (example)
# Keep the repo EMPTY (no README/.gitignore)

# STEP 7: Connect local repo to GitHub
git remote add origin \
https://github.com/youruser/my-first-git-repo.git
# WHAT: Adds a remote named 'origin' pointing to GitHub
# WHY: So you can push/pull code to/from GitHub

# STEP 8: Push code to GitHub
git branch -M main
git push -u origin main
# WHAT: Renames the current branch to 'main' (older Git
#       defaults to 'master'), then pushes it to 'origin'
# WHY: Publishes your local code to GitHub
# -u: Sets 'origin main' as default for future pushes
# Future pushes → just run: git push
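
To rehearse the same steps without a GitHub account, a local bare repository can stand in for the remote. This is a sketch under that assumption: the path remote.git and the teammate directory are hypothetical, and a real GitHub remote would use an https or ssh URL instead of a local path.

```shell
work="$(mktemp -d)"
cd "$work"

# Stand-in for the empty GitHub repository (assumption for this demo)
git init -q --bare remote.git

# Steps 1 to 5: create the local repository and the first commit
mkdir my-first-git-repo
cd my-first-git-repo
git init -q
git config user.name "Demo User"          # placeholder identity
git config user.email "demo@example.com"  # placeholder identity
echo "Hello Git" > hello.txt
git add hello.txt
git commit -q -m "Initial commit"

# Steps 7 and 8: connect the remote and push
git branch -M main
git remote add origin "$work/remote.git"
git push -q -u origin main

# A teammate clones the shared repository
cd "$work"
git clone -q -b main remote.git teammate

# The original author pushes a second commit
cd "$work/my-first-git-repo"
echo "Second line" >> hello.txt
git commit -q -am "Add second line"
git push -q

# The teammate pulls to stay in sync
cd "$work/teammate"
git pull -q --ff-only
cat hello.txt
```

After the pull, the teammate's copy contains both commits and both lines of hello.txt.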

How Git Organizes and Stores Your Code

Working Directory – Where Files Live and Evolve: This is your active project folder where you create and edit files. It reflects your latest code and uncommitted changes
Index (Staging Area) – Prepare Files for Commit: When you run git add, Git stores a snapshot of the file in the staging area, ready to be committed
Local Object Database – Stores Full Commit History: Once you run git commit, Git records tree and commit objects alongside the blobs already written by git add, all inside the hidden .git/objects folder. This is your full version history.
Remote Object Database – Shared Copy in the Cloud: When you run git push, Git sends your commits to a remote repository like GitHub, GitLab, or Bitbucket so others can collaborate
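
The first three areas can be inspected directly from the command line. This is a small sketch in a throwaway repository; the file name notes.txt is made up, and the remote step is left out because it needs a second repository.

```shell
repo="$(mktemp -d)"
cd "$repo"
git init -q
git config user.name "Demo User"          # placeholder identity
git config user.email "demo@example.com"  # placeholder identity

# Working directory: the file exists on disk but Git does not track it yet
echo "draft" > notes.txt
git status --short               # prints: ?? notes.txt

# Index (staging area): git add records the file and its content ID
git add notes.txt
git ls-files --stage             # mode, object ID, stage number, path

# Local object database: git commit adds a tree and a commit object
git commit -q -m "Add notes"
git cat-file -t HEAD             # prints: commit
git cat-file -t 'HEAD^{tree}'    # prints: tree
git cat-file -t HEAD:notes.txt   # prints: blob

# Summary of objects stored under .git/objects
git count-objects -v
```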

Summary

The working directory supports fast development and local testing
The index allows intentional, controlled commits
The local database provides complete project history and safety
The remote repo enables collaboration, sharing, and backup

Scenarios

Scenario 1 – Safely Try New Code Without Committing Everything: You're experimenting with a new feature. The working directory lets you make and test changes freely. You can decide later what to stage and commit, keeping your version history clean.
Scenario 2 – Commit Only What Matters: You fixed a bug and added a feature in one go. Use the index (staging area) to carefully stage only the bug fix for the first commit. This improves commit quality and makes your history more meaningful.
Scenario 3 – Roll Back to a Previous Version: Something broke after a few changes. Thanks to the local object database, you can go back to a previous commit, with no need to rewrite or remember anything manually.
Scenario 4 – Sync Work with the Team: After finishing your task, you push your commits. The remote object database makes those commits available to your team, so they can review and build on your work.
Scenario 5 – Recover from Mistakes: You accidentally deleted a file in the working directory. Since it was committed earlier and stored in the local object database, you can restore it easily using Git commands.
Each layer exists to provide a clear, safe, and flexible workflow — perfect for individuals and teams alike
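
Scenarios 2 and 5 can be sketched as follows. The file names fix.txt and feature.txt are hypothetical, and git checkout HEAD -- <file> is only one way to recover a file (newer Git versions also offer git restore).

```shell
repo="$(mktemp -d)"
cd "$repo"
git init -q
git config user.name "Demo User"          # placeholder identity
git config user.email "demo@example.com"  # placeholder identity

echo "buggy" > fix.txt
echo "base" > feature.txt
git add fix.txt feature.txt
git commit -q -m "Initial commit"

# Scenario 2: two unrelated edits, committed separately
echo "fixed" > fix.txt
echo "new feature" >> feature.txt
git add fix.txt                      # stage only the bug fix
git commit -q -m "Fix bug"
git status --short                   # feature.txt is still modified and unstaged
git add feature.txt
git commit -q -m "Add feature"

# Scenario 5: recover an accidentally deleted file
rm fix.txt
git checkout HEAD -- fix.txt         # copy the committed version back
cat fix.txt
```

The "Fix bug" commit touches only fix.txt, which keeps the history easy to read and to revert.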

How a File Moves Through Git: From Untracked to Pushed

Flow for a new file
- Start as Untracked – File Not Known to Git Yet: When you create a new file, Git does not track it until you explicitly add it
- Staged – Ready to Be Committed: Running git add makes the file tracked and places it in the staging area, marking it as ready for the next commit
- Committed – Saved in Local Clone: Use git commit to take a snapshot of staged files; they are now part of your local Git history inside the .git folder
- Pushed – Synced to Remote Repository: Use git push to send your committed changes to a remote like GitHub or GitLab for backup and collaboration
- Easy to Visualize Flow: File states flow like this — Untracked → Tracked and Staged → Committed → Pushed
Flow for an already committed file
- Unmodified – No Changes Since Last Commit: The file remains in a stable state; nothing new has been changed after the last commit
- Modified – You Made Changes, But Didn’t Stage Yet: When you edit a file, Git marks it as modified but does not include it in the next commit unless you stage it
- Staged – Change Marked for the Next Commit: Use git add to place the modified file's current content in the staging area
- Committed – Change Saved in Local Clone: Use git commit to take a snapshot of staged files; they are now part of your local Git history inside the .git folder
- Pushed – Synced to Remote Repository: Use git push to send your committed changes to a remote like GitHub or GitLab for backup and collaboration
- Easy to Visualize Flow: File states flow like this — Modified → Staged → Committed → Pushed
Foundation: This lifecycle is the foundation of Git’s version control — it allows developers to track, manage, and share changes confidently and consistently across teams and time
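
The local part of both flows can be watched with git status --short, which prints a two-character code per file. This is a sketch in a throwaway repository; file.txt is a made-up name, and the push step is omitted because it needs a remote.

```shell
repo="$(mktemp -d)"
cd "$repo"
git init -q
git config user.name "Demo User"          # placeholder identity
git config user.email "demo@example.com"  # placeholder identity

# Flow for a new file
echo "one" > file.txt
untracked="$(git status --short)"    # "?? file.txt"
git add file.txt
staged_new="$(git status --short)"   # "A  file.txt"
git commit -q -m "Add file"
unmodified="$(git status --short)"   # empty: nothing to report

# Flow for an already committed file
echo "two" >> file.txt
modified="$(git status --short)"     # " M file.txt"
git add file.txt
staged_mod="$(git status --short)"   # "M  file.txt"
git commit -q -m "Update file"

printf '%s\n' "$untracked" "$staged_new" "$modified" "$staged_mod"
```

In the two-character code, the first column describes the staging area and the second column describes the working directory.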

What happens in the background when you commit code? (git commit)

Record Staged Content in the Local Object Database: Git builds the new snapshot from the content recorded in the staging area; the file contents themselves are already stored as blobs in the .git/objects folder from when you ran git add
Create a Tree Object for the Snapshot: Git builds a tree structure representing the exact state of your files at the time of the commit
Generate a Commit Object with Metadata: Git adds information like author name, timestamp, and commit message to describe what the change was about
Link Commit to the Snapshot (Tree): The commit object points to the tree object, creating a traceable connection between the message and the actual files
Assign a Unique Commit ID: Every commit is identified by a hash of its contents (SHA-1 by default), which can be used to identify, share, or roll back to that point
Chain Commits Together with a History Pointer: Each commit includes a reference to its parent commit (a merge commit has more than one), forming a chain that is the full history of your project
Why It Matters: Commits help you track progress, roll back mistakes, and understand the evolution of your code over time
- Git’s power comes from using commits as the foundation for structure, traceability, and control
- Branches and tags point to commits — no commits, no tracking or navigation
- Use commits to checkout, tag, revert, or explore any version
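
The commit, tree, and parent links described above can be read back with git cat-file. This is a minimal sketch in a throwaway repository; readme.txt is a hypothetical file name.

```shell
repo="$(mktemp -d)"
cd "$repo"
git init -q
git config user.name "Demo User"          # placeholder identity
git config user.email "demo@example.com"  # placeholder identity

echo "first" > readme.txt
git add readme.txt
git commit -q -m "First commit"
echo "second" >> readme.txt
git commit -q -am "Second commit"

# The commit object: tree, parent, author, committer, message
git cat-file -p HEAD

# The tree object the commit points to: one entry per file
tree_id="$(git rev-parse 'HEAD^{tree}')"
git cat-file -p "$tree_id"

# The parent link that chains commits together
parent_id="$(git rev-parse HEAD~1)"
echo "parent of HEAD: $parent_id"

# A branch is a name that resolves to a commit ID
branch_ref="$(git symbolic-ref HEAD)"     # e.g. refs/heads/main
git rev-parse "$branch_ref"
```

The first commit has no parent line in its output, because nothing comes before it in the chain.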
