Handling_SQL

The repository Handling SQL contains :

Python file (.py) related on :
- Database connection with psycopg2 and sqlalchemy,
- SQL commands that have an equivalent in pandas
Markdown file (.md) related on :
- Concepts explanation,
- Syntax example
SQL file (.sql) related on :
- Code that will be run on Dbeaver,
- Practice samples on SQL commands like SELECT, CREATE, INSERT, UPDATE, etc

Python

To run efficiently the code provided in .py you must :

Create a virtual environnement to make sure that the packages does not collapse with other dependencies

python3 -m venv name_env

To activate the virtual environnement that you name :

source path/name_env/bin/activate

Install the packages required for this repo :

pip install -r requirements.txt

SQL

To run the code in .sql file you must have :

Postgres installed on your computer. If not follow the steps :
- install Postgres with apt
```
sudo apt install postgresql postgresql-contrib -y
```
- test your interface
```
sudo -u postgres psql
```
- create user with password, then create database that owns by your new user. Grant all privileges for this users
```
CREATE USER user_name WITH PASSWORD 'user_pwd';
CREATE DATABASE db_user OWNER user_name;
GRANT ALL PRIVILEGES ON DATABASE db_user TO user_name;
```
- remember your user_name et the password associed on that user
Dbeaver (community edition) is used because it's simple to manipulate and configure. Here is the step to follow :
- install Dbeaver :
```
curl -fsSL https://dbeaver.io/debs/dbeaver.gpg.key | sudo gpg --dearmor -o /etc/apt/trusted.gpg.d/dbeaver.gpg
echo "deb https://dbeaver.io/debs/dbeaver-ce /" | sudo tee /etc/apt/sources.list.d/dbeaver.list
sudo apt update && sudo apt install dbeaver-ce -y
```
- Download the pagila database on : https://www.postgresql.org/ftp/projects/pgFoundry/dbsamples/pagila/pagila/.
- unzip the file pagila-0.10.1.zip.
- Go to your Dbeaver interface : Database --> New Database connection --> select Postgresql --> Replace the information (database, Username and password) from the user creation on postgres step and finally Test Connection.
- Normally you saw your database connected click : Database --> your_database`` (right click on it) --> SQL editor --> Open SQL script --> Right click on the new windows --> File --> Import SQL scipt
- You need to import the three .sql file from pagila folder one by one

Retrieve information from columns that you need with : SELECT
Filter the information with condition that you specified : WHERE
Order the numerical values by : ORDER BY
Acknowledge the types existed on SQL : BOOLEAN, INTEGER, DATE...
How to manipulate string type : ||, LENGTH(), UPPER()...

P2 : Aggregation & Relational Logic

Basic Aggregation : COUNT, SUM, AVG, MIN, MAX
Grouping by one or multiple columns by : GROUP BY
Understanding Primary Keys (PK) and Foreign Keys (FK) : INNER JOIN
Finding "missing" data : OUTER JOIN
Creating "If-Then-Else" logic inside a SELECT statement to categorize data : CASE WHEN
Make nested queries : Subqueries --> SELECT(... SELECT(...))

P3 : Analytics

Using WITH clauses to break complex queries into readable chunks : CTE
Window Functions part 1 : RANK, ROW_NUMBER(), DENSE_RANK
Window Function part 2 : LAG(), LEAD()
Window Function part 3 : SUM() OVER(...), AVG() OVER(...)
Set Operations : UNION, UNION ALL , INTERSECT, EXCEPT
Arrays & JSONB in Postgres : jsonb_build_object(...)

P4 : Database Design & Management

Data Definition Language : CREATE TABLE, ALTER TABLE, DROP TABLE
Data Integrity Constraints : PRIMARY KEY, REFERENCES, UNIQUE, CHECK(...)
Data Manipulation Language : INSERT, UPDATE, DELETE
Views and Materialized Views : CREATE VIEW ..., CREATE MATERIALIZED VIEW
Indexing to speed up the search : CREATE INDEX ...
Transactions : BEGIN, COMMIT, ROLLBACK

P5 : Python integrations

Preventing SQL Injection by passing variables safely in Python : Parameterized Queries
The Pandas Workflow : pd.read_sql(), df.to_sql()
Using the SQLAlchemy Engine for better connection management : SqlAlchemy
Extract data from a CSV, Transform it in Pandas, Load it into Postgres : ETL

P6 : Advanced Concepts

Handling hierarchical data (e.g., Organization charts, Category trees) : Recursive CTE
Simple PL/pgSQL functions to encapsulate logic : Procedures & Functions

Project

Part 1 : Design the Schema (Tables, Keys) that needed to handle a example of Stock market analysis
Part 2 : Use Python to generate or fetch data (using yfinance library) and load it into the DB.
Part 3 : Add a database view and a Python-based plotting script to visualize historical data.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Handling_SQL

Python

SQL

Contents

P1 : Basics Queries

P2 : Aggregation & Relational Logic

P3 : Analytics

P4 : Database Design & Management

P5 : Python integrations

P6 : Advanced Concepts

Project

About

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
P1		P1
P2		P2
P3		P3
P4		P4
P5		P5
P6		P6
Project		Project
Recap-P1_P4		Recap-P1_P4
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Handling_SQL

Python

SQL

Contents

P1 : Basics Queries

P2 : Aggregation & Relational Logic

P3 : Analytics

P4 : Database Design & Management

P5 : Python integrations

P6 : Advanced Concepts

Project

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages