top of page

STRUCTURED QUERY LANGUAGE

Oct 1, 2024

5 min read

0

0

0

SQL TUTORIAL INTRO

Lame AF tutorials start out with some chatGPT-written fluff like "SQL serves as a fundamental tool for analytics because it facilitates the retrieval and analysis of data, which enables synergies across business units" but I'm a straight shooter so here's the low-down.

SQL is THE SHIT. Not like... it's shit... I mean it's THE SHIT... it's amazing.

Let me tell you in simple terms:

  • why I love SQL, and you should love it too

  • how this free SQL tutorial will help you Ace SQL interviews and set you on a path to a well-paid job in Data Analytics and Data Science

  • why SQL and databases exist in the first place

First up: why money SQL is amazing!

SQL Means MORE MONEY

I ❤️ SQL's power and speed compared to Excel. I love that SQL is the PERFECT place for a beginner without much coding experience to start their Data Analytics & Data Engineering journey.

Sure, programming languages like Python & R have their place in the data analysis tech stack, but you'll get the highest return on investment (ROI) on your time if you start by learning SQL.

Speaking of financial terms like ROI, I ❤️ SQL because I love money. More importantly, I love helping other people also earn more money too, and snagging a high-paying job in Data Analytics or Data Science is a great path towards earning more money.

That's exactly the reason why I wrote the best-selling book Ace the Data Science Interview which now has helped thousands of people land 6-figure Data Science & Data Analytics jobs at FAANG tech companies.

I love helping people up skill into data careers, and learning SQL is the gateway to high-paying jobs in Data Science, Data Engineering, and Data Analytics. I'm excited to help you get started down this path!

TL;DR - learn SQL, analyze data, interview well, get paid!

What's In The DataLemur SQL Tutorial

The 30+ free SQL lessons are meant to get you from SQL zero, to SQL hero. We divided the tutorial into an intro, intermediate, and advanced SQL tutorial, because they have different styles & teaching philosophies.

About the Basic SQL Tutorial

The basic SQL lessons are meant for SQL newbies, who might have used Excel but don't have formal coding experience.

Each lesson comes with SQL exercises which you can directly run and execute in the browser – you don't need to install any software to run SQL code!

Don't believe me?

-->TOH MAT KAR MC !

SQL Overview

If you want to start with your first SQL command, feel free to jump to Lesson #1: SQL SELECT!

But, if you're a total beginner, here's some background on what SQL is, and why it's so damn important!

What is SQL?

SQL, which is pronounced "Sequel", NOT "S.Q.L.", stands for "Structured Query Language". It's used to manage & query data stored in a relational database management system (RDBMS). For example, if you were a Data Analyst at Amazon, you might write the following query to compute the average rating of different products:

SELECT 
  product_id,
  AVG(stars) AS avg_stars
FROM reviews
GROUP BY 
  product_id
ORDER BY product_id;

p.s. if you want to try to run this SQL query, copy-paste the code into this Amazon SQL Interview Question!

What is a RDBMS?

Think of a Relational Database Management System (RDBMS) as a vast collection of Excel workbooks. Each workbook (in database terms, we call these "tables") contains different sheets, and on each sheet, you have rows and columns of data. The sheets are organized in a way that the data can be quickly searched, updated, inserted, or deleted.

Why use SQL and Databases instead of Excel?

Anyone whose used Excel or Google Sheets knows how slow and laggy it can get when you get past 10,000 thousand rows and a dozen columns. An RDBMS offers a better way to store large datasets with millions, sometimes even billions of rows. Using SQL, you can then retrieve and analyze this big data from the RDBMS in mere seconds – workloads that would instantly crash Excel or Google Sheets.

Plus multiple access is terrible in Excel. If two people try to edit a workbook at the same time, you'd run into issues. But databases are designed so that many users can read and write data simultaneously without conflicts.

Then there's the issue of data integrity. In Excel, you can easily overwrite or delete data by accident. Databases have features that ensure data integrity and consistency, meaning it helps prevent unwanted changes or deletions.

Most importantly for Data Analysts and Data Scientists, an RDBMS + SQL let's you query and analyze advanced relationships. In Excel you might use VLOOKUP to get data from one sheet based on data in another. In databases, tables can be "related" in complex ways, and SQL lets you query across these relationships easily.



For example, see all the tables music-streaming app Spotify would have, and their complicated relationships! Good luck trying to represent and analyze this in Excel!

In summary, while Excel is a fantastic tool for a range of tasks, when it comes to the issue of handling large datasets, and the associated problems of data integrity and concurrent access, it's no surprise why databases paired with SQL are the industry standard solution compared to Excel.

Why does SQL look like English?

SQL was designed to look a LOT like plain English, which is why we love it so much compared to more confusing languages like Python and R!

But don't let SQL's simplicity and similarity to English fool you... there's a reason tricky FAANG SQL interview questions exist!

What's the difference between SQL, SQLite, MySQL, PostgreSQL, and SQL Server?

People casually use "SQL" interchangeably with "MySQL" and "PostgreSQL". While that's not technically correct, in most cases for beginners in the field, it doesn't matter too much unless you want to be pedantic.

But, if we're trying to be precise, SQL is general, high-level language for querying and manipulating relational databases (RDBMS). MySQL, Postgres, SQLite, and SQL Server are all RDBMs's (relational database management system). You use varying flavors of SQL syntax to query each unique RDBMS. For example, to query Postgres, you write PostgreSQL. To query MySQL... you write MySQL. Confusing naming, I know!

Here's the good news: the syntax for MySQL, PostgreSQL, SQL Server etc. aren't too radically different from one another. That means, if you do complete our SQL tutorial (which is taught with the PostgreSQL dialect), you should be able to adapt to MySQL or SQL Server pretty effortlessly!

Why choose PostgreSQL for this SQL tutorial?

We LOVE using PostgreSQL, and the associated Postgres RDBMS, for a few reasons:

  • Popularity: PostgreSQL is the 2nd most popular flavor of SQL, slightly behind MySQL in fame, but gaining in popularity every single year!

  • Open Source: Postgres is open-source and managed by a community. It has a BSD-style license, which means it can be used, modified, and distributed freely. The meanies over at Oracle bought MySQL, and gave it a GPL license, which is more restrictive than Postgres's BSD license. Same way, SQL Server is propriety to Microsoft, and is very locked down.

  • Extensible: Postgres has a ton of extensions, like PostGIS to become a geospatial database. MySQL, since it's not as open-source, has way less extensions and add-on features.

  • ANSI Compliant-ish: A bunch of nerds over at the American National Standards Institute (ANSI) try to define standards for everything, including SQL! PostgreSQL is one of the most ANSI-compliant flavors of SQL, which means learning it is a great first SQL flavor to have, because it's the LEAST quirky of the SQL flavors.. making it the best foundation to build!


Oct 1, 2024

5 min read

0

0

0

Comments

Share Your ThoughtsBe the first to write a comment.
bottom of page