Contenido del Curso
SQL Intermedio
SQL Intermedio
GROUP BY
Welcome to the Intermediate SQL course!
In the first section, we're diving into how we can group and aggregate data within our tables.
Let's understand what "grouping data" means using a simple example of an employees table:
We have a task to find out the number of employees in each department. To do this, we will group the data by the department
column and use aggregation with the COUNT(*)
function.
Here's what the implementation will look like:
SELECT department, COUNT(*) AS number_of_employees FROM employees GROUP BY department
So, as you can see, the syntax for grouping data looks like this:
Entonces, como puedes ver, la sintaxis para agrupar datos tiene este aspecto:
Nota
AGG_FUNC
se refiere a funciones de agregado como MAX
, MIN
, COUNT
, etc.
SELECT department, AVG(salary) as average_salary FROM employees GROUP BY department
In this section, we will work with the Montreal Metro system database, which contains the metro_travel_time
table.
This table will contain information about the station line(line_name
), its name(station_name
), and the amount of time it takes for a train to travel from one station to the next one(time_to_next_station
).
Here is what this table looks like and the data preview in it:
En este curso, trabajaremos con la base de datos del sistema de metro de Montreal, que contiene la tabla metro_travel_time
.
Esta tabla contendrá información sobre la línea de la estación (line_name
), su nombre (station_name
) y el tiempo que tarda un tren en viajar de una estación a la siguiente (time_to_next_station
).
Así es como luce esta tabla y la vista previa de los datos en ella:
In the assignments, you’ll often use a concept called an alias. An alias is essentially a "nickname" for a column you retrieve with a SELECT
statement. It’s specified using the following syntax:
An alias only affects how the column appears in the response.
For example, instead of MAX(time)
, the column could be called max_time
if you assign that alias. This makes the output more readable and clear.
Swipe to show code editor
Your task is to find the longest time until the next station on each line. This will allow us to determine the longest travel time between stations for each metro line. To do this, use the MAX()
function and alias it as max_time
, grouping the data by the line_name
column.
Brief Instructions
- Retrieve the
line_name
column and the maximum value of thetime_to_next_station
column. - Add the alias
max_time
for the maximum value. - Group the data by the
line_name
column.
Once you've completed this task, click the button below the code to check your solution.
Solución
¡Gracias por tus comentarios!
GROUP BY
Welcome to the Intermediate SQL course!
In the first section, we're diving into how we can group and aggregate data within our tables.
Let's understand what "grouping data" means using a simple example of an employees table:
We have a task to find out the number of employees in each department. To do this, we will group the data by the department
column and use aggregation with the COUNT(*)
function.
Here's what the implementation will look like:
SELECT department, COUNT(*) AS number_of_employees FROM employees GROUP BY department
So, as you can see, the syntax for grouping data looks like this:
Entonces, como puedes ver, la sintaxis para agrupar datos tiene este aspecto:
Nota
AGG_FUNC
se refiere a funciones de agregado como MAX
, MIN
, COUNT
, etc.
SELECT department, AVG(salary) as average_salary FROM employees GROUP BY department
In this section, we will work with the Montreal Metro system database, which contains the metro_travel_time
table.
This table will contain information about the station line(line_name
), its name(station_name
), and the amount of time it takes for a train to travel from one station to the next one(time_to_next_station
).
Here is what this table looks like and the data preview in it:
En este curso, trabajaremos con la base de datos del sistema de metro de Montreal, que contiene la tabla metro_travel_time
.
Esta tabla contendrá información sobre la línea de la estación (line_name
), su nombre (station_name
) y el tiempo que tarda un tren en viajar de una estación a la siguiente (time_to_next_station
).
Así es como luce esta tabla y la vista previa de los datos en ella:
In the assignments, you’ll often use a concept called an alias. An alias is essentially a "nickname" for a column you retrieve with a SELECT
statement. It’s specified using the following syntax:
An alias only affects how the column appears in the response.
For example, instead of MAX(time)
, the column could be called max_time
if you assign that alias. This makes the output more readable and clear.
Swipe to show code editor
Your task is to find the longest time until the next station on each line. This will allow us to determine the longest travel time between stations for each metro line. To do this, use the MAX()
function and alias it as max_time
, grouping the data by the line_name
column.
Brief Instructions
- Retrieve the
line_name
column and the maximum value of thetime_to_next_station
column. - Add the alias
max_time
for the maximum value. - Group the data by the
line_name
column.
Once you've completed this task, click the button below the code to check your solution.
Solución
¡Gracias por tus comentarios!