Mastering Data Joins: Practical Advice for Data Analysts

    Data joins are a powerful way to combine information from different sources into a single, unified dataset. Whether you’re merging customer profiles with transaction records or aligning sales data with marketing efforts, joins give you the tools to uncover hidden relationships and see the bigger picture.

    Integrating multiple data sources is critical for streamlining analysis and minimizing the chance of overlooking key insights. When businesses have a 360-degree view of their data, they can develop smarter, data-driven strategies and maintain a competitive edge.

    In fact, the McKinsey Global Institute estimates that leveraging data and analytics effectively could generate $9.5 trillion to $15.4 trillion annually across industries.

    This guide is designed to help you master data joins. It focuses on practical, hands-on techniques that can help you manage large and messy datasets. Here’s what you’ll take away from it:

    • A clear understanding of how data joins work and why they’re essential
    • Advanced strategies to solve frequent problems
    • Practical ways to integrate joins into your workflow
    • Advanced tools that help you unify disparate datasets and turn raw data into actionable insights

    Setting Up a Database for Testing Joins

    If you want to understand how joins work, you need to create a sample database where you can explore different types of joins in action. Here we will walk you through setting up three tables: university, ranking_system, and university_ranking_year. These tables reflect real-world scenarios like managing university rankings, giving you a practical way to experiment with joins.

    Step 1: Create the university Table

    The first table, university, is where you’ll store basic information about universities. Each university gets a unique ID and a name.

    SQL
    CREATE TABLE university(
        id int primary key identity(1,1),
        university_name varchar(200) NOT NULL
    );
    • The id column is an auto-incrementing primary key. It ensures every university is uniquely identified.
    • The university_name column stores the university’s name. It’s a required field.
    Sample university Database

    Step 2: Create the ranking_system Table

    Next, you’ll set up the ranking_system table. This table tracks the different systems used to evaluate and rank universities.

    SQL
    CREATE TABLE ranking_system(
        id int primary key identity(1,1),
        system_name varchar(100) NOT NULL
    );
    • The id column serves as a primary key, just like in the first table. Each ranking system gets its own unique identifier.
    • The system_name column holds the name of the ranking system. It’s required to keep things organized.
    Sample ranking_system Database

    Step 3: Create the university_ranking_year Table

    The third table, university_ranking_year, brings everything together. This is where you’ll store the rankings, including which university was ranked, by which system, in which year, and with what score.

    SQL
    CREATE TABLE university_ranking_year(
        id int primary key identity(1,1),
        university_id int foreign key references university(id),
        ranking_criteria_id int foreign key references ranking_system(id),
        year int NOT NULL,
        score int NOT NULL
    );
    • The id column is the primary key for this table, providing a unique identifier for each record.
    • The university_id column links to the id column in the university table.
    • The ranking_criteria_id column links to the id column in the ranking_system table.
    • The year column records the year of the ranking.
    • The score column shows the ranking score given to the university. Both year and score are required fields.
    Sample university_ranking_year Database

    Step 4: Populate the Tables

    Once the tables are created, you’ll need to fill them with sample data. The sample data will simulate real-world scenarios, helping you see how joins can be applied in practice.

    Insert Data into university:

    SQL
    INSERT INTO university (university_name)
    VALUES 
        ('Harvard University'),
        ('Massachusetts Institute of Technology'),
        ('Stanford University'),
        ('University of Cambridge'),
        ('California Institute of Technology'),
        ('Princeton University'),
        ('University of Oxford'),
        ('Yale University'),
        ('Columbia University');

    Insert Data into ranking_system:

    SQL
    INSERT INTO ranking_system (system_name)
    VALUES 
        ('QS World University Rankings'),
        ('Times Higher Education Rankings'),
        ('Academic Ranking of World Universities'),
        ('U.S. News & World Report'),
        ('CWTS Leiden Ranking');

    Insert Data into university_ranking_year:

    SQL
    INSERT INTO university_ranking_year (university_id, ranking_criteria_id, year, score)
    VALUES 
        (1, 1, 2023, 95),
        (2, 2, 2023, 92),
        (3, 3, 2023, 89),
        (4, 4, 2023, 87),
        (5, 5, 2023, 85);

    Step 5: Validate the Relationships

    After populating the tables, it’s important to check that the relationships between them are correct. This validation step ensures your database is ready to explore different types of joins.

    Here’s what to verify (the query after this list checks both at once):

    • All university_id values in university_ranking_year should match valid id values in the university table.
    • All ranking_criteria_id values in university_ranking_year should match valid id values in the ranking_system table.
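
    A minimal sketch of that check against the tables above: left-join the bridge table to its parents and look for rows where the parent key comes back NULL. Any hits are orphaned records; an empty result means every foreign key resolves correctly.

    SQL
    -- Orphan check: rankings that reference a missing university or ranking system
    SELECT r.id, r.university_id, r.ranking_criteria_id
    FROM university_ranking_year AS r
    LEFT JOIN university AS u ON r.university_id = u.id
    LEFT JOIN ranking_system AS s ON r.ranking_criteria_id = s.id
    WHERE u.id IS NULL OR s.id IS NULL;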

    With that foundation in place, let’s explore how to apply joins to this database.

    The Basics of Data Joins

    Data joins combine two datasets by linking them through a shared key. This process allows you to connect information from separate tables, making it easier to draw insights that wouldn’t be possible with isolated data.

    SQL joins diagram (Source: Cartma34)

    Picking the right type of join determines how the data is merged and influences the accuracy of your results. Below, we’ll break down the most common types of joins, explaining how each works and when to use it.

    Inner Joins

    An inner join links two tables by returning only the rows with matching values in specified columns. It’s a powerful way to combine related data and analyze shared characteristics.

    SQL
    SELECT TOP 10 
        a.id,
        a.university_name, 
        b.year, 
        b.score
    FROM university AS a
    INNER JOIN university_ranking_year AS b
    ON a.id = b.university_id
    ORDER BY b.score DESC;

    This query retrieves the top 10 universities along with their ranking year and score. It includes only universities with recorded rankings, using an inner join to connect the university table with the university_ranking_year table through the id and university_id columns. The results are sorted by score in descending order, so the highest-ranking universities are displayed first.

    Inner Join Output

    Let’s talk about best practices for using inner joins effectively (a short sketch applying them follows the list):

    • Filter Early: Add conditions in your WHERE clause or directly in the join to narrow down the dataset before processing. This keeps queries efficient and focused.
    • Index Your Columns: When joining large tables, make sure the columns used for the join are indexed. This can significantly speed up query performance.
    • Use Aliases for Clarity: When dealing with tables with similar column names, table aliases can prevent confusion and make your queries easier to read and debug.
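
    Here’s a minimal sketch combining the first two practices; the index name is illustrative, and the year filter is just an example condition:

    SQL
    -- Index the join key once to speed up repeated joins (index name is illustrative)
    CREATE INDEX idx_ury_university_id ON university_ranking_year(university_id);

    SELECT 
        a.university_name, 
        b.score
    FROM university AS a
    INNER JOIN university_ranking_year AS b
        ON a.id = b.university_id
        AND b.year = 2023   -- filter early, inside the join
    ORDER BY b.score DESC;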

    Left Join

    A left join pulls all rows from the primary (left) table and any matching rows from the secondary (right) table. If there’s no match, the result still includes the left table’s data but with NULL values in place of missing right table entries.

    SQL
    SELECT 
        a.id,
        a.university_name, 
        b.year,
        b.score
    FROM university AS a
    LEFT JOIN university_ranking_year AS b
    ON a.id = b.university_id;

    In our example, a left join ensures we see all universities, even if they have no ranking records. It keeps every university in the results while adding ranking data where available. If a university isn’t ranked, its ranking year and score will appear NULL.

    Left Join Output

    Best Practices for Left Joins

    • Keep the Primary Table Intact: Before joining, ensure that the left table contains all necessary data. If data is missing at this stage, fixing it later will be difficult.
    • Consider INNER JOIN If Possible: If you don’t need unmatched rows, an INNER JOIN may be more efficient.
    • Handle NULL Values Properly: Use COALESCE(right_table.column, default_value) to replace NULLs with meaningful values, as in the sketch below.
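
    Applied to the earlier left join, that NULL handling might look like this (the CAST keeps the replacement text type-compatible with the integer year column):

    SQL
    SELECT 
        a.university_name, 
        COALESCE(CAST(b.year AS VARCHAR(4)), 'N/A') AS ranking_year,
        COALESCE(b.score, 0) AS ranking_score
    FROM university AS a
    LEFT JOIN university_ranking_year AS b
        ON a.id = b.university_id;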

    Right Join

    A right join works like a left join but in reverse. Instead of keeping all rows from the primary (left) table, it retains all rows from the secondary (right) table and brings in matching data from the left table where available. If there’s no match, the left table’s columns return as NULL.

    SQL
    SELECT 
        a.id AS ranking_system_id,
        a.system_name,
        b.university_id,
        b.year,
        b.score
    FROM ranking_system AS a
    RIGHT JOIN university_ranking_year AS b
    ON a.id = b.ranking_criteria_id;

    This query pulls ranking data from the university_ranking_year table and links it to the ranking_system table using ranking_criteria_id. If a ranking system is assigned, its name appears in the results. If not, the ranking is listed with a NULL value, signaling it lacks a corresponding system.

    Right Join Output

    Why does this matter? When analyzing university rankings, it’s important to spot missing or mismatched ranking systems. If rankings exist without a proper classification, it could point to data entry issues or inconsistencies in the dataset. Running this query helps validate the data, making it easier to clean up errors and confirm that every ranking is assigned to the correct system.

    Best Practices for Right Joins

    • Use When Right Table is Mandatory: Use a RIGHT JOIN only if all records from the right table must be retained, even if there are no matching records in the left table.
    • Avoid Unintentional Data Expansion: RIGHT JOIN may introduce unintended duplication or large datasets if not used properly.
    • Use Explicit Column Selection: Instead of SELECT *, choose specific columns to avoid unnecessary data retrieval.

    Right joins are less common than left joins, but they can be useful in specific cases where the secondary table’s data must be prioritized. When in doubt, test whether rewriting the query as a left join improves clarity, as in the sketch below.
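
    For reference, the right join above is equivalent to this left join with the table order swapped; many analysts find this form easier to read:

    SQL
    SELECT 
        a.id AS ranking_system_id,
        a.system_name,
        b.university_id,
        b.year,
        b.score
    FROM university_ranking_year AS b
    LEFT JOIN ranking_system AS a
        ON a.id = b.ranking_criteria_id;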

    Full Outer Join

    A full outer join pulls all records from both tables, whether they match or not. If a match exists, the rows are combined. If not, the result includes NULL values in place of missing data. This makes it useful when you need a complete view of both datasets, ensuring no records are left out.

    SQL
    SELECT  
        a.id,
        a.university_name, 
        b.year,
        b.score, 
        c.system_name
    FROM university AS a
    FULL OUTER JOIN university_ranking_year AS b
        ON a.id = b.university_id
    FULL OUTER JOIN ranking_system AS c
        ON b.ranking_criteria_id = c.id;

    This query merges data from three tables—university, university_ranking_year, and ranking_system—using full outer joins. It retrieves university names, ranking years, scores, and ranking system names. If a university has no ranking, its score and ranking system will return as NULL. Conversely, the university_name column will return NULL if a ranking exists without a matching university record.

    Full Outer Join Output

    Best Practices for Full Outer Join

    • Check for Missing Data: Use full outer joins when reconciling datasets to confirm that no records are lost in the process.
    • Use CASE or COALESCE for NULL Handling: Since full outer joins produce NULLs for non-matching rows, handle them properly to avoid data inconsistencies.
    • Limit Data When Possible: Applying a TOP or LIMIT clause, or specific filtering criteria, can prevent excessive result sets.

    Full outer joins provide a broad view of your data, making them a strong choice when analyzing relationships between multiple datasets. They’re particularly useful for spotting gaps, inconsistencies, or missing records in datasets that should align, as in the sketch below.
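
    Putting those practices together, here’s a sketch that surfaces only the gaps: rows where either side of the relationship is missing.

    SQL
    SELECT  
        COALESCE(a.university_name, 'No university record') AS university_name,
        COALESCE(c.system_name, 'Not Ranked') AS ranking_system
    FROM university AS a
    FULL OUTER JOIN university_ranking_year AS b
        ON a.id = b.university_id
    FULL OUTER JOIN ranking_system AS c
        ON b.ranking_criteria_id = c.id
    WHERE a.id IS NULL OR c.id IS NULL;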

    Cross Join

    A cross join creates a Cartesian product of two tables, meaning every row from the first table is paired with every row from the second. The result set grows quickly as the total number of rows equals the product of the row counts from both tables.

    SQL
    SELECT 
        u.id, 
        u.university_name, 
        r.id AS ranking_id, 
        r.system_name
    FROM 
        university AS u
    CROSS JOIN 
        ranking_system AS r;

    This query combines universities from the university table with ranking systems from the ranking_system table. Every university is matched with every ranking system, regardless of their actual ranking relationship. The result is a dataset that includes every possible pairing.

    Cross Join Output

    Best Practices for Cross Joins

    • Filter Where Possible: Applying a WHERE clause helps control the number of results by eliminating unnecessary combinations.
    • Optimize Indexing: While cross joins don’t use indexes directly, optimizing indexing in related queries can improve performance.
    • Leverage Temporary Tables: Consider temporary tables to break down complex queries if working with large datasets.

    Cross joins are useful when you need to generate all possible combinations between two datasets. Common scenarios include A/B testing setups, product recommendation modeling, or assigning multiple marketing campaigns to different customer segments. If every possible combination is needed, they get the job done; if not, another join may be a better fit. The sketch below shows how to filter a cross join down to meaningful pairs.
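
    As a sketch of that filtering advice, this query generates every university/ranking-system pairing, then keeps only the combinations with no ranking on record, which is handy for spotting coverage gaps:

    SQL
    SELECT 
        u.university_name, 
        r.system_name
    FROM university AS u
    CROSS JOIN ranking_system AS r
    WHERE NOT EXISTS (
        SELECT 1
        FROM university_ranking_year AS y
        WHERE y.university_id = u.id
          AND y.ranking_criteria_id = r.id
    );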

    Overcoming Common Challenges in Data Joins

    While data joins are powerful tools, they also come with their share of challenges. These can range from mismatched keys to performance bottlenecks, especially when working with large datasets. 

    Key Mismatches in SQL Joins

    Key mismatches happen when join conditions fail because keys are stored inconsistently across tables. When keys don’t match exactly, queries return fewer rows than expected. When they match only partially, they can produce duplicate records. And when a foreign key value has no counterpart in the related table, it creates an orphaned record. All of these issues disrupt database integrity and leave gaps in reports.

    These problems might not be obvious at first, but they add up quickly and can make your data unreliable, causing reports to return incomplete or misleading insights.

    Key mismatches usually stem from three issues:

    Typos and Inconsistencies

    Small differences in spaces (" Harvard " vs. "Harvard") or case ("MIT" vs. "mit") can prevent matches. You can overcome these issues by:

    • Trimming spaces to remove unnecessary gaps
    • Converting text to lowercase for uniformity
    • Applying zero-padding where needed to align numeric keys
    SQL
    -- Assumes the key columns are stored as text; see the data type section below
    UPDATE university
    SET id = LPAD(TRIM(LOWER(id)), 3, '0');

    UPDATE university_ranking_year
    SET university_id = LPAD(TRIM(LOWER(university_id)), 3, '0');

    Key Data Type Mismatch Issues

    Joins break when data types don’t match. If one table stores an ID as the string '001' (VARCHAR) and another as the number 1 (INT), SQL treats them as different values, causing the join to fail. Since joins require exact matches, mismatched types prevent tables from linking properly.

    Some databases automatically convert types, like VARCHAR to INT, but this slows queries down by applying conversions at runtime for every row, increasing CPU and memory usage. Others reject mismatched types entirely, throwing errors instead of returning results. You can overcome this issue by:

    • Standardizing IDs as either VARCHAR or INT across all tables
    • Converting numeric keys to text and applying padding to match formats
    • Adjusting column types at the database level to enforce consistency
    SQL
    ALTER TABLE university MODIFY id VARCHAR(10);
    ALTER TABLE university_ranking_year MODIFY university_id VARCHAR(10);
    SELECT 
        u.id AS university_id, 
        u.university_name, 
        r.year, 
        r.score, 
        s.system_name
    FROM university AS u
    LEFT JOIN university_ranking_year AS r
        ON CAST(u.id AS CHAR) = LPAD(CAST(r.university_id AS CHAR), 3, '0')
    LEFT JOIN ranking_system AS s
        ON CAST(r.ranking_criteria_id AS CHAR) = CAST(s.id AS CHAR);

    NULL Values

    NULL values in key columns cause joins to fail, leading to missing records. Here’s what you can do to resolve it:

    • Use COALESCE() to assign a default value like "Unknown" when an ID is missing
    • Convert NULL years to "N/A" and NULL scores to 0 for consistency
    • Replace NULL ranking system names with "Not Ranked" to keep reports complete

    SQL
    SELECT 
        COALESCE(u.id, 'Unknown') AS university_id,
        u.university_name, 
        COALESCE(CAST(r.year AS CHAR), 'N/A') AS ranking_year,
        COALESCE(r.score, 0) AS ranking_score,
        COALESCE(s.system_name, 'Not Ranked') AS ranking_system
    FROM university AS u
    LEFT JOIN university_ranking_year AS r
        ON COALESCE(LPAD(TRIM(LOWER(u.id)), 3, '0'), 'Unknown') = 
           COALESCE(LPAD(TRIM(LOWER(r.university_id)), 3, '0'), 'Unknown')
    LEFT JOIN ranking_system AS s
        ON COALESCE(r.ranking_criteria_id, -1) = s.id;

    Performance Bottlenecks in Large-Scale Joins

    JOIN operations can slow down queries and strain system resources when working with large datasets. If not optimized, they lead to high memory usage, slow execution, and even system crashes. Here’s why large joins degrade performance and how to fix them.

    Joins require the database engine to load, sort, and match records from both tables. If a dataset is too large to fit into memory, the system starts using disk-based temporary storage, drastically slowing performance. For instance, a join query may require gigabytes of RAM if two tables contain millions of rows. This can slow down or even crash the system. You can resolve these issues by:

    • Use Indexing to reduce the amount of data loaded into memory.
    SQL
    CREATE INDEX idx_table1_key ON table1(join_column);
    CREATE INDEX idx_table2_key ON table2(join_column);
    CREATE INDEX idx_table3_key ON table3(join_column);
    • Limit the dataset size by filtering data before joining.
    SQL
    SELECT 
        t1.column1,
        t2.column2,
        t3.column3
    FROM table1 AS t1
    JOIN table2 AS t2 
        ON t1.join_column = t2.join_column
    JOIN table3 AS t3
        ON t2.join_column = t3.join_column
    WHERE t2.filter_column >= some_value
    AND t1.filter_column IS NOT NULL;
    • Partition the data so the join operates on smaller chunks rather than the entire dataset at once.
    SQL
    CREATE TABLE partitioned_table (
        id INT,
        join_column INT,
        filter_column DATE,
        PRIMARY KEY (id, filter_column)  -- the partition key must be part of the primary key
    ) PARTITION BY RANGE COLUMNS(filter_column) (
        PARTITION p2023 VALUES LESS THAN ('2024-01-01'),
        PARTITION p2024 VALUES LESS THAN ('2025-01-01'),
        PARTITION pmax VALUES LESS THAN (MAXVALUE)
    );
    • Optimize the query to process only partitioned data and avoid full table scans.
    SQL
    SELECT 
        t1.column1,
        t2.column2,
        t3.column3
    FROM table1 AS t1
    JOIN partitioned_table AS t2
        ON t1.join_column = t2.join_column
    JOIN table3 AS t3
        ON t2.join_column = t3.join_column
    WHERE t2.filter_column BETWEEN start_value AND end_value;

    Avoiding Pitfalls

    Even experienced analysts encounter pitfalls when working with data joins. Here are two of the most common issues and how to avoid them:

    Unintended Cartesian Products

    A Cartesian product occurs when a join lacks a proper condition, causing every row in one table to be paired with every row in another. This can generate a huge dataset, leading to performance issues and excessive resource consumption. For instance, a query that joins two tables without a condition will return every university paired with every ranking system, even if there’s no actual relationship between them:

    Unintended Cartesian Join Error

    Always include an explicit join condition so that only related rows are combined. Here, the university_ranking_year bridge table supplies the real relationship between universities and ranking systems:

    SQL
    SELECT 
        u.university_name, 
        s.system_name
    FROM university AS u
    JOIN university_ranking_year AS r ON u.id = r.university_id
    JOIN ranking_system AS s ON r.ranking_criteria_id = s.id;

    Identifying Cardinality in Joins

    Cardinality refers to the relationships between datasets, such as one-to-one, one-to-many, or many-to-many. Misunderstanding these relationships can result in missing data or duplication. For example, knowing how universities, rankings, and ranking systems are related in a university database is crucial for accurate queries.

    One common mistake is assuming a one-to-one relationship between tables when it’s actually one-to-many. Take universities and rankings as an example: a university could have multiple rankings for different years or ranking systems. You could miss some rankings if you mistakenly treat this as a one-to-one relationship. Understanding the real relationship between the tables ensures that you don’t exclude relevant data.

    Start by counting the unique keys in each table. You can group data by key columns and count how often each key appears. This can help you identify if the relationship is one-to-one or one-to-many.

    SQL
    SELECT university_id, COUNT(*) 
    FROM university_ranking_year 
    GROUP BY university_id;

    If the count is greater than 1 for any university_id, it indicates a one-to-many relationship.

    One-to-Many Relationship Check
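
    To list only the universities that actually have multiple rankings, add a HAVING filter:

    SQL
    SELECT university_id, COUNT(*) AS ranking_count
    FROM university_ranking_year 
    GROUP BY university_id
    HAVING COUNT(*) > 1;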

    Next, examine how the records are distributed across tables. If a university appears multiple times in the university_ranking_year table, it likely indicates a one-to-many relationship. Once the cardinality is clear, choose the right type of join for your query:

    • INNER JOIN: Use when you only want to include universities that have rankings. This is ideal for one-to-many relationships where data exists in both tables.
    • LEFT JOIN: Use this if you want to include all universities, even those without rankings.

    For instance, in a one-to-many relationship where some universities have multiple rankings, a LEFT JOIN would ensure that all universities are included, even if some don’t have rankings for certain systems:

    SQL
    SELECT 
        u.university_name, 
        r.year, 
        r.score, 
        s.system_name
    FROM university AS u
    LEFT JOIN university_ranking_year AS r ON u.id = r.university_id
    LEFT JOIN ranking_system AS s ON r.ranking_criteria_id = s.id;

    Leveraging Advanced Tools for Complex Joins

    As datasets grow, traditional join methods may no longer cut it. Analysts need advanced tools to handle large data volumes, maintain efficiency, and simplify complex table relationships. Today’s data platforms are designed to handle these challenges, freeing analysts from technical constraints and allowing them to focus on extracting valuable insights.

    ClicData: Simplifying Data Joins


    ClicData offers a flexible platform that balances ease of use with powerful capabilities. The platform lets you combine the ease of no-code with Python scripting for scalable data joins. Moreover, ClicData allows you to make data joins a core building block of analytics pipelines and automated workflows. Here’s how it stands out:

    • No-Code Joins: A drag-and-drop interface that enables you to perform complex joins between datasets without writing code. The intuitive design helps you combine data from various sources, clean it, and prepare it for deeper analysis quickly and accurately.
    • Custom Joins with Python Integration: Integrate Python directly into your workflow and leverage libraries like Pandas and NumPy to create custom joins and handle more sophisticated data transformations. 
    • Seamless Data Integration: Connect data from multiple sources with a wide range of native connectors to databases, cloud services, and APIs. Whether integrating CRM, marketing, or other internal data, this capability ensures that all necessary data is available for accurate joins. 
    • Automated Data Updates: Automate your data refreshes and join processes to ensure datasets remain up-to-date without manual intervention. You can schedule regular updates for data extraction, transformation, and loading (ETL) and execute custom Python scripts at set intervals. 
    • Machine Learning Integration: Build, test, and deploy predictive models directly on your datasets and streamline the transition from raw data to actionable insights. 

    Learn more about ClicData’s capabilities for handling complex data joins.

    Apache PySpark: Scalable Joins for Big Data

    PySpark is built for large-scale distributed data processing, making it a powerful tool for executing joins across massive datasets. Its architecture enables seamless scalability, ensuring efficient performance even with complex join operations.

    • Distributed Execution: Joins are processed across multiple nodes in a Spark cluster, improving scalability and reducing bottlenecks.
    • Resilient Distributed Datasets (RDDs): PySpark efficiently manages data movement and shuffling during joins, optimizing performance.
    • Parallel Processing: Operations are distributed across multiple cores, significantly reducing execution time.
    • Fault Tolerance: PySpark automatically handles node failures, keeping distributed join operations stable.

    Apache Hive: SQL-Driven Joins on Hadoop

    Hive provides a familiar SQL-like interface for querying massive datasets stored in Hadoop. It is designed for batch processing and simplifies executing complex joins on distributed storage systems.

    • SQL-Like Interface: Users can perform joins using standard SQL queries without needing to manage underlying storage complexities.
    • Seamless Hadoop Integration: Built on HDFS and MapReduce, Hive efficiently processes petabyte-scale datasets.
    • Optimized for Batch Processing: Best suited for large-scale joins that don’t require real-time execution.
    • Horizontal Scalability: Expands across thousands of machines to handle complex joins efficiently.

    Dask: Efficient In-Memory and Out-of-Core Joins

    Dask extends the capabilities of Pandas, making it possible to work with datasets larger than available memory while retaining a familiar API. It offers flexibility in both in-memory and out-of-core computations.

    • Pandas Compatibility: Analysts familiar with Pandas can perform scalable joins without a steep learning curve.
    • Out-of-Core Computation: Processes datasets that don’t fit in memory by breaking them into smaller partitions.
    • Parallel Execution: Joins are distributed across multiple cores and machines, improving efficiency.
    • Memory Adaptive: Can switch between in-memory and out-of-core processing based on available resources.

    Mastering Data Joins for Business-Driven Insights

    Mastering data joins is about understanding how different datasets connect and ensuring the combined data stays reliable. Knowing how to handle relationships like one-to-one or one-to-many will improve your analysis and help you avoid common pitfalls like mismatched keys or performance issues.

    Whether you’re combining customer information for marketing or pulling together IoT data for predictive maintenance, data joins let you bring it all together. With the right tools, you can create a seamless flow of information and gain access to data-backed insights that drive smarter decisions.

    ClicData makes the process easier with tools that fit both beginners and experts. Its drag-and-drop interface and Python scripting options let you work with data effortlessly. Plus, with over 250 connectors, you can combine and transform data from many different sources, quickly unlocking valuable insights without the need for extra coding.

    Ready to take your data analysis to the next level? Schedule some time with our data experts today.