3.1 Join

7 minutes
Share the link to this page
Copied
  Completed
You need to have access to the item to view this lesson.
One-time Fee
$49.99
List Price:  $69.99
You save:  $20
€47.98
List Price:  €67.18
You save:  €19.19
£39.89
List Price:  £55.86
You save:  £15.96
CA$69.90
List Price:  CA$97.87
You save:  CA$27.96
A$76.87
List Price:  A$107.62
You save:  A$30.75
S$67.31
List Price:  S$94.24
You save:  S$26.93
HK$389.14
List Price:  HK$544.83
You save:  HK$155.68
CHF 44.67
List Price:  CHF 62.54
You save:  CHF 17.87
NOK kr553.51
List Price:  NOK kr774.97
You save:  NOK kr221.45
DKK kr357.84
List Price:  DKK kr501
You save:  DKK kr143.16
NZ$85.68
List Price:  NZ$119.95
You save:  NZ$34.27
د.إ183.61
List Price:  د.إ257.07
You save:  د.إ73.46
৳5,972.22
List Price:  ৳8,361.58
You save:  ৳2,389.36
₹4,221.07
List Price:  ₹5,909.84
You save:  ₹1,688.76
RM223.35
List Price:  RM312.71
You save:  RM89.36
₦84,627.22
List Price:  ₦118,484.88
You save:  ₦33,857.66
₨13,887.22
List Price:  ₨19,443.22
You save:  ₨5,556
฿1,722.96
List Price:  ฿2,412.28
You save:  ฿689.32
₺1,727.27
List Price:  ₺2,418.32
You save:  ₺691.05
B$289.99
List Price:  B$406.01
You save:  B$116.02
R905.58
List Price:  R1,267.89
You save:  R362.30
Лв93.83
List Price:  Лв131.37
You save:  Лв37.54
₩70,211.45
List Price:  ₩98,301.65
You save:  ₩28,090.20
₪185.06
List Price:  ₪259.10
You save:  ₪74.04
₱2,946.36
List Price:  ₱4,125.14
You save:  ₱1,178.78
¥7,736.95
List Price:  ¥10,832.35
You save:  ¥3,095.40
MX$1,021.22
List Price:  MX$1,429.79
You save:  MX$408.57
QR182.26
List Price:  QR255.18
You save:  QR72.92
P683.46
List Price:  P956.90
You save:  P273.44
KSh6,472.14
List Price:  KSh9,061.51
You save:  KSh2,589.37
E£2,482.01
List Price:  E£3,475.01
You save:  E£993
ብር6,118.22
List Price:  ብር8,566
You save:  ብር2,447.77
Kz45,640.87
List Price:  Kz63,900.87
You save:  Kz18,260
CLP$49,324.13
List Price:  CLP$69,057.73
You save:  CLP$19,733.60
CN¥362.07
List Price:  CN¥506.93
You save:  CN¥144.86
RD$3,012.01
List Price:  RD$4,217.06
You save:  RD$1,205.04
DA6,712.40
List Price:  DA9,397.90
You save:  DA2,685.50
FJ$113.77
List Price:  FJ$159.29
You save:  FJ$45.51
Q385.78
List Price:  Q540.13
You save:  Q154.34
GY$10,455.70
List Price:  GY$14,638.82
You save:  GY$4,183.11
ISK kr6,982.60
List Price:  ISK kr9,776.20
You save:  ISK kr2,793.60
DH502.76
List Price:  DH703.91
You save:  DH201.14
L910.90
List Price:  L1,275.33
You save:  L364.43
ден2,951.80
List Price:  ден4,132.76
You save:  ден1,180.95
MOP$400.70
List Price:  MOP$561.01
You save:  MOP$160.31
N$906.31
List Price:  N$1,268.91
You save:  N$362.60
C$1,838.97
List Price:  C$2,574.70
You save:  C$735.73
रु6,749.45
List Price:  रु9,449.77
You save:  रु2,700.32
S/189.51
List Price:  S/265.32
You save:  S/75.81
K201.21
List Price:  K281.71
You save:  K80.50
SAR187.68
List Price:  SAR262.77
You save:  SAR75.08
ZK1,382
List Price:  ZK1,934.92
You save:  ZK552.91
L238.86
List Price:  L334.42
You save:  L95.56
Kč1,216.06
List Price:  Kč1,702.59
You save:  Kč486.52
Ft19,746.05
List Price:  Ft27,646.05
You save:  Ft7,900
SEK kr551.69
List Price:  SEK kr772.42
You save:  SEK kr220.72
ARS$50,182.44
List Price:  ARS$70,259.44
You save:  ARS$20,076.99
Bs345.34
List Price:  Bs483.50
You save:  Bs138.16
COP$219,443.60
List Price:  COP$307,238.59
You save:  COP$87,794.99
₡25,456.77
List Price:  ₡35,641.51
You save:  ₡10,184.74
L1,262.95
List Price:  L1,768.23
You save:  L505.28
₲390,155.46
List Price:  ₲546,248.87
You save:  ₲156,093.40
$U2,130.69
List Price:  $U2,983.14
You save:  $U852.45
zł208.13
List Price:  zł291.40
You save:  zł83.27
Already have an account? Log In

Transcript

The join tool allows you to combine two data sets linked by a common field, usually by a unique identifier, such as a customer ID, email address, or phone number. ultrix also lets you join two data sets based on record position. Which means if you had 30 Records in one set, and five records in the other, the first five records in both sets will join horizontally, even though there's nothing common between them. Referring to the illustration here, the first picture shows our two data sets. The first is employee information, consisting of employee ID, first and last name, age, position, and department ID. The second data set contains department information in which includes department ID, department name and department email.

We essentially want to find the department name for each employee, as just showing the ID at the moment isn't too helpful. So the steps required here are dragging in two input data tools, a join tool, and then joining by a common field, and then outputting. The results in the third picture, to see the department ID, department name and department email joined to the employee data set. Let's start a new workflow by adding our data sets. We'll go to the Insert tab, drag in to input data tools. And the first one will be 3.1 point one employee and the second one will be department will then go to the join tab and dragon join.

And you will notice that there are two inputs to this one, a left and a right. Make sure that the employee is joined to the left and that the right employees is joined to department. Looking in the join tool, there are three types of joins, we can perform a left in a and writes join. Let's go through what each of these mean. I'll start with the inner join, because I think that's easiest to explain. Inner Join means that the joint only returns records were found matching IDs in both data sets.

Therefore, looking at the department file, we have department IDs of 100 200 300 607 hundred. If we go to our employee file, we have department IDs of 1234 and 500. Therefore, the IDs that match are only 100 203 hundred and we should only get back 17 Records. Left join means that you We'll return the left data set, which is employee when it doesn't find a match between the two data sets. Because we know that there are 17 records that match the data set from our inner join, we should see 13 records that don't match. In other words, the 405 hundred department IDs.

Right join means that it will return the right data set, which is department when it also doesn't find a match between the data set. We know that department ID 100 203 hundred exists, and 6700 don't. So we should get back these three records. If we add a browse tool to the outputs of our join tool using Ctrl Shift B and let's update our join tool to join by specific fields and the common identifier between the two data sets is department ID. So, on the left inputs, we have our employee file, and we'll select a department ID. And on the right, we have our department file, and we're going to link department ID with it.

Let's run our workflow. And starting with our inner join output in the middle, we should see 17 records that matched between the two files. So what was common between the two was department IDs 100 to 300. If we go to our left join, we should see 13 records that didn't match but it'll return the data from the employee file and 13 Records here. And 405 hundred were the IDs that didn't match the department file. And lastly, with the right join, it will return the right data set which was department information and because department I 607 hundred couldn't be found in employee.

We have our two results here. Let's start a new workflow and import spreadsheets 3.1 point two. So we'll delete these ones here, dragging two new input data tools and connect to 3.1 point two. So have 2018 sales and 2019 sales. Our first data set contains 2018 accounting data per department. So that's department ID, revenue, expenses, profit and the year of the file.

The second data set contains the same data, but for 2019 because the two data sets are identical in schema, in other words, they have the same columns have the same time departments per record, we can use the join by record position to combine the two data sets. So let's drag in the join tool connecting both. And we'll join by record position and add a browse tool to the output of our join tool and run the workflow. Both sales figures are combined together into one table. We've now joined our sales data by record position. To make it a bit easier to read, we can add a select tool to the output of our join and maybe give these field names something more meaningful for 2018 revenue 2018 expenses 2018 profit, we can probably remove year from it as well.

And the right department ID so we can only keep the first one. We'll add 2019 revenue, expenses and 2019. profit. let's rerun it. And you can see here that per department ID we have the 2018 and 2019 figures looking a lot more legible than it was before.

Sign Up

Share

Share with friends, get 20% off
Invite your friends to LearnDesk learning marketplace. For each purchase they make, you get 20% off (upto $10) on your next purchase.