5. Combine Your Data - Append
This is part 5 of the Getting Started tutorial series. Let's now look at adding more data to our project by bringing in columns and rows from a similar dataset.
Video: Combining Your Data - Append
Steps: Combining Your Data - Append
Append your data:
Step 1: Open the source project of the YTD Clean Contacts dataset that we created in the previous project.
Step 2: Use the append step
Step 2.1: Click on the green icon that says “dataset(click to select)” to bring in the October contacts data.
Step 2.2: To bring a dataset into the library, click the green + icon and use the Amazon S3 bucket Datasource (this is where the data lives), open the tutorial bucket, and bring in the October contacts data.
Note: A preview of the dataset appears in the bottom right panel when you bring in the dataset. You can remove, reorder, and rename columns before bringing this data into the Paxata library.
Step 3: Once the import finishes, click the Select button to bring this dataset into Paxata.
Step 4: Even when the two datasets have different column names, Paxata lets you interactively select the columns that you want to append. Unmatched columns are populated with blanks.
Step 5: Our October data has an additional column called score but no country column. This may be because the October data contains only US contacts. We clearly need to apply all of the previous transformations to the October data as well. Using the Steps panel, you can move the append step by clicking it and dragging it to the desired position.
Step 6: Now we see that whitespace has persisted in the full name column. To fix this, we can trim leading and trailing whitespace by criteria (where the data type is string).
Step 7: Once the data prep work is done on a project, we can publish this view using a New Lens. A new lens can be automated and exported by the user.
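To make the append behaviour concrete outside the UI, here is a minimal sketch in plain Python. It is not Paxata's implementation or API. The helper names `append_rows` and `trim_strings`, the incoming column name `name`, and the sample rows are all hypothetical. It shows the three ideas from the steps: mapping differently named columns onto each other, filling unmatched columns with blanks, and trimming string values after the append so the cleanup covers the newly added rows too.

```python
def append_rows(base, extra, column_map=None, blank=""):
    """Append `extra` rows to `base`, filling unmatched columns with `blank`."""
    column_map = column_map or {}
    # Rename incoming columns so they line up with the base dataset.
    renamed = [{column_map.get(k, k): v for k, v in row.items()} for row in extra]
    all_rows = base + renamed
    # Collect the union of column names, keeping first-seen order.
    columns = []
    for row in all_rows:
        for name in row:
            if name not in columns:
                columns.append(name)
    # Every output row carries every column; missing values become blanks.
    return [{name: row.get(name, blank) for name in columns} for row in all_rows]


def trim_strings(rows):
    """Strip leading and trailing whitespace from every string value."""
    return [
        {k: v.strip() if isinstance(v, str) else v for k, v in row.items()}
        for row in rows
    ]


# Hypothetical sample rows; the tutorial's real datasets are not reproduced here.
ytd = [{"full name": "  Ada Lovelace ", "country": "UK"}]
october = [{"name": " Grace Hopper", "score": 87}]

# Append first, then trim, so the trim also applies to the October rows.
combined = append_rows(ytd, october, column_map={"name": "full name"})
cleaned = trim_strings(combined)
for row in cleaned:
    print(row)
```

In this sketch the YTD row gets a blank score and the October row gets a blank country, which mirrors how unmatched columns are handled in Step 4. Running the trim after the append corresponds to placing the append step ahead of the cleanup steps in the Steps panel.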
That completes the fifth of six tutorials in the Getting Started series.
- Create a New Data Prep Project
- Explore Your Data
- Clean and Transform Your Data
- Publish and Export Your Data
- Combine Your Data - Append
- Combine Your Data - Lookup / Join