Home Data Prep Q&A

Big News: we’ve moved to the DataRobot Community! Please keep your eye out for an email invitation to join us there. Refer to the We've Moved FAQ for a guide on how to use your existing Paxata Community account to login to our new home.

Visit the official Paxata Documentation portal for all of your doc needs.


Can you aggregate and join datasets from your library? E.g. join source data to current data, sum the source data based on key field from current data? Similar to a sum and join in SQL.


  • Thanks for responding!  I have taken that approach, but I would be more interested in combining those into one step in a project where you join and sum at the same time. Again, similar to sum/join in SQL or SUMIFS formula in Excel. I believe this would also require the ability to select specific columns to bring in on the lookup/join instead of bringing the entire source data set into the current data set.  I'll check out the community videos and see if I can pick up anything there.
  • It's not exactly in one step, but one feature that you may not be aware of is the ability to "expand lookup steps".  Once you do a lookup, open the Steps panel on the left and click "expand lookup steps" right under the Lookup step.  From here, you can click on the Import step and apply changes including hiding columns and group by which are only applied to the dataset that you are looking up against - in this way you can alter the dataset coming into the project before the actual lookup operation.  I hope this tip is helpful!
Sign In or Register to comment.