Home Data Prep Q&A

Paxata Community Members: Something special in a community experience is coming your way. Stay tuned to this space.
In the meantime, check out the brand new Data Prep for Data Science topic here and the new DataRobot Community.

Visit the official Paxata Documentation portal for all of your doc needs.


Can you aggregate and join datasets from your library? E.g. join source data to current data, sum the source data based on key field from current data? Similar to a sum and join in SQL.


  • Thanks for responding!  I have taken that approach, but I would be more interested in combining those into one step in a project where you join and sum at the same time. Again, similar to sum/join in SQL or SUMIFS formula in Excel. I believe this would also require the ability to select specific columns to bring in on the lookup/join instead of bringing the entire source data set into the current data set.  I'll check out the community videos and see if I can pick up anything there.
  • It's not exactly in one step, but one feature that you may not be aware of is the ability to "expand lookup steps".  Once you do a lookup, open the Steps panel on the left and click "expand lookup steps" right under the Lookup step.  From here, you can click on the Import step and apply changes including hiding columns and group by which are only applied to the dataset that you are looking up against - in this way you can alter the dataset coming into the project before the actual lookup operation.  I hope this tip is helpful!
Sign In or Register to comment.