Home Data Prep Q&A

Paxata Community Members: Something special in a community experience is coming your way. Stay tuned to this space.
In the meantime, check out the brand new Data Prep for Data Science topic here and the new DataRobot Community.

Visit the official Paxata Documentation portal for all of your doc needs.

Determining if a field value is unique or distinct

How can I use a compute statement to determine if the value of a field is in a dataset is distinct or unique.  Or can I code a SQL statement in a compute statement?

Best Answer

Answers

  • sayyarsayyar Posts: 24 ✭✭
    Hi @CFresh,
    • If you are trying to explore the data to see if there are duplicates in the column, the quickest way would be to use a Filtergram
    • If you want to identify the data quality for many columns in the dataset profiling is the best solution. 
    • If the objective is to get the counts of certain fields and use that for further processing we have certain shaping operations like group by
    Please let me know if you have further questions on this. 

    Regards,
    Shyam Ayyar
    Product Manager
  • Thanks for the different options.  Do you know if it is possible to export the Profile, ntot that I would print it, but I would use the freeze function so that the part number could remain in view when scrolling.  Thank you!
  • sayyarsayyar Posts: 24 ✭✭
    Yes, the profile will be present as a Dataset in the Data Library and you could export this in the standard formats that Paxata supports. Thank you for the feedback on the freeze function, we will add it to the list of enhancements. 
Sign In or Register to comment.