Home Data Prep for Data Science, AI and ML

Paxata Community Members: Something special in a community experience is coming your way. Stay tuned to this space.
In the meantime, check out the brand new Data Prep for Data Science topic here and the new DataRobot Community.

Visit the official Paxata Documentation portal for all of your doc needs.

Paxata and Data Prep for Data Science

MelanieMelanie Posts: 70 admin

What is Data Preparation for Machine Learning?
Data preparation is the process of transforming raw data so that it's properly prepared for the machine learning algorithms used to uncover insights and make predictions.

Why is Data Preparation Important?
Most machine learning algorithms require data to be formatted in very specific ways. Which means your raw datasets generally require some amount of preparation before they can yield useful insights. For example, some datasets have values that are missing or invalid. If data is missing, the algorithm can’t use it. And if data is invalid, the algorithm produces less accurate or even misleading outcomes. Good data preparation produces clean and well-curated data that leads to more practical, accurate model outcomes.

So what can I do to prep my data?
Paxata provides the transformation tools you need to clean, normalize, and shape your data. And once you've cleaned your data, Paxata also provides the tools you need to prepare your Features for optimal feature engineering.

Here's just a short list of how Paxata can help you to quickly prep your data to train your ML models:

Sign In or Register to comment.