Stop Blaming Your Data

Anyone who has ever worked as a data {scientist, engineer, analyst} knows that, at some point during a project, data quality is going to ruin the party. You have your models and cross-validation running on your state-of-the-art CI/CD pipeline, only to discover that the predictions are essentially trash. After some digging, you realize that the data you have been given is noisy, incomplete, biased, or outright incorrect.

