Stop Blaming Your Data - GoDataDriven

Anyone who has ever worked as a data {scientist, engineer, analyst} knows that, at some point during the project, the data (quality) is going to ruin the party. You have your models and cross validation running on your state-of-the-art CI/CD pipeline, but find out that the predictions are essentially trash. After some digging, you find out that the data you have been given is noisy, incomplete, biased, and incorrect altogether.

This is a companion discussion topic for the original entry at