This week's question is about data entry validation. Most ETL processes put a lot of effort into scrubbing data, for a variety of reasons and benefits. So, how do you scrub your data, and what do you use to do it?

And what about data that technically “fits” into a particular field but is still flat-out wrong? How do you verify that it is correct? Consider this fine example:

[Image: rr_crossing]