Consistency over Accuracy - Testing Big Data in Totango

Getting the right data set has always been the trickiest part of working with data-driven systems that require accurate data validation. How do you create a data set that is reliable, repeatable, and holds tens of millions of records a day and, more importantly, how do you validate the data