Abhaya Parthy, MathWorks
A key challenge with the growing volume of measured data in the energy sector is the preparation of the data for analysis. This challenge comes from data being stored in multiple locations, in multiple formats, and with multiple sampling rates. This presentation considers the collection of time-series data sets from multiple sources including Excel files, SQL databases, and data historians. Techniques for preprocessing the data sets are shown, including synchronizing the data sets to a common time reference, assessing data quality, and dealing with bad data. We then show how events can be defined in the data sets before statistical analysis is performed.
Recorded: 20 September 2012