A key challenge with the growing volume of measured data in the energy sector is the preparation of the data for analysis. This challenge comes from data being stored in multiple locations, in multiple formats, and with multiple sampling rates. This presentation considers the collection of time-series data sets from multiple sources including Excel files, SQL databases, and data historians. Techniques for preprocessing the data sets are shown, including synchronizing the data sets to a common time reference, assessing data quality, and dealing with bad data. We then show how subsets of the data can be extracted to simplify further analysis.
About the Presenter: Abhaya is an Application Engineer at MathWorks Australia where he applies methods from the fields of mathematical and physical modelling, optimisation, signal processing, statistics and data analysis across a range of industries. Abhaya holds a Ph.D. and a B.E. (Software Engineering) both from the University of Sydney, Australia. In his research he focused on array signal processing for audio and acoustics and he designed, developed and built a dual concentric spherical microphone array for broadband sound field recording and beam forming.
Recorded: 5 Feb 2013