How to "teach" Matlab how to remove inconsistent data?

7 views (last 30 days)
Hello!
I have a set of data consisting of measured values in time, over several days. You can see on the plot below the "normal" shape, and the moments at which data is inconsistent (mostly in the left and right parts of the plot). Note that this is values from one of my measurement devices, other ones have a more consistent shape with values decreasing at the beginning and at the end. I would like to remove this inconsistent data. How can I do this? I thought of doing some machine learning, for example by training the algorithm and teaching it what is "good" or "bad" data.
Thank you
Romain

Answers (2)

Walter Roberson
Walter Roberson on 11 Aug 2015

Star Strider
Star Strider on 11 Aug 2015
One possibility is for you to sort your data by the independent variable and then plot it. That might at least make it a bit easier to understand. You could then decide what the outliers are.
  4 Comments
Romain Meyer
Romain Meyer on 14 Aug 2015
The values should increase gradually, or at least be consistent with the time and the day. For example on the picture, it is clear that the values are not valid before 7am and after 9pm for some days. I know this is very complicated to implement that kind of verification. The idea I had is to have a script that present some time series and wait for the user "OK" or "NOT OK" and then it could learn to detect bad data, based on the training.
Star Strider
Star Strider on 14 Aug 2015
I’m not quite sure what you’re measuring, but it would seem that 100% occupancy aren’t correct data. I would remove those (perhaps anything greater than 95%, and the associated time data), and then see what the data are. You might not need anything more sophisticated than thresholding your data.

Sign in to comment.

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!