Remove repeat values in a text file

5 views (last 30 days)
Hello all, below is a section of data from a weather station, the weather station is supposed to record once a minute, however, some have duplicate minute recordings. For me to analyse it I need to remove the duplicate readings, the only way of identifying the duplicate readings is by the minute column (i.e. two recordings from minute 1 etc.) Any ideas as to a solution?
day month year hour minute temperature
25 1 2017 1 1 1.5
25 1 2017 1 1 5.9
25 1 2017 1 2 5.4
25 1 2017 1 3 5.2
25 1 2017 1 4 5.1
25 1 2017 1 5 5.5
25 1 2017 1 5 5.6

Accepted Answer

Akira Agata
Akira Agata on 7 Dec 2018
Assuming your data is stored in table variable T
>> T
T =
7x6 table
day month year hour minute temperature
___ _____ ____ ____ ______ ___________
25 1 2017 1 1 1.5
25 1 2017 1 1 5.9
25 1 2017 1 2 5.4
25 1 2017 1 3 5.2
25 1 2017 1 4 5.1
25 1 2017 1 5 5.5
25 1 2017 1 5 5.6
Then, unique rows can be simply extracted by using unique function, like:
[~,idx] = unique(T(:,{'day','month','year','hour','minute'}),'rows');
T = T(idx,:);
  1 Comment
Joseff Saunders
Joseff Saunders on 7 Dec 2018
Thank you this worked well, however the issue I am having now is that some data from particular minutes is missing (i.e. it wasn't recorded in the first place), as there are supposed to be 60 minutes for each hour, is there a way of testing to see which minutes are missing?

Sign in to comment.

More Answers (1)

ahmed nebli
ahmed nebli on 7 Dec 2018
i suggest you need to write an if condition on the minute vector exemple : if minute(n-1) == minute(n) then you delete the row n from the table.

Categories

Find more on Weather and Atmospheric Science in Help Center and File Exchange

Tags

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!