sequence learning using LSTM

Question

Sharan Magavi on 24 Dec 2019

0
Link

Direct link to this question

https://www.mathworks.com/matlabcentral/answers/497962-sequence-learning-using-lstm

Commented: Sharan Magavi on 16 Jan 2020

Hello everyone, I am trying to use an LSTM to predict and forecast the position of a vehicle and I would like to know how to train the system.

I have a dataset consisting of 230 vehicle samples i.e. a cell of 1 x 230 where each sample is a matrix of 4 features and the respective sequence length(60 - 300 timesteps). The objective is to forecast future (1 - 5 timesteps) steps of a given vehicle sample.

I am refering to this example to understand the way to forecast and this to see how to train the model for prediction. But in both the examples the LSTM model is used as a many to one example.

my features are in the x,y coordinate..

I would like to know how to train a LSTM model on multiple sequences containing mutltiple features and learn the behaviour of the vehicle model!

Thanks in advance

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Answer 1

Asvin Kumar on 30 Dec 2019

1
Link

Direct link to this answer

https://www.mathworks.com/matlabcentral/answers/497962-sequence-learning-using-lstm#answer_408108

Edited: Asvin Kumar on 30 Dec 2019

Have a look at the example here: https://www.mathworks.com/help/deeplearning/ref/nnet.cnn.layer.lstmlayer.html#d117e90134

Although this links to another example that uses the bilstmLayer, the underlying principles remain the same. You can use a fullyConnectedLayer with as many outputs as necessary for your use case. By setting the ‘OutputMode’ to ‘sequence’ in your lstmLayer and preparing the predictors as mentioned in the first example which you linked, you should be able to achieve your desired result.

In your case, the output size of the fullyConnectedLayer would be 4, I suppose. Your predictors would be shifted in time by 1-5 steps, whichever you're trying to forecast. It might make sense to drop the softmaxLayer and the classificationLayer from the example for your requirement.

11 Comments
Show 9 older commentsHide 9 older comments

Sharan Magavi on 13 Jan 2020

code is as follows

n = randperm(size(trackdata,2), 1); %% select any random number

data = trackdata{1,n}; %extract track information to train

numTimeStepsTrain = floor(0.9*size(data, 2));

dataTrain = data(:,1:numTimeStepsTrain+1);

dataTest = data(:,numTimeStepsTrain+1:end);

mu = mean(dataTrain, 2);

sig = std(dataTrain, 0, 2);

dataTrainStandardized = (dataTrain - mu) ./ sig;

XTrain = dataTrainStandardized(:,1:end-1);

YTrain = dataTrainStandardized(:, 2:end);

numFeatures = size(XTrain, 1);

numResponses = size(YTrain, 1);

numHiddenUnits1 = 200;

layers = [ ...

sequenceInputLayer(numFeatures)

lstmLayer(numHiddenUnits1,'OutputMode','sequence')

fullyConnectedLayer(numResponses)

regressionLayer];

options = trainingOptions('adam', ...

'MaxEpochs',250, ...

'GradientThreshold',1, ...

'InitialLearnRate',0.005, ...

'LearnRateSchedule','piecewise', ...

'LearnRateDropPeriod',125, ...

'LearnRateDropFactor',0.2, ...

'Verbose',0, ...

'SequenceLength', 'shortest', ...

'Plots','training-progress');

% training

net = trainNetwork(XTrain,YTrain,layers,options);

%

dataTestStandardized = (dataTest - mu) ./ sig;

XTest = dataTestStandardized(:,1:end-1);

net = predictAndUpdateState(net,XTrain);

[net,YPred(:,1)] = predictAndUpdateState(net,YTrain(:,end));

numTimeStepsTest = size(XTest,2);

for i = 2:numTimeStepsTest

[net,YPred(:,i)] = predictAndUpdateState(net,YPred(:,i-1),'ExecutionEnvironment','cpu');

end

YPred = sig.*YPred + mu;

YTest = dataTest(:, 2:end);

rmse = sqrt(mean((YPred-YTest).^2, 2))

Asvin Kumar on 16 Jan 2020

Have a look at: https://www.mathworks.com/help/deeplearning/ug/multiple-input-and-multiple-output-networks.html#mw_4f57107d-31b4-4432-8d4f-2065ce9008f0

Sharan Magavi on 16 Jan 2020

Hello Ashvin,

I was able to solve the problem I had with the prediction. I used an 'sgdm' solver instead of an adam solver and it made a big difference in my output.

Sign in to comment.

sequence learning using LSTM

0 Comments
Show -2 older commentsHide -2 older comments

Accepted Answer

11 Comments
Show 9 older commentsHide 9 older comments

More Answers (0)

See Also

Categories

Tags

Community Treasure Hunt

sequence learning using LSTM

0 Comments Show -2 older commentsHide -2 older comments

Accepted Answer

11 Comments Show 9 older commentsHide 9 older comments

More Answers (0)

See Also

Categories

Tags

Community Treasure Hunt

0 Comments
Show -2 older commentsHide -2 older comments

11 Comments
Show 9 older commentsHide 9 older comments