Why is my neural network generating negative weights?

Hi, I'm using reinforcement learning on a control problem. My goal is to find optimal values for the PID gains, and in my search I found this MATLAB example: https://la.mathworks.com/help/reinforcement-learning/ug/tune-pi-controller-using-td3.html
The example uses a custom layer for the actor called "fullyConnectedPILayer". Its description says:
"Gradient descent optimization can drive the weights to negative values. To avoid negative weights, replace normal fullyConnectedLayer with a fullyConnectedPILayer. This layer ensures that the weights are positive by implementing the function Y=abs(WEIGHTS)∗X. This layer is defined in fullyConnectedPILayer.m."
So the two weights are always supposed to be positive, but after training my actor network has one negative weight (Ki = -0.0057) and one positive weight (Kp = 0.0455). Also, the same example says:
"The integral and proportional gains of the PI controller are the absolute weights of the actor representation. To obtain the weights, first extract the learnable parameters from the actor."
And it uses the abs function to get the weights, so using the custom layer "fullyConnectedPILayer" doesn't seem to make sense, because the actor network can still end up with negative weights.
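For reference, this is roughly how the example extracts the gains after training, and how I got the values above (getActor and getLearnableParameters are the toolbox functions; the cell indexing is my reading of the example, so it may need adjusting):
% Extract the learnable parameters from the trained actor,
% then take abs to recover the PI gains (as the example does)
actor = getActor(agent);
parameters = getLearnableParameters(actor);
Ki = abs(parameters{1}(1))
Kp = abs(parameters{1}(2))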
The code of the layer is as follows:
classdef fullyConnectedPILayer < nnet.layer.Layer
    properties (Learnable)
        Weights
    end
    methods
        function obj = fullyConnectedPILayer(Weights,Name)
            % Set layer name
            obj.Name = Name;
            % Set layer description
            obj.Description = "fullyConnectedNonNegWeightLayer";
            % Set layer weights
            obj.Weights = Weights;
        end
        function Z = predict(obj, X)
            Z = fullyconnect(X, abs(obj.Weights), 0, 'DataFormat','CB');
        end
    end
end
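As far as I can tell from this code, abs is only applied inside predict, not to the stored Weights property itself, so the learnable parameter can go negative during training even though the gain the layer actually applies stays positive. Here is a quick standalone check I put together (it assumes fullyConnectedPILayer.m is on the path and uses a made-up observation):
% Create the layer with a negative stored weight, like after my training
layer = fullyConnectedPILayer(single([-0.0057 0.0455]), 'Action');
% Dummy observation: 2 channels, 1 batch (unformatted dlarray,
% since predict passes 'DataFormat','CB' to fullyconnect)
X = dlarray(single([1; 1]));
% predict uses abs(Weights): output = 0.0057*1 + 0.0455*1 = 0.0512
Z = predict(layer, X)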
The code for my actor network is exactly the same as in the example:
% numObservations, obsInfo and actInfo come from my environment setup,
% as in the example
initialGain = single([1e-3 2]);
actorNetwork = [
    featureInputLayer(numObservations,'Normalization','none','Name','state')
    fullyConnectedPILayer(initialGain, 'Action')];
actorOptions = rlRepresentationOptions('LearnRate',1e-3,'GradientThreshold',1);
actor = rlDeterministicActorRepresentation(actorNetwork,obsInfo,actInfo, ...
    'Observation',{'state'},'Action',{'Action'},actorOptions);
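Right after building the actor, the stored weights can be inspected to confirm they start at initialGain (a small sanity-check sketch; the expected values are just the initial gains from above):
% Sanity check: the freshly created actor stores the initial gains
params = getLearnableParameters(actor);
disp(params{1})   % expected: 0.0010  2.0000 (the initialGain values)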
I don't know why it generates a negative weight if I'm using this custom layer.

Answers (0)
