What is the difference between target data, testing data and training data?

Question

Aeri on 30 Jan 2016

0
Link

Direct link to this question

https://www.mathworks.com/matlabcentral/answers/265842-what-is-the-difference-between-target-data-testing-data-and-training-data

Commented: neamah al-naffakh on 6 Feb 2017

Accepted Answer: Walter Roberson

What's the difference of each data?
Which data is the sample data?
Should the value of target data always compose of 0 and 1? Why? If not, how can I use a target data with different values?
What would be the input and output data?

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Answer 1

Walter Roberson on 30 Jan 2016

2
Link

Direct link to this answer

https://www.mathworks.com/matlabcentral/answers/265842-what-is-the-difference-between-target-data-testing-data-and-training-data#answer_207918

target information is the information about which class a given sample is known to belong to

training samples are directly used to calculate the data values.

testing samples are different samples than the training samples. The algorithm uses the parameters calculated with the training samples to predict the results on the testing samples and then compares the prediction to the target information to see how well the prediction went. It will make use of this information to decide how to re-calculate based upon the training samples.

For example the algorithm might hypothesize that features #5 and #11 are enough to predict the results well, and it would calculate parameters based on that, and would use the test samples to see how well it went. Then the algorithm might hypothesize that features #5 and #12 are enough to predict the results well, and it would calculate parameters based on that second hypothesis, and would use the test samples to see how well the second hypothesis went. Whichever of the two hypothesis had worse performance would be dropped, and more hypotheses would be tested, until eventually it would come up with the best hypothesis out of all of the ones it tried.

How to specify the target value depends upon what the network is to be used for. If it is to be a binary classification algorithm then it should be just 0 and 1. For some types of neural networks, each sample should have a vector of bit values with exactly 1 bit set, with the bit that is set indicating which class the data is. For example, if you had 3 different classes, then [0 0 1] would correspond to class #3. This vector of bit values would be the rows of a 2D array of target information. For other kinds of neural networks, you would just give an integer that is a class number. For other kinds of neural networks, you would give a scalar or perhaps vector of values per sample that did not have to be integer at all.

The input data would be the array of features, multiple features per sample.

The output depends upon how the neural network is to be used. Sometimes the output is a single value per sample that is the predicted class number; sometimes the output is a vector of parameter values for each sample.

2 Comments
Show NoneHide None

Aeri on 30 Jan 2016

Thank you very much! I appreciate it.

neamah al-naffakh on 6 Feb 2017

Dear Walter,

could you help me on this post please?

Kind Regards

Sign in to comment.

What is the difference between target data, testing data and training data?

0 Comments
Show -2 older commentsHide -2 older comments

Accepted Answer

2 Comments
Show NoneHide None

More Answers (0)

See Also

Categories

Tags

Community Treasure Hunt

What is the difference between target data, testing data and training data?

0 Comments Show -2 older commentsHide -2 older comments

Accepted Answer

2 Comments Show NoneHide None

More Answers (0)

See Also

Categories

Tags

Community Treasure Hunt

0 Comments
Show -2 older commentsHide -2 older comments

2 Comments
Show NoneHide None