This function implements a method of using genetic algorithms to optimise the form of a polynomial, i.e. reducing the number of terms required in comparison to a least-squares fit using all possible terms, as described in the following paper:
Clegg, J. et al, "The use of a genetic algorithm to optimize the functional form of a multi-dimensional polynomial fit to experimental data", 2005 IEEE Congress on Evolutionary Computation, 928-934, Edinburgh, September, 2005
A general polynomial of n variables can be represented as follows:
a1 * x1^2 * x2^4 * x3^1 * xn^p + a2 * x_1^6 * x2^3 * x3^1 * xn^p + ...
A linear least-square fit can find the values of the coefficients, represented as 'a' above, such that the error between the function and a set of data is minimized. If the functional form of the data is not known in advance this can require that all possible combinations of terms and powers up to a given size are used. As the number of terms in a polynomial is given by (n+(m-1))! / (n!)(m-1)! where n is the maximum power used and m is the number of variables, the number of all possible terms can become very large for multivariable models. For example, a model with 12 variables of up to order 8 would require 75582 terms. Furthermore this can require the inversion of an extremely large matrix in order to evaluate the regression model.
This function generates a population of polynomial forms which are each a subset of the set of all possible terms and evaluates them for their fit to the model data as given by the R-squared value returned by the function polyfitn. The population is then evolved using a genetic algorithm based on their relative scores. Evolution is achieved by mating successful individuals and the introduction of random mutations.
The function makes use of the free GA toolbox from the University of Sheffield Evolutionary Computing Team in the uk available here:
but the crucial subfunctions objpolyfit, crtpolyp, recpoly and mutpoly could no doubt easily be used with the MATLAB Genetic Algorithm and Direct search toolbox.
This function can use the multicore package provided on the file exchange but it is not required, simply always use the options.MCORE = false option (the default in any case).
Required/Reccomended functions can be currently found here:
GA toolbox (required):
***** CAVEAT *****
gapolyfitn will happily fit very high order polynomials to your data. Be warned that such high order polynomials can give a good score, but suffer from high degrees of 'ringing' between the data points. Therefore users should take care when fitting their data using this tool.
Updated description links
Added warning to description.
Corrected links to resubmitted polyfitn by John D'Errico
Added missing polyfitn and polyfitn (previously available on the file exchange) in private directory. Also fixed bug when no options provided, and improved speed of HasPoly.
Fixed error and added GRAPHS option, code should now be stable
Removed superfluous gapolyfitn.zip from uploaded directory
Fixed major error which prevented more than 15 terms being used in polynomial due to some testing code not being removed, oops. Also added some more info printing to command prompt with VERBOSE option.
Substantial speed improvement of code
Updated description and acknowledgments properly this time!