In this problem, we use GP to search for an analytical expression that fits the data.

For example, given a target example:

Given samples , GP must discover .

The GP system only sees input-output samples:

We aim to minimize MSE:

With parsimony pressure, our objective becomes becomes:

Tracing the progress:

Despite its strengths, symbolic regression with GP has challenges:

  • Large computational cost
  • Potential overfitting
  • Sensitivity to function set design
  • Multiple semantically equivalent expressions