> approximately the same magnitude

and they really do mean that: their results show +/- 1 on log10 plots.

reply
I don't think this is an accurate characterization of the error magnitude? Their error plots (from appendix 3) all show `log_10(|Y - \dot{Y}|)` with a median of ~-3 (difference of 0.001) and a max of ~-1.5 (difference of 0.035), and this is with only 3 Taylor terms.
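
Roughly what such a plot measures, as a minimal sketch (my own, not the paper's code; `Y` and `Y_hat` are stand-in names for the exact and Taylor-approximated attention outputs):

```python
import numpy as np

# Synthetic stand-ins: Y is the "exact" output, Y_hat adds ~1e-3 error.
rng = np.random.default_rng(0)
Y = rng.standard_normal((64, 128))
Y_hat = Y + 1e-3 * rng.standard_normal(Y.shape)

log_err = np.log10(np.abs(Y - Y_hat) + 1e-12)   # epsilon avoids log10(0)
print(np.median(log_err), np.max(log_err))

# Reading the appendix plots: a median of -3 means |Y - Y_hat| ~ 10**-3 = 0.001,
# and a max of about -1.5 means |Y - Y_hat| ~ 10**-1.5 ~ 0.03.
```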
reply
The method is more general. The GitHub repository's first example uses eight Taylor terms (P = 8).
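
To give a sense of what P = 8 buys over the P = 3 used in those error plots, here's a toy truncation-error check (my own illustration, not the repository's code; I'm assuming P counts the series order of the exp expansion):

```python
from math import exp, factorial

def taylor_exp(s, P):
    # exp(s) truncated to orders 0..P
    return sum(s**p / factorial(p) for p in range(P + 1))

for P in (3, 8):
    # worst-case truncation error for scores s in [-2, 2]
    worst = max(abs(exp(s / 10) - taylor_exp(s / 10, P)) for s in range(-20, 21))
    print(P, worst)   # P = 8 is a few hundred times more accurate here
```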
reply
I'm clueless about this whole thing, but from my EE education I remember that in general:

Taylor approximations converge slowly in terms of error if the function they're representing is discontinuous (the error disappears quadratically if continuous, linearly if not), and they tend to create highly energetic swings near discontinuities (similar to the Gibbs oscillations of Fourier series).

Moreover, Taylor series are inherently nonlinear, and much of the mathematical toolset around AI assumes general linearity (cue linear algebra), with sigmoids being the exception. Going beyond cubic approximations also tends to make errors worse (as expressed in SNR).
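
A toy illustration of that last point (my own, not from the paper), using tanh(5x) as a stand-in for a steep, nearly discontinuous curve:

```python
import numpy as np

# Taylor series of tanh: u - u^3/3 + 2u^5/15 - 17u^7/315 + ...
def taylor_tanh(u, order):
    coeffs = {1: 1.0, 3: -1.0 / 3, 5: 2.0 / 15, 7: -17.0 / 315}
    return sum(c * u**p for p, c in coeffs.items() if p <= order)

x = np.linspace(-1, 1, 401)
u = 5 * x
exact = np.tanh(u)
for order in (1, 3, 5, 7):
    err = np.abs(exact - taylor_tanh(u, order))
    near = err[np.abs(u) < 1].max()   # near the expansion point
    everywhere = err.max()            # across the whole steep region
    print(order, near, everywhere)

# Near 0 the error shrinks with order, but across the steep region the
# truncated polynomial swings ever harder and the worst-case error grows.
```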

reply
It converges to conventional attention as P goes up.
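
A quick toy check of that claim (my sketch under assumed details, not the paper's exact formulation: exp in the softmax is replaced by its P-term Taylor series and then normalized the same way):

```python
import numpy as np
from math import factorial

rng = np.random.default_rng(0)
n, d = 16, 8
Q, K, V = rng.standard_normal((3, n, d))

S = Q @ K.T / np.sqrt(d)                              # attention scores
E = np.exp(S)
exact = (E / E.sum(-1, keepdims=True)) @ V            # conventional attention

for P in (1, 2, 4, 8, 16):
    W = sum(S**p / factorial(p) for p in range(P + 1))   # Taylor-truncated exp(S)
    approx = (W / W.sum(-1, keepdims=True)) @ V
    print(P, np.max(np.abs(exact - approx)))

# The deviation from conventional attention falls toward zero as P grows.
```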
reply