Everything is logarithms

upvote

Everything is logarithms

(alexkritchevsky.com)

290 points

by E-Reverance21 hours ago |

upvote

by xelxebar14 hours ago|

[-]

The baseless log here is just a torsor [0]!

Lots of things are torsors: position, currency values, calendar dates etc. the vales themselves are arbitrary, and translating/scaling them by some value doesn't make a functional difference. Torsors let us talk about these things without needing to make such an arbitrary choice a priori.

In the case of baseless logs, the underlying set is "information units", i.e. log 2 is bits, log e is nats, log 10 is digits, etc. The conversion factors give us the torsor's group, and picking a privileged unit is just a trivialization of the torsor.

The vector division notation is, similarly, encoding a g-torsor in precisely the same way as length units are.

The examples so far are all torsors with abelian groups, but specifying position both requires choosing an origin and a length unit. The group of this torsor is a suitable semidirect product between translation and scaling, which gives a non-abelian group.

Most of the time we just implicitly choose a trivialization, which often causes confusion because it identifies objects with operations on them, e.g. conflating vectors as positions with vectors as translations. The author's treatise on problems with geometric algebra [1] even brings up this point!

[0]:https://math.ucr.edu/home/baez/torsors.html

[1]:https://alexkritchevsky.com/2024/02/28/geometric-algebra.htm...

reply

upvote

by adrian_b12 hours ago|

[-]

Using the term "torsor" for that mathematical concept has been a very bad choice, both because the concept does not have any obvious relationship with the meaning of the word and because the word "torsor" had already been used for a very long time in classical mechanics for a very different concept, i.e. for the quantity that must be null for a rigid body to stay in equilibrium (i.e. the pair of a resultant force and a resultant torque).

Unfortunately, in mathematics there already is a long tradition of reusing common words to designate concepts that have no relationship whatsoever with the original meanings of those words. This obfuscates the content of many mathematical books or research papers, because even when they state trivial facts the statements are opaque for those unfamiliar with the specific jargon used in that niche branch of mathematics.

reply

upvote

by xelxebar9 hours ago|

[-]

Words happen more than they are chosen, cf. "computer". The term "torsor" in this sense likely comes from the French "torseur" [0], which was used to describe rigid-body motions via a fundamental screw-like action.

The hypothesis seems to be that the idea of affine spaces came out of that theory, for whatever reason, which was subsequently generalized to principle bundles and finally into what we have now. The point is that, at every step along the way, we want to connect the incrementally new ideas to existing ones, and creating a hard break with new, idiosyncratic terminology is itself obfuscatory.

My beef is more with use of the heavily-overloaded words "regular" and "normal" in math, which just seems like lazy naming:

> In the normal extension K/Q, every normal subgroup of the regular representation acts on a normal scheme that is regular in codimension one, whose normal bundle — orthonormal to the regular surface at each regular value — carries a normal operator whose spectrum follows a normal distribution over a space that is at once regular and normal, all indexed by a regular cardinal.

That's like 8 different meanings of normal and 6 different meanings of regular. lol

[0]:https://fr.wikipedia.org/wiki/Torseur

reply

upvote

by sorokod7 hours ago|

[-]

"computer" happened a while ago, it's usage predates the electronic computers as:

"a person who makes calculations, especially with a calculating machine."

Google ngram view:

https://books.google.com/ngrams/graph?content=computer&year_...

reply

upvote

by vi_sextus_vi11 hours ago|

[-]

Yeah, see this thread --- I assume these guys haven't heard of the other meaning neither

https://golem.ph.utexas.edu/category/2013/06/torsors_and_enr...

Consider in particular that use of ‘distance’

>I think you can look at adjoint profunctors from the unit category and show that they consist of giving a consistent ‘distance’ to every object, which in a torsor will be represented.

reply

upvote

by ajkjk13 hours ago|

[-]

I do know about torsors actually but I didn't think to link it from there. I guess I don't find the term very useful; it feels like things are still hard to think about even after you know it's a torsor!---but also, I think I need to get more familiar with the concept, because the other commenter on here who described my basis-logarithm as a "GL(V)-torsor" really said it much more succinctly than what I was hacking out manually.

Regardless of the terminology, I thought it was interesting because I have never seen the logarithm thought about in that way.

reply

upvote

by xelxebar10 hours ago|

[-]

Thanks for the article. I do think your more elementary approach is good pedagogy since the subject is so broadly familiar already. I just like torsors, since they elegantly encode the "arbitrary choice" needed to deal with lots of objects.

Thanks for the writeup!

reply

upvote

by ajkjk46 minutes ago|

[-]

glad you liked it

I wonder if we should really just call them... vectors? Like the thing that torsors do, being defined only relative to a choice of origin in some space / group, is exactly what displacement vectors do. So really they are just generalizations of the concept of a vector. (In this scheme I would be careful to _not_ refer to points as vectors, so as to reserve the term for things that act like, well, torsors. I happen to think that much pedagogical harm has been done by not distinguishing the two concepts, points and displacements, early on.)

reply

upvote

by whattheheckheck13 hours ago|

[-]

Thanks for sharing, very interesting. I wonder how this maps to swe

reply

upvote

by helterskelter19 hours ago|

[-]

Logs are awesome. I started a math textbook from the 1920's a while ago, and all the calculations relied on tabulated logs, where you would convert the number to a log in a table to reduce the operation's degree, then convert back to the ordinary representation. This would reduce operations like finding cubed roots to division, would could be converted to log-log to be further reduced to subtraction before you would restore to ordinary notation. It feels like you're using a magic wormhole or something when you're doing this stuff by hand, it's really neat.

reply

upvote

by badlibrarian19 hours ago|

[-]

The physical version of that magic wormhole is called a slide rule.

reply

upvote

by eru15 hours ago|

[-]

Another neat application, if a bit simplistic, are these mechanical paper computer that let you figure out your body-mass-index. They are basically two disks with logarithmic scales on them that you rotate relative to each other. Like a slide-rule, but circular. I think you can find them under the name 'BMI wheel'.

reply

upvote

by madcaptenor5 hours ago|

[-]

These exist for various medical usages. I've seen one to compute the due date of a pregnancy (from either the date of conception or the last missed period). This was at an obstetrician's office. It was probably dropped off by a sales rep and had the logo of some medication or other.

reply

upvote

by utopiah11 hours ago|

[-]

Indeed, you can get yours today https://www.instructables.com/Make-a-simple-paper-slide-rule...

reply

upvote

by all215 hours ago|

[-]

Got a PDF? I love old books like this.

reply

upvote

by helterskelter15 hours ago|

[-]

Here you go:

https://www.google.com/books/edition/Trigonometry_for_Naviga...

See my other comment:

https://news.ycombinator.com/item?id=48623646

reply

upvote

by eager_learner19 hours ago|

[-]

care to share the name of the said book?

reply

upvote

by helterskelter18 hours ago|

[-]

Trigonometry for Navigating Officers by WP Winter

https://www.google.com/books/edition/Trigonometry_for_Naviga...

I found this book because I was a little rusty on my trig and most celestial navigation texts will just throw the PZX equation (and others) at you without breaking down what's actually being done with it on a mathematical level...it's just kind of treated like a magical black box without any discussion, and I'd rather have a complete understanding of what I'm doing and why. Having an application-specific approach also makes it a lot easier to learn.

I'm using it with Norie's Nautical Tables, which has the log tables and a whole lot else:

https://bluewaterweb.com/product/nories-nautical-tables-2025...

I'm sure there are plenty of free PDF's of log tables you can find though.

(I believe they used log tables on boats primarily because it's easier to use than a slide rule when everything is constantly rocking back and forth.)

reply

upvote

by dieselgate2 hours ago|

[-]

Any other recommendations for getting into celestial navigation? I've used a sextant a few times and would like to purchase one but am aware that's only the hardware-side of things. Do the books you mentioned above provide sufficient tabulation for navigation? I sail in the Puget Sound for reference, thank you!

reply

upvote

by porridgeraisin7 hours ago|

[-]

Yep, we used manual math + some log tables for calculations in our school exams as late as last decade. Since calculators were not allowed. The exam would be such that you would need the log tables once or twice over the course of the exam. Example: dividing = lookup(a)-lookup(b) and then lookup that in the inverse log (i.e exp) tables.

reply

upvote

by orc004 hours ago|

[-]

Charles Petzold's The Lost Art of Logarithms is a great read (still a work in progress).

https://www.lostartoflogarithms.com/

reply

upvote

by rramadass2 hours ago|

[-]

This looks great; thanks for the pointer.

Charles Petzold's writings are always very clear and in-depth.

reply

upvote

by GL261 hours ago|

[-]

The same idea comes up in physics. In quantum physics, the action S appears as the logarithm-like quantity behind the amplitude e^iS/(h^bar). In statistical mechanics, entropy is the logarithm of the number of possible microstates Omega : S = log(Omega). Although the concepts come from different parts of physics, they both reflect the same principle: using a log as a way to turn multiplicative relationships into additive ones.

reply

upvote

by amavect2 hours ago|

[-]

>You might ask: if we have a baseless logarithm log(N), do we also have a “baseless exponential”?

Sure we can, with some naive algebra. If we can take log(x,base) and drop the base, then we can also take pow(base,x) and drop the base. Since bits=log(2), then pow(bits)=2. You can probably connect it to the reverse of things, like integrals.

Also, for fun, I'll play with some notation tricks.

  log(freq) = pitch
  freq = pow(pitch)
  octave = log(2)

  400*Hz = 100*Hz*4  // the frequency 400 Hz equals 4 times 100 Hz
  log(400*Hz) = log(100*Hz) + log(4)
  log(400*Hz) = log(100*Hz) + 2*log(2)
  log(400*Hz) = log(100*Hz) + 2*octave
  log(400*Hz) = log(100*Hz) + 2*octave  // the pitch of 400 Hz equals 2 octaves above the pitch of 100 Hz

  cent = log(2)/1200
  A4 = log(440*Hz)
  B4 = A4 + 200*cent  // the pitch B4 equals 200 cents above A4
  B4 = log(440*Hz) + 200*log(2)/1200
  B4 = log(440*Hz) + log(2^(2/12))
  B4 = log(440*Hz * 2^(2/12))
  pow(B4) = 493.883 Hz  // the frequency of B4 equals 493.883 Hz

I like the intuition that baseless logarithm notation gives, and it also avoids needing to choose a specific reference point. I can also directly calculate by choosing an arbitrary base:

  pow(log(440*Hz) + 200*log(2)/1200)
  exp(ln(440) + 200*ln(2)/1200)

reply

upvote

by amavect4 minutes ago|

[-]

Hah, I can use this to give decibels an actual unit.

  dB_P = log(10)/10
  dB_F = log(10)/20
  log(10*V) = log(V) + 20*dB_F  // the level of 10 V equals 20 dB more than the power level of 1 V.

  SPL = 20*10^-6 * Pa
  hearing_damage = log(SPL) + 90*dB_F  // hearing damage occurs over 90 dB_F above SPL (neglecting A-weighting)
  pow(hearing_damage) = pow(log(SPL) + 90*dB_F))
  pow(hearing_damage) = pow(log(SPL) + 90*log(10)/20))
  pow(hearing_damage) = SPL*pow(90*log(10)/20))
  pow(hearing_damage) = SPL*31622.7766  // the pressure of hearing damage occurs above 31622 times SPL
  pow(hearing_damage) = 0.632455532 Pa  // the pressure of hearing damage occurs above 0.632 Pa

Very helpful!! Imagine combining the goofy list of decibel suffixes into a uniform notation. Write the logarithm first so the + or - stays in the same spot.

  log(reference_unit) + value*dB_F (or dB_P)
  log(reference_unit) - value*dB_F (or dB_P)

https://en.wikipedia.org/wiki/Decibel#List_of_suffixes

reply

upvote

by ajkjk45 minutes ago|

[-]

True, I guess you can just 'curry' exponentiation and say that's a baseless power. I couldn't find a clean notation for it :p

reply

upvote

by badlibrarian19 hours ago|

[-]

This essay needs a type system. Every time it says “log” it should say: log of what, into what?

It’s like audio where people say "dB" as if it answers the next question. Relative to what, measured how, and weighted for whom?

Author should brush up on https://en.wikipedia.org/wiki/Lie_theory

reply

upvote

by rq117 hours ago|

[-]

The important properties of the logarithm are structural: we usually do not care about units or bases, except when carrying out an actual numerical computation.

As developed in the article, informally, but somewhat sufficiently, the change of base formula shows that the choice of base is largely irrelevant: different bases give equivalent logarithms up to a constant factor.

The Taylor expansion of exp gives a more intrinsic and general definition of the exponential function. This allows exp to be generalised structurally to many algebraic settings, provided the relevant convergence conditions are met: for example, the complex exponential and its many possible logs, the matrix exponential, and so on…

reply

upvote

by eru15 hours ago|

[-]

> The important properties of the logarithm are structural: we usually do not care about units or bases, except when carrying out an actual numerical computation.

Units are important as a sort-of type system, even at the conceptual level.

You are right that bases are not as important conceptually.

reply

upvote

by jfengel16 hours ago|

[-]

I still don't understand why audio dB are negative. That's relative to what? What happens at 0dB?

reply

upvote

by eru15 hours ago|

[-]

Well, the brightness of celestial objects is also sometimes negative:

> The apparent magnitude of known objects can range from −26.832 for our Sun to about +31.5 for objects in deep space imaged by the Hubble Space Telescope.[3]

See https://en.wikipedia.org/wiki/Apparent_magnitude

reply

upvote

by Sharlin7 hours ago|

[-]

And this is because Ptolemy’s catalog in which he ranked stars by their apparent brightness on a scale of one to six, one being the brightest. Ptolemy’s scale was (much later) retrofitted to a log scale (base 100^(1/5) or about 2.512), allowing extrapolation to both brighter and dimmer objects. The brightest of Ptolemy’s first-magnitude stars actually have negative magnitudes by the modern definition.

reply

upvote

by deepspace15 hours ago|

[-]

0db is usually defined as the loudest sound that the audio system can produce. Hence, everything else must be negative.

reply

upvote

by rdbl2714 hours ago|

[-]

More specifically, 0 dB is the loudest sound the audio system is rated to produce without distortion. It's common to be able to actually drive systems harder than their specified engineering limits, which is why meters have a short positive dB section marked in red.

reply

upvote

by mitthrowaway215 hours ago|

[-]

Of course, typical of the wonderful ambiguity of decibels, 0 dB is also usually defined as the quietest sound that the human ear can perceive.

https://en.wikipedia.org/wiki/Absolute_threshold_of_hearing

reply

upvote

by ianburrell14 hours ago|

[-]

That's why important to give the scale. dBfs is full scale level, and db SPL is sound pressure level.

reply

upvote

by rramadass2 hours ago|

[-]

Yep.

"Sound Power Level SWL", "Sound Pressure level SPL", and "Sound Intensity Level SIL" are different quantities which should not be confused. - https://sengpielaudio.com/calculator-soundpower.htm

A sound source produces sound power and this generates a sound pressure fluctuation in the air. Sound power is the distance independent cause of this, whereas sound pressure is the distance-dependent effect.

Sound pressure p is a "sound field quantity" and sound intensity I is a "sound energy quantity". In teachings these terms are not often separated sharply enough and sometimes are even set equal. But I ~ p2.

reply

upvote

by rramadass1 hours ago|

[-]

Articles:

Understanding dB - http://www.jimprice.com/prosound/db.htm

dBFS - https://en.wikipedia.org/wiki/DBFS

Videos:

Understanding dB level by Paul McGowan - https://www.youtube.com/watch?v=t3Via4c8SlI

Paul explains 0dB and why there's a minus sign on volume - https://www.youtube.com/watch?v=NgEr6NEDPd4

See also https://news.ycombinator.com/item?id=48632331

reply

upvote

by kevin_thibedeau15 hours ago|

[-]

That is dB full scale where 0 is an absolute ceiling and you can deduct from there.

reply

upvote

by jmyeet19 hours ago|

[-]

The first section details how the author thinks of "log N" with no base as an abstract object rather than a number. Or what are you referring to?

reply

upvote

by badlibrarian19 hours ago|

[-]

The first section is the good part.

The later reuse of “log” across valuations, dimension, vector fields, orders of vanishing is not so good. Those may be related ideas, but each needs a type signature: from what, to what, and preserving which operation?

reply

upvote

by exmadscientist17 hours ago|

[-]

Or, to say a little more explicitly what you're getting at: when you take a logarithm of some quantity, log x, x absolutely must be unitless. There's no way whatsoever to take a logarithm of something with a unit attached. (This is an important and useful dimensional analysis check in formulas and long calculations!)

So what do you do in practice? You have to normalize: you don't calculate log x, but instead log x/U for some scaling unit U. It's typical for U to be something like 1 mV or 1 W in electrical engineering, for example. This is completely legitimate, but it does mean that the thing that comes out needs a corresponding unit attached to it: dBmV, dBW, et cetera.

And it's really kind of important to be careful about that.

reply

upvote

by aesthesia16 hours ago|

[-]

I think what's going on with the complex logarithm is basically the same as the logarithm that outputs the set of all possible bases for a vector space. The complex logarithm produces a Z-torsor, and the basis logarithm produces a GL(V)-torsor. There's probably some way to represent a choice of branch cut as a part of the choice of the base of the complex logarithm, and similarly the choice of a specific basis as part of the choice of base of the vector space base logarithm.

reply

upvote

by ajkjk13 hours ago|

[-]

Interesting, it did not occur to me of those as two instances of the same phenomenon. Although I still find the complex analytic one hard to think about.

reply

upvote

by adrian_b12 hours ago|

[-]

The term "baseless logarithm" is really nonsensical and using it would be a great mistake.

Nonetheless, where the author of TFA is correct is that logarithms are a single physical quantity, like length, area or volume, and that choosing the so called "base" is choosing the unit of measurement for logarithms.

Logarithms are included in the dimensional formulae of many derived physical quantities, e.g. for describing the attenuation or amplification of waves during their propagation, where one uses quantities like logarithm per length and logarithm per time.

Changing the "base" of logarithms modifies the numeric values of all derived physical quantities exactly in the same manner as changing any other fundamental unit of measurement, like the unit of length or the unit of time.

Like for any physical quantity, the complete value of a logarithm is independent of the unit of measurement, because it is the product between the numeric value and the unit of measurement. When the unit of measurement is changed, both the numeric value and the unit are changed and the product stays the same (i.e. the logarithm corresponds to the same ratio, regardless what base is used to compute a numeric value for the logarithm).

Nowadays, the unit of logarithms is normally chosen between the octave (binary logarithms), neper (hyperbolic logarithms) or bel (decimal logarithms).

The units of measurement for logarithms are not the bases, but the logarithms of the bases, which is why e.g. the value of the number "e", the base of the hyperbolic logarithms, is never needed in any computation. The only values that are needed are "ln 2" or its inverse "log2 e", which are used to convert the numeric values of logarithms when the unit of measurement is changed between those corresponding to binary logarithms and to hyperbolic logarithms (a.k.a. natural logarithms, but there is nothing more "natural" about hyperbolic logarithms than about any other kind of logarithms).

reply

upvote

by jmyeet4 hours ago|

[-]

"Baseless logarithm" is not nonsencial. Given that:

    d(logₐx)/dx = 1/(x log(a))

a baseless logarithm is simply a family of functions with similar properties. Perhaps it might be clearer if the author said something like the "logarithm property" rather than "baseless logarithm" but that's nit-picking and debatable.

As for changing the base changes the numbers, I have to wonder if you've done any advanced linear algebra or, more specifically, tensors. The whole point of a tensor is that it operates the same on an object regardless of the basis. Put another way, if a and b are two representations of the same object with different bases then T(a) and T(b) are equivalent if T(x) is a tensor.

My point is that any numbers are an arbitrary choice and they don't define the underlying structure. The author here is talking about logarithmic structure.

This btw is why you learn about different bases in linear algebra and converting between them. Or even polar coordinates vs cartesian coordinates (in high school, for some reason). They're priming you to learn about structure. You get to groups and learn that group A and B are isomorphic they have the same mathetmatical structure.

Even when the numbers change.

reply

upvote

by adrian_b3 hours ago|

[-]

You use the word "logarithm" with the meaning "logarithmic function", i.e. a function whose argument is a ratio and whose result is a numeric value that gives the corresponding logarithm in a certain base.

I use the word "logarithm" in its original sense, meaning "logarithmic quantity". Logarithms are a certain kind of quantity, which measures numeric ratios, like other quantities measure various things, e.g. plane angles, lengths, time or cardinal numbers, where the latter measure how many elements are in a set.

Even for cardinal numbers, where there is an obvious "natural" unit, the number "1", it is frequent in practice (e.g. when computing statistical quantities) to choose other units of measurement, like a thousand, a million, a billion, the Avogadro number, the Curie number, etc.

Both for a logarithm or for a cardinal number, like for a distance or an angle, the complete value is independent of the chosen unit of measurement, even if the numeric value changes.

As you say, while for a scalar quantity the complete value is independent of the unit of measurement, for a vector quantity or tensor quantity the complete value is also independent of the chosen reference system of coordinates, even if the numeric values of the components of a vector or tensor change when the reference system is changed.

However, all these have nothing to do with whether the term "baseless logarithm" makes sense.

You say that this should be used as a term with the meaning "logarithmic function" (because the family of functions defined by you is the same as the family of functions traditionally named "logarithmic functions", since Leonhard Euler).

I say that this claim is baseless itself, because the term "logarithmic function" has been in use for almost three centuries and there is absolutely no need to invent another term, which also does not make sense etymologically, because when computing any logarithmic function, i.e. any member of the function family that has the property mentioned by you, you need a concrete base value, i.e. no such function is baseless.

reply

upvote

by anArbitraryOne15 hours ago|

[-]

I can't believe he called normal logarithms 'based'

reply

upvote

by kfse16 hours ago|

[-]

All this would be way more interesting if it actually helped to demonstrate a novel mathematical fact. Right now it's more like notational play.

reply

upvote

by ajkjk44 minutes ago|

[-]

I happen to think that novel facts and theorems and proofs are way overrated. If you find a new fact it just goes into the giant pile of facts that are sitting around uselessly. The useful progress in math is comes from "refactoring" efforts to make things simpler and more intuitive.

I don't mean that this is necessarily the case, but that it is where we are now: we have found ourself in a situation where we have way too many facts and not enough simple perspectives that make them useful and accessible.

Just my opinion, though.

reply

upvote

by sixo15 hours ago|

[-]

I read this kind of essay as a certain part of the arc by which new thoughts are formed: an act of large-scale pattern matching, laying out a bunch of cases which resemble each other, searching for the essential basis of the resemblance.

To post such a pattern allows the thought process to become distributed. Perhaps someone else will see the insight.

reply

upvote

by myzek11 hours ago|

[-]

Wasn't there some scientific paper recently that proved that every operation can be represented as a logarithm? Like, the same as every logic gate can be derived from NAND gates

reply

upvote

by ebolyen5 hours ago|

[-]

Was it this exp-minus-log arxiv paper?: https://arxiv.org/html/2603.21852v2

reply

upvote

by amelius19 hours ago|

[-]

Does this answer the question of why we see hyperoperations until exponentiation in physics, but not higher?

reply

upvote

by AnotherGoodName19 hours ago|

[-]

I think that's more about integrations/differentials not producing them (generally speaking). Physics likes to deal with integrals and differentiation as you calculate change over time or over spatial dimensions.

Eg. the integral of x^10 is x^11 / 11 + c. No hyper-operation appears and it's just another exponential (with a division).

The integral of log(x) is xlog(x) - x + c. So still basically just a logarithm

Even the integral of 2^x is just 2^x / log(2). Still basically the same thing.

There's no easy way to pull a hyper-operation out.

reply

upvote

by renyicircle5 hours ago|

[-]

I'd say integrals or differentials are not as important on their own as the kinds of differential equations that come up in physics. Integrals and differentials don't produce hyperoperations from non-hyperoperations, but a solution to something as simple as y' - e^x y = 0 will have a double exponential.

However a lot of DEs in physics are linear second-order with coefficients that are most often constants or polynomials, and if they're not polynomial they are made to be so using series expansions, under reasonable assumptions. This already brings you a long way towards solving the problem. The resulting equations usually have trigonometric/exponential/special function solutions.

It's still possible that hyper-operations like a double exponential might come up in the study of some specific non-linear problems. As in the example above, if you have an exponential function as a coefficient in your differential equation you might get a double exponential in the solution somewhere. I'm not familiar with any specific physics examples though.

reply

upvote

by jongjong19 hours ago|

[-]

That's a lot of ways to think about logarithms.

Logarithms are laughably simple once you've fully internalized the meaning of the log function; it simply answers the question:

"To what power must I raise the base to get the argument?"

This is why the output tapers out as you increase the argument; because even if you increase the argument exponentially, you only need a fixed increment in the power to reach that number... So if you increase the argument only by a fixed amount (linearly) instead of exponentially, then it makes sense that the output will grow sub-linearly.

I remember when I was doing algebra with logs many years ago at school, I was applying rules to remove the log from one side of the equation.

Then when I got to uni, I had to revise the rules but it was kind of silly of me because those rules can be trivially derived if you just think about what the log function means. Turns out I had been solving equations with logs throughout school without understanding what they even meant... It's only at university that I actually bothered to learn them.

Actually TBH. I didn't even fully understand powers for some time even though I was doing calculus with them at school. I only fully understood powers once I properly internalized the concept of k-ary trees as a proxy.

It's one thing to be able to apply something, another to understand it. And I think to innovate with something, as a tool, it's not enough to be able to apply it. You must understand it.

reply

upvote

by rramadass12 hours ago|

[-]

A better way to understand logarithms is to start with the original motivation from Napier himself (https://sites.pitt.edu/~super1/lecture/lec44911/005.htm);

Seeing there is nothing (right well-beloved Students of the Mathematics) that is so troublesome to mathematical practice, nor that doth more molest and hinder calculators, than the multiplications, divisions, square and cubical extractions of great numbers, which besides the tedious expense of time are for the most part subject to many slippery errors, I began therefore to consider in my mind by what certain and ready art I might remove those hindrances. And having thought upon many things to this purpose, I found at length some excellent brief rules to be treated of (perhaps) hereafter. But amongst all, none more profitable than this which together with the hard and tedious multiplications, divisions, and extractions of roots, doth also cast away from the work itself even the very numbers themselves that are to be multiplied, divided and resolved into roots, and putteth other numbers in their place which perform as much as they can do, only by addition and subtraction, division by two or division by three.

This is what provides the intuition viz; convert multiplication/division/etc. of large numbers into addition/subtraction of two other smaller numbers. Logarithms as inverse of Exponentiation came much later. Starting with this generally confuses the student since they do not understand the point of it all.

From https://en.wikipedia.org/wiki/History_of_logarithms;

Napier conceived the logarithm as the relationship between two particles moving along a line, one at constant speed and the other at a speed proportional to its distance from a fixed endpoint.

Since the speed is directly proportional to its remaining distance from the fixed endpoint, it therefore is a deceleration, which results in the characteristic "flattening" of the curve.

Further details for understanding the above can be found at Priority, Parallel Discovery, and Pre-eminence: Napier, Burgi and the Early History of the Logarithm Relation (pdf) - http://www.numdam.org/item/RHM_2012__18_2_223_0.pdf

reply

upvote

by jongjong10 hours ago|

[-]

I find my explanation simpler.

// The power to which I must raise 10 to get 100 is 2.

log10(100) = 2

// The power to which I must raise 10 to get 1000 is 3.

log10(1000) = 3

// The power to which I must raise 3 to get 27 is 3.

log3(27) = 3

Also it makes solving equations much more intuitive:

log3(x) = 4

^ This means; the power to which I must raise 3 to get x is 4. So it follows logically that if I raise 3 to the power of 4, I will get x. This makes it intuitive that this equation can be rewritten as:

x = 3 ^ 4

You don't even need to know the algebraic rule. I felt retarded when I figured this out. This was a rule I had memorized before. It's even dumber and easier to infer than the rule to compute derivatives. I wonder why teachers even bother to teach you all these rules when they could just explain the fundamentals to you.

reply

upvote

by rramadass7 hours ago|

[-]

That is just the definition of Logarithm which is what is taught to all students today i.e.

Given a^x = b we define log_a(b) = x where 'a' is a +ve real number - https://en.wikipedia.org/wiki/Logarithm#Definition

The above wikipedia page also details the properties, applications and generalization of the logarithm concept which are non-trivial.

As i pointed out above, that does not help in intuiting why it is helpful and needed. That is why you need to read the history of logarithms and see how we arrived at the above standard.

Napier actually calculated logarithms of sines for every minute from 0-90degrees to simplify astronomical calculations. The complexity/sizes involved, precision needed etc. can all be seen in this detailed paper walking you through the entire process of table construction; Napier’s ideal construction of the logarithms (pdf) - https://locomat.loria.fr/napier/napier1619construction.pdf

reply

upvote

by whattheheckheck13 hours ago|

[-]

What made you want to understand it or did it happen upon you in college

reply

upvote

by jongjong12 hours ago|

[-]

It happened during college.

I had a weird relationship with Math growing up; I alternated between getting very high grades and terrible grades depending on the teacher. I didn't like all the notations and conventions of Math and the way it was taught, but I enjoyed it conceptually. It had ended badly in high school as I did poorly in advanced Math though I did quite well in all my other subjects so I got into a good Software Engineering degree at a top 50 university for engineering globally anyway.

But early in college, it occurred to me that I didn't understand Math concepts as intuitively as I understood programming concepts so I challenged myself to revisit everything from the beginning including numbers, addition, subtraction, fractions, roots, powers, probabilities, derivatives, integrals, vectors, matrices, calculus...

I had to free myself from thinking of Math as symbols on a piece of paper and think of it as being about actual quantities, transformations and combinations. I needed a completely new way to think about it and visualize every single step. When I was practicing calculus, I would stop at each step and try to visualize the equation. For example, when finding the 3D plane perpendicular to a point on a 3D curve, I would put effort into visualizing what happened to the equations across different dimensions at each step when I found the partial derivatives and combined them to get the 3D plane vectors.

My Math grades at university were quite good. I passed all the Math courses with ease and got several distinctions even.

reply

upvote

by saulpw17 hours ago|

[-]

This sentiment (not necessarily the content) is what I'm striving to communicate with Mag World[0] (website and podcast so far).

[0] magworld.pw

reply

upvote

by psychoslave11 hours ago|

[-]

IIRC, Knuth use lg for logarithm base 2.

reply

upvote

by monkamonme13 hours ago|

[-]

[flagged]

reply

upvote

by yaccb320 hours ago|

[-]

Look, the whole thing actually makes sense and the core idea is pretty cool because it's true that a lot of stuff in math looks identical. But in my opinion this is way too much of a macro-level overgeneralization and you risk throwing everything into the same pot, which ends up diluting the actual point of things.I mean, if you take a hammer and a meat mallet, at the end of the day they're both chunks of metal used to hit stuff, but if you bunch them together without making any distinction, you lose track of why you use one to drive nails into a wall and the other to prep cutlets.Saying everything is just one big logarithm is a nice mental exercise, but I feel like it flattens out the differences too much and makes you lose the practical utility of the individual math tools, which are meant to solve completely different problems.

reply

upvote

by galaxyLogic13 hours ago|

[-]

I'm a programmer so to me this brings to mind the idea of classes and subclasses. A program is implemented by having a set of classes. The classes can be organized into a class-hierarchy where they inherit methods from their ancestor-classes.

Now assume originally you did not have the feature of inheritance in your programming language so you would just create all the classes you need without orgnizing them into an inheritance-tree. Then you upgraded to a language that doe shave inheritance and you wanted to refactor your program to omit duplicate definitions of methods.

What kind of class-hierarchy would you come up with? There is no single way to do it. Some ways are better than others. There migh be more than one optimal way.

Same goes with generalization general, it is part of the language we create to describe things and there are many different languages we may come up with, some simpler, some more difficult to understand.

reply

upvote

by SadErn13 hours ago|

[-]

[dead]

reply