undefined

points

by 2019844 hours ago|

[-]

Shared libraries (and mmapped files in general) are deduplicated; it's nowhere near as bad as you think. The kernel loads a .so into memory once and then maps that memory into every process that mmaps it.

Editing to add: this deduplication is one of the greatest upsides to dynamic linking. Common libs like libgcc and libc only have to exist in memory once and can stay in CPU caches, whereas if they were statically linked into every binary, each binary would have a copy of that library that wouldn't be shared with anything else and you'd waste a lot of memory.

by sjmulder3 hours ago|

parent|

[-]

Doesn't the loaded code have to be patched for relocations?

by ptspts3 hours ago|

parent|

[-]

It does, so not 100% is reused. The patched parts are in different sections though, so the entire .text (code) section ends up being reused.

by monocasa3 hours ago|

parent|

prev|

[-]

Not on modern archs that provide decent support for PIE (position independent executables).

by 2019842 hours ago|

parent|

[-]

How do you think position independent code can call functions from other .so's without being patched with their addresses?

They can't, so even PIC code still has to have a relocation table that gets patched. It's in a different page than the code though, so code does still get reused.

by monocasa2 hours ago|

parent|

[-]

That's not really patching though, any more than any use of function pointers is patching.

by 20198445 minutes ago|

parent|

[-]

There's a part of the .so ELF file (the Global Offset Table aka GOT) that has to be modified with all the addresses of the functions being imported, which of course vary from process to process.

If not patching, what exactly would you call modifying part of the file?

by monocasa2 minutes ago|

parent|

[-]

And the got is just a big table of pointers like any other table of pointers your application manipulates as it runs.

This isn't meant as a reductive take, but instead that there is a difference between completely describable in C like the contents of the .got section, and something like a .reloc section that actually has to understand the generated assembly in order to build the relocation table to load and link the executable. Both are linking, but I've saved "patching" for more brain surgery esque techniques. Like on mips, the jump instruction immediate is the bottom 26 bits of the absolute address of the target, so you're going through and modifying all of the jump instructions if you load it to somewhere it wasn't linked at.

by t-33 hours ago|

parent|

prev|

[-]

Not if it's position-independent.

by saidinesh54 hours ago|

prev|

[-]

Typically libgcc_so.so is loaded by the linker, which uses an mmap call to map the binary into the address space.

> The kernel keeps track of which file is mapped where, and can detect when a request is made to map an already mapped file again, avoiding physical memory allocation if possible.

Relevant stack overflow answer: https://stackoverflow.com/questions/61950951/linux-shared-li...

by mlaretallack4 hours ago|

prev|

[-]

In Linux, when a shared lib is loaded by multiple processes, its loaded once and not duplicated in ram. Only if a memory page is modified by the process will the memory be duplicated. (Hope I have explained that correctly)

by monocasa4 hours ago|

prev|

[-]

Those mappings by default all go to the same shared memory.

Unices have been sharing executable memory between processes longer than there's been mmap for user space to do the same thing themselves. I remember seeing it in the 2BSD kernel for instance.

by 4 hours ago|

prev|

[-]

deleted

by BoingBoomTschak3 hours ago|

prev|

[-]

Eh? Aren't shared libraries actually shared in memory?

by 17186274403 hours ago|

parent|

[-]

Yeah, that's kind of the point.

by sirsinsalot3 hours ago|

prev|

[-]

I have a rule for myself. If I think something is silly or stupid, I assume I don't understand it. I usually find I do not understand it, and it no longer seems silly when I do understand it.

In this case too, you think it is silly because you don't understand it. Your assumptions are wrong, making it seem silly.