There’s nothing wrong with this post factually, but the tone sucks. It has an immensely combative energy for what is not really a charged subject matter.
Like sure. Today, a lot of the historical reasons for things seem silly and irrelevant. At one point, they did not seem silly and irrelevant. For compatibility with stuff sticking around from those days, we get some performance penalties that are not strictly necessary. I don’t think anyone is doing that to be an asshole, so the oddly antagonistic tone seems unjustified.
And yes, Windows with a module-level namespace is cleaner in this regard, but Windows’ design is entirely different and has plenty of its own skeletons. ELF does not, to me, feel significantly more horrible than PE. And I’m not speaking from inexperience; I have written a couple of ELF and PE parsers over time, most recently go-winloader[1].
Do we need to override symbols in the same library? Probably not... kind of. Your modules may in fact not need this. However, libc probably does. Take a look at what symbols libpthread exports on your system some time.
I hate to be the person to point this out, but please consider not approaching subjects from this position. It feels alienating, and I have no idea why it’s necessary to have such a tone.
Agreed. I was trying to find the words for what put me off about the article, but you nailed it. The tone made me want to disagree with it just by default. Luckily I 1) recognize that I am not qualified to have an opinion on the technical details and 2) ruthlessly crush instinctual responses until I've thought them through with less emotion. (Most of the time... I'm not a robot, or perfect.)
Someone in a sibling thread said it's not bad to write like that for catharsis... I guess to blow off steam or something. But if the method of blowing off steam is belittling other smart people that don't always make perfect decisions then it's probably not a great way to go. If you need to write it for catharsis, go for it, but there's no need to publish it.
Otherwise, my questions on the technical side: Would this performance hit and the alternative option have been obvious at the time? If so, was there a reasonable trade off for why this approach was taken? Or was this choice only wrong in retrospect?
I read it as a way to make a tedious topic more entertaining. It didn't seem combative at all. I do identify with your desire to disagree with people whose tone I dislike, even if I intellectually think they are right. I wish I could turn that off.
That’s interesting, I have the opposite instinct. I have an urge to agree with and support people who have a more combative or opinionated tone. Strange!
Wow, it must be a cultural thing. I reread it again trying to find this emotion, arrogance or condescending tone different people have read into this, and I really can't see it. It's just very direct. This is an old engineering decision, why shouldn't it be direct?
The OP was writing about a 29 year old design decision, and he wasn’t writing about a person. Design decisions don’t have feelings. I found his no holds barred clarity about something as obscure as dynamic linking namespaces made for an easier if still not easy read.
But that said, I don’t think dynamic linking is in the ELF spec. I believe that’s a de facto OS + dev tools thing rather than an ELF spec de jure thing. His points are still valid.
This is recalling the old Linus debates, but the aggressiveness _doesn't improve the clarity_, and is basically upping the word count.
I'm not tone policing but contesting the premise that "aggressive tone" = "direct". For example:
>(Windows took a different approach and got it right. In Windows, it's okay for multiple DLLs to provide the same symbol, and there's no sad and desperate effort to pretend that a single namespace is still cool.)
>(Windows got this right, where multiple DLLs can provide the same symbol)
There you go. Shorter, and not wasting 3 lines to express your feelings, and _you can still say Windows got it right_.
My feeling is that you can go in and describe a thing succinctly and to the point, and actually get your opinion across! It will be more effective, shorter, and your opinions are backed up with fact! No fluff needed.
This may be subjective. For others (including me), it drives the irritation level higher and makes it harder to pay attention. Ranty writing fuels my own tendency to get annoyed and ranty; it makes me want to disengage to preserve my own mood. ("Holub on Patterns" is one example of a book I couldn't finish because of this quality of the writing.)
Personally for me, concise and information-dense writing makes me sit up and pay attention.
No, your rewritten version misses the fact that they're trying to work around the problem, which is what takes it from "X is wrong because I say so" to "X is causing them tangible problems".
I think it's necessary to criticize technical work (or any work) as long as the people involved care about achieving good results and making things work.
Agreed that we should not be excessively abrasive, and I think the facebook post is leaning that way - but I don't think a world without criticism can work - at a certain point in any field you have to face reality, in which some things work and some don't, and to protect every person involved from "emotional repercussions" is impossible because generally people's beliefs and feelings are all over the map.
At any rate I'm not sure the author of LD_PRELOAD or ELF dynamic symbol interposition is scanning this thread - and after ~30 years distance might have different opinions about them, or at least a thicker skin :)
But the rant isn’t really about the inanimate object. It’s about the author, and it implies a story and paints a character. If there’s a coffee table in a bad place in my office, and I bang my shins into it every morning, there’s a dramatic persona I’m expressing as the person who’s angry at the coffee table. It’s not really about the coffee table - it’s about my relationship. And that’s a strong point of view, and it’s comedic and entertaining because of the strong commitment to that perspective that to everyone else might seem silly and bizarre. It implies a narrative of this person’s commitment to die on the hill of hating linux’s dynamic linking behaviour.
Essentially a lot of criticism of the original article seems to be of the form of “reading strong, angry opinions like this make me feel insecure so please don’t do that”. And that might be a good reason to avoid writing like that. But tone policing, and insisting on emotionally desaturated writing has a cost for the reader and the writer. I think it makes us smaller. And it keeps us in our heads rather than in our hearts. That’s just not the way I want to live my life.
But in this case it's not an atrocity, and whether or not it was even a bad choice (at the time or today) is debatable. It's fine to present an opinion you hold about something, but belittling that thing also tends to implicitly belittle the intellect of people who might agree with that thing, not to mention the people who designed and built that thing.
> But if you can't even attack...
Why do we need to attack something? If we can't explain and support our view that something is wrong or a bad decision without resorting to attacks, perhaps our argument isn't really that strong?
And that's the thing. I don't think the author really presented a strong argument; he tried to convince me by verbally trashing the other side, while the actual logical, coherent argument is buried in a sea of disdain. I think it's still not clear what the default should be. Do we optimize for performance, or for debuggability and tinkerability? I mean, that feels like one of the classic debates that we still have no hard answer for -- and probably never will.
Edited to add: I went back and read the linked Python bug tracker issue[0], which honestly I wish was what HN linked to. It's concise, explains the problem, explains why LD_PRELOAD isn't all that useful for libpython, specifically why this sort of performance degradation is even worse with a library like libpython, and makes sure to call out that this change only affects libpython and not any other shared libraries, where (implicitly) people might find LD_PRELOAD useful.
>Why do we need to attack something? If we can't explain and support our view that something is wrong or a bad decision without resorting to attacks, perhaps our argument isn't really that strong?
It's rather the opposite: if we don't resort to attack, condemn the practice, raise the tone, our argument will be weak.
That's because it's not enough to be right. It also needs to be memorable and resonant. Otherwise people's eyes will just glaze over it.
That's why this post has 218 comments as of now, and why you were involved and will remember it better tomorrow than some purely technical explanation that probably wouldn't even have made the first page (or would have 0-10 comments, typical of such posts).
I found it not particularly clear at all. The entire thing can be boiled down to 2 or 3 sentences, or maybe as many as 10 if you want to include more background information.
When I was reading it, about halfway through I was thinking "god, when is this mediocre rant going to get to the meat of how to fix the problem?"
Design decisions don't appear, they are made by people. Going over the top with profanity or LOLs or l33t speak might make it more entertaining to read, but anyone even a little predisposed to disagree with your assessment is a lot more likely to go on the defensive. If I was talking to someone and told them a design decision I encountered was shitty and bad, and it turned out they were the person who made the decision, I'm going to be a lot less likely to convince them of my assessment than if I had just said "Choosing to do X instead of Y will make it 1.3x faster".
This means the writer is choosing to write for audience entertainment instead of technical advancement & improving the status quo. It's not impossible to do both at the same time, without the insulting tone.
> The OP was writing about a 29 year old design decision, and he wasn’t writing about a person. Design decisions don’t have feelings. I found his no holds barred clarity about something as obscure as dynamic linking namespaces made for an easier if still not easy read.
I don't think that it would've been hard to maintain the exact same level of clarity regarding the subject matter. Perhaps the read was more entertaining due to the abrasive tone, but was it actually easier to read?
Ultimately, design decisions are made by people - if someone made such a takedown of ideas of mine, I would probably be somewhat discouraged, at least as long as I knew they were being sincere and not just doing a bit. I don't think people should flinch from criticizing ideas, but being fair to nuance and history would be welcome too. It's one thing to point out dysfunctions in things, but it's different to pull out a sort of Angry Video Game Nerd-esque persona and drag how horrible things are through the mud.
Maybe this post is more in jest than not and I(/we?) simply did not pick up on the tone being purely for entertainment value. But that's the thing. When people read things like this, I think a lot of people take it too seriously and start to embody this attitude, and it leads to the kind of thought processes where things are either good, or stupid/evil/whatever, with no room for things that are just "not perfect, but overall fine."
Hell, I feel kind of bad due to how unnecessarily personal my comments regarding this feel. How would the author feel reading this? The fact that I may be right doesn't matter, because I'm not some kind of uncaring asshole, and I think most people are not if they are in the right mind.
> But that said, I don’t think dynamic linking is in the ELF spec. I believe that’s a de facto OS + dev tools thing rather than an ELF spec de jure thing. His points are still valid.
Like I said initially, I do not take any issue with the factual content of the post, and agree that most people should be using these compiler flags. And yes, however many years on from when this may not have been the norm, it is now clearly a good idea to make all of your symbols hidden by default. No disagreements from me. I just hope that people don't walk away with the idea that some morons from the past made some horribly stupid mistakes because they just had no idea what they were doing. I wasn't there, but it doesn't feel like that's what happened at all; it feels like as things panned out, some things worked out well and some did not. Some ideas are more clearly 'bad' now than they seemed then. Even today, it would probably be unwise to assume we still know 100% what we're doing. Personally, I think it's hard to ever be absolutely sure you are taking the right lessons away from things that don't work out well.
Not OP, but I read the "antagonistic" style of the post as just the usual catharsis humor. All in-jest. I've used that style of writing plenty before. It's a good way to blow off the steam of working with these rather absurd, archaic systems that we have to tackle on a daily basis. Programming can feel a bit kafkaesque at times, so a bit of aggressive/dark humor goes a long way.
But I do agree, it felt too thick. Still a very interesting topic regardless.
The problem I see is that I often see this type of detached, "wow, look at all of these previous shitty decisions!" attitude inject itself, completely unnecessarily, into the workplace.
I've actually made it something I won't compromise on: I refuse to hire people who I suspect will have this attitude. It's one thing to express frustration at previous decisions that make current work more difficult. However, when I see that morph into an arrogance of "how could people have made such a stupid decision" (especially when some of those "people" may still work at the company), without even trying to understand the context of why that decision was made in the first place, it shows to me that person is not an engineer I want to work with.
On one team, we had a lot of code that had been written hastily because we were OK with taking on technical debt in order to move more quickly. We'd reconsider old design decisions and blame "old $me". It was all done in good fun, and I was happy to take the blame as senior engineer.
I couldn't see anything wrong with the post; it seemed perfectly professional and dry. Then I realized that the link had been updated from one with an arbitrary excerpt:
> (Windows took a different approach and got it right. In Windows, it's okay for multiple DLLs to provide the same symbol, and there's no sad and desperate effort to pretend that a single namespace is still cool.)
There's no need for technical topics to be emotionally charged, it only detracts from communicating the importance, correctness or benefits.
> It becomes a charged subject matter when one works at companies like Google and Facebook and gets used to navigating performance reviews.
The way things were at Amazon when I was there, posts like this would count against a Principal Engineer promotion.
One of the standards engineers are expected to meet is "Respect what has gone before". You don't know the full details of what was going on at the time, you don't know the trade-offs and why, you don't know what they did and didn't know about the situation and what couldn't have been foreseen at the time decisions were made.
Generally speaking people aren't idiots. They do the best they can with what they have, under the circumstances they're operating within, to meet the goals they have.
Almost no one sets out to make a monster impossible to maintain, or with diabolical performance.
Treat it with respect, even while you work to replace it.
This is interesting. Not saying you are incorrect, but, I have worked at Google for a few years and didn't pick up on this, most people seem abundantly polite. But, I can just as easily chalk that up to limited experience, since there is clearly quite a lot of different things going on in any large company.
Would this tone of expression be appropriate in navigating performance reviews? I mean the question honestly: My own answer is "no", but I don't know the culture of performance reviews at companies like that.
>There’s nothing wrong with this post factually, but the tone sucks. It has an immensely combative energy for what is not really a charged subject matter.
Perhaps because the subject matter doesn't matter.
The really important takeaway message is more about the industry/community not paying attention to shitty defaults and winging it for decades, than about the potential speedup and/or this particular mechanism.
This is true for _libpython_ (the shared library version), which is the default on some distros (RedHat, Fedora, Arch), but many others (Debian, Ubuntu) use statically linked Python and never paid this performance tax.
A reference to the classic line, oft attributed to JWZ:
"Some people, when confronted with a problem, think "I know, I'll use regular expressions." Now they have two problems".
The point being, switching from CPython to PyPy does not simply give you the performance boost and that's it. It comes with its own tradeoffs, including:
(a) trading speed for much more memory consumption
(b) slower C FFI - important for all kinds of Python workflows (e.g. Pandas, Numpy, and so on)
(c) behind mainline CPython releases, and with subtle incompatibilities
(d) slower startup times (due to the JITting involved)
(e) different garbage collection model (and less predictable)
(f) less support (from companies, distros, etc), fewer ports, less manpower to port quickly to new platforms (e.g. Apple's M1).
Pypy is not a drop-in replacement for CPython. It does not support many libraries that rely on C extensions, and targets a slightly older version of the language.
Assuming pypy does what it says it does: Python -> RPython -> C -> LLVM Clang -> LLVM IR -> 'JIT', you're still paying a large tax compared to something like RPython -> Some IR -> 'JIT'
For what it's worth, Python 3.10 will add -fno-semantic-interposition when built with --enable-optimizations by default. However, the slowdown only affects Pythons built with --enable-shared anyway.
The issue is discussed here [0]. I also attempted to backport the fix to the official Python Docker image [1], but wasn't able to get much traction. I wish this would land since everyone using these images would get an instant speedup.
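For anyone building CPython themselves in the meantime, the flags look roughly like this (a sketch, not an official recipe; on 3.10+ --enable-optimizations adds -fno-semantic-interposition on its own, and the rpath is only needed so a shared build can find its own libpython):

```shell
# Sketch: a shared CPython build with semantic interposition disabled.
./configure --enable-shared --enable-optimizations \
    CFLAGS="-fno-semantic-interposition" \
    LDFLAGS="-fno-semantic-interposition -Wl,-rpath,/usr/local/lib"
make -j"$(nproc)"
make altinstall
```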
> However, the slowdown only affects Pythons built with --enable-shared anyway.
On Gentoo, the ebuilds for both Python and Ruby have --enable-shared hardcoded.
Also, if you use prebuilt Ruby binaries from RVM, those were also built with --enable-shared. And if you build your own Ruby binaries with ruby-build, it also does --enable-shared.
So, there are still plenty of opportunities around for instant speedup.
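You can check whether a given Python paid this tax; a quick sketch (`Py_ENABLE_SHARED` is the sysconfig variable CPython sets from --enable-shared):

```shell
# Prints 1 for a --enable-shared build, 0 for a static one.
python3 -c 'import sysconfig; print(sysconfig.get_config_var("Py_ENABLE_SHARED"))'

# Confirm what the binary actually links at runtime.
ldd "$(command -v python3)" | grep libpython || echo "libpython not dynamically linked"
```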
> This article focuses on one specific performance improvement in the python38 package. As we'll explain, Python 3.8 is built with the GNU Compiler Collection (GCC)'s -fno-semantic-interposition flag. Enabling this flag disables semantic interposition, which can increase run speed by as much as 30%.
(not logged in to FB, so maybe TFA is a reference to this one?)
The author is confused. -fno-semantic-interposition does not preclude the uniqueness of symbols. There are things like -Wl,-Bsymbolic-functions to relinquish that (and lose strict conformance to the programming language you are using, by the way). When I read a virulent post, at least I would like the author to have some deep expertise in the subject, and not mix up everything they have read (including related but different matters). I won't even talk about the "interposition is useless anyway because clang has a historical bug" part.
As for elf shared libs providing a single symbol space, there are arguments for and against.
An argument for a single symbol space is that a process with a segregated one per dynamic module is a beast the C and C++ standards know nothing about. And don't even get me started on processes with multiple different C or C++ runtimes loaded simultaneously (take a look at the list of CRTs loaded in an instance of explorer.exe; it is horrifying). Another is that you can easily move things around between dynamic libs, split them, etc. If various subsets are actually used by multiple executables, this can be quite useful (lacking that property, note that MS had to invent their own additional virtual/redirection layer to refactor the Win32 libs).
A practical argument against is that it is "hard" (read: virtually impossible) to dynamically link binary-only modules maintained with different-enough toolchains. Likewise for different versions of the same lib (e.g. via transitive deps). But then the question of whether this is even a good idea should be asked (well, if you want to load a plugin into a proprietary piece of software I understand this can be wanted -- and that is actually similar to using the libs of a proprietary platform; any other use case?)
Summary: Python is 1.3x faster when compiled in a way that re-examines shitty technical decisions from the 1990s.
ELF is the executable and shared library format on Linux and other Unixy systems. It comes to us from 1992's Solaris 2.0, from back before even the first season of the X-Files aired. ELF files (like X-Files) are full of barely-understood horrors described only in dusty old documents that nobody reads. If you don't know anything about symbol visibility, semantic interposition, relocations, the PLT, and the GOT, ELF will eat your program's performance. (Granted, that's better than being eaten by some monster from a secret underground government base.)
ELF kills performance because it tries too hard to make the new-in-1992 world of dynamic linking look and act like the old world of static linking. ELF goes to tremendous lengths to make sure that every reference to a function or a variable throughout a process refers to the same function or variable no matter what shared library contains each reference. Everything is consistent.
This approach is clean, elegant, and wrong: the cost of maintaining this ridiculous bijection between symbol name and symbol address is that each reference to a function or variable needs to go through a table of pointers that the dynamic linker maintains --- even when the reference is one function in a shared library calling another function in the same shared library. Yes, `mylibrary_foo()` in `libmylibrary.so` has to pay for the equivalent of a virtual function call every time it calls `mylibrary_bar()` just in case some other shared library loaded earlier happened to provide a different `mylibrary_bar()`. That basically never happens. (Weak symbols are an exception, but that's a subject for a different rant.)
(Windows took a different approach and got it right. In Windows, it's okay for multiple DLLs to provide the same symbol, and there's no sad and desperate effort to pretend that a single namespace is still cool.)
There's basically one case where anyone actually relies on this ELF table lookup stuff (called "interposition"): `LD_PRELOAD`. `LD_PRELOAD` lets you provide your own implementation of any function in a program by pre-loading a shared library containing that function before a program starts. If your `LD_PRELOAD`ed library provides a `mylibrary_bar()`, the ELF table lookup goo will make sure that `mylibrary_foo()` calls your `LD_PRELOAD`ed `mylibrary_bar()` instead of the one in your program. It's nice and dynamic, right? In exchange for every program on earth being massively slower than it has to be all the time, you, programmer, can replace `mylibrary_bar()` with `printf("XXX calling bar!!!")` by setting an environment variable. Good trade-off, right?
LOL. There is no trade-off. You don't get to choose between performance and flexibility. You don't get to choose one. You get to choose zero things. Interposition has been broken for years: a certain non-GNU upstart compiler starting with "c" has been committing the unforgivable sin of optimizing calls between functions in the same shared library. Clang will inline that call from `mylibrary_foo()` to `mylibrary_bar()`, ELF be damned, and it's right to do so, because interposition is ridiculous and stupid and optimizes for c00l l1inker tr1ckz over the things people buy computers to actually do --- like render 314341 layers of nested iframe.
Still, this Clang thing does mean that `LD_PRELOAD` interposition no longer affects all calls, because Clang, contra the specification, will inline some calls to functions not marked inline --- which breaks some people's c00l l1inker tr1ckz. But we're all still paying the cost of PLT calls and GOT lookups anyway, all to support a feature (`LD_PRELOAD`) that doesn't even work reliably anymore, because, well, why change the defaults?
Eventually, someone working on Python (ironically, of all things) noticed this waste of good performance. "Let's tell the compiler to do what Clang does accidentally, but all the time, and on purpose". Python got 30% faster without having to touch a single line of code in the Python interpreter.
(This state of affairs is clearly evidence in favor of the software industry's assessment of its own intellectual prowess and justifies software people randomly commenting on things outside their alleged expertise.)
All programs should be built with `-Bsymbolic` and `-fno-semantic-interposition`. All symbols should be hidden by default. `LD_PRELOAD` still works in this mode, but only for calls _between_ shared libraries, not calls _inside_ shared libraries. One day, I hope as a profession we learn to change the default settings on our tools.
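For your own library, that advice maps to something like this sketch (names hypothetical):

```shell
cat > example.c <<'EOF'
/* With -fvisibility=hidden, only explicitly exported symbols are visible. */
__attribute__((visibility("default"))) int example_api(void) { return 42; }
int example_helper(void) { return 7; }  /* hidden: not exported, called directly */
EOF
gcc -O2 -fPIC -fvisibility=hidden -fno-semantic-interposition \
    -shared -Wl,-Bsymbolic -o libexample.so example.c
nm -D --defined-only libexample.so  # lists example_api but not example_helper
```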
Thank you. This link asked me to sign in on a very broken page (I block most of Facebook's domains), and I am wondering if this is just someone who posted it on FB or if it is a post from the engineering team at FB.
This has interesting parallels with how some languages include the library version in the "symbolic name" (mangled name, fully qualified name etc).
This often allows loading multiple versions of the same dependency in the same program without ugly hacks. Which is great if you have multiple dependencies that share the same sub-dependency (each internal only to its dependent) but need different versions of it.
It's kinda a nightmare if you run into this problem in languages which don't support it.
Completely beside the article - the first specializing, adaptive interpreter (PEP 659) improvements have been merged to CPython these last weeks, and hopefully we can see updates about benchmarks and performance sooner or later.
Leaving aside whether or not they should want to post it there, I'm surprised it has an audience.
Someone saw it and shared it to HN; enough read it to upvote it this much.. maybe Facebook's more popular than I thought! (That sounds silly or sarcastic, but 'among HN users and similar' I'm serious.)
I did the first time around, but then I closed the tab, and when I wanted to go back to look at something in more detail I was blocked unless I signed in. Luckily someone posted the full text in another comment.
> Eventually, someone working on Python (ironically, of all things) noticed this waste of good performance
But it would be good to know when & what versions.
I'm also not sure why this is "ironic". Who else but the experts on python would be more likely to discover this & resolve the issue? Which basically makes the whole thing a non-issue:
Python creators made a choice when creating python. A while later they realized they could improve performance by revisiting that choice.
The tone of the article makes it sound like this was an embarrassing mistake of massive proportions.
> Python creators made a choice when creating python. The tone of the article makes it sound like this was an embarrassing mistake of massive proportions.
The article is talking about a bad decision in ELF and dynamic linking, not in Python specifically. The Python people just discovered that disabling that default behavior was useful.
I think the author meant ‘ironic’ due to the very stereotypical view of “Python is slow, why would it care about performance?” At least I read it that way.
It looks like the mod_wsgi / Apache stack I've used for years does indeed use the shared libpython described here (according to running ldd on mod_wsgi.so). So this compiler flag could have saved me 30% of a server - or saved my users 23% of the server-processing latency of their TTFBs.
Though my servers are already overprovisioned, and latency is probably more due to memory / IO and... oh nevermind.
Kudos to the python devs, looking forward to more speed improvements.
Not entirely related but I recently started playing with the built in "dis" library and it is fun to see the compiled representation of functions that the runtime executes. Just an FYI if you're ever bored and are looking to get more familiar with assembly, it is a very approachable thing to play with.
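For example, a sketch (the exact opcodes vary by Python version):

```python
import dis

def add(a, b):
    return a + b

# Show the bytecode the interpreter executes for add().
dis.dis(add)

# Or inspect it programmatically.
ops = [instr.opname for instr in dis.Bytecode(add)]
print(ops)  # e.g. ['RESUME', 'LOAD_FAST', 'LOAD_FAST', 'BINARY_OP', 'RETURN_VALUE'] on 3.11
```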
Doesn’t gVisor require symbol interposition to do its sandboxing thing? (At least, for binaries with static-linked runtimes, like the type Golang produces by default.)
Symbol interposition? I don't know for sure, but I would guess gVisor is using ptrace or another mechanism, to interpose on syscalls, not library calls. But these flags, I believe, only impact interposition of symbols in the same library, so even if gVisor did use interposition for something, it may not matter.
tldr;
I could not replicate speed benefits except for heavy stack usage. Python 3.10 is 8% slower than python 3.8. Python 3.10 with said optimization is 3% slower overall than Python 3.10 without it except for stack usage.
Python 3.8
python-speed v1.2 using python v3.8.5
string/mem: 1476.42 ms
pi calc/math: 1817.37 ms
regex: 1984.63 ms
fibonnaci/stack: 1085.79 ms
total: 6364.21 ms (lower is better)
Python 3.10 no optimization
python-speed v1.2 using python v3.10.0
string/mem: 1580.52 ms
pi calc/math: 1796.23 ms
regex: 2110.86 ms
fibonnaci/stack: 1337.18 ms
total: 6824.79 ms (lower is better)
Python 3.10 with optimization
python-speed v1.2 using python v3.10.0
string/mem: 1559.45 ms
pi calc/math: 1821.25 ms
regex: 2387.64 ms
fibonnaci/stack: 1299.8 ms
total: 7068.13 ms (lower is better)
Python is like the cockroach equivalent of those shell scripting languages that came out of the late 80s to early 90s.
Perl, Ruby, PHP, Tcl, and Lua have definitely declined over the years. Python's biggest asset seems to be its feature-rich libraries, rather than the language itself.
Ruby is a much better designed language than Python but the community unfortunately made the mistake of focusing too much on Rails and web development at the expense of diversification. Python is the Skoda of scripting languages.
{}, [], and / \ are all cumbersome to use on many keyboards because bankers got to them before programmers. Rather than solving the problem in hardware by getting an en-US keyboard, we mitigate the RSI in software by choosing languages with less notation.
What does it mean in your country? For me, its a company priced out by carmakers who manufactured locally, and screwed over by dealers who didn't service cars properly. Ex Fabia owner from India.
Keep in mind that many of the complaints in this thread were posted in the context of that original URL.