perf: better string building #997

henryiii · 2025-11-27T06:54:16Z

This makes the __str__ method faster - about 6-7% less time to do str(Version(v)); probably about 25% faster for just the str operation. This is used quite a bit in SpecifierSet, it makes a small but measurable improvement there. It's larger than the speedup you get if the generator is replaced by map #996, though you can get close by using the := trick as well. This is still a bit faster, and I think it reads better.

Signed-off-by: Henry Schreiner <[email protected]>

henryiii · 2025-11-27T16:20:16Z

Almost all the savings here was from avoiding the NamedTuple indirection. This is now only 1% faster total time, probably 5% or something like that faster for the str operation. Saving the intermediate value doesn't have any measurable effect anymore, so I've remove that.

Now it's mostly up to if you think this looks better (and it still is a little faster).

brettcannon · 2025-11-27T18:05:43Z

Now it's mostly up to if you think this looks better

I'm indifferent.

notatallshaw · 2025-11-27T18:56:39Z

"".join(...) reads better to me because I've written that pattern so often in Python, but that's just anecdotal.

I was also under the impression, apparently incorrectly, that join would be faster. Because naively concatenating strings can be O(n^2) with regards to memory allocation operations, and I thought the .join method had some kind of optimization to handle that. Maybe this is just too few concatenations with too small strings where the memory allocations become a dominating factor.

henryiii · 2025-11-27T20:30:29Z

I'm nearly sure it's the fact the strings are generally small. If they were large I'm almost sure it would be the other way around.

I played around with several ways to do this - I thought making all four separately then using an f-string to join them would be fastest, but short circuiting if None was too important. Now that the largest cost (accessing the nested field in the NamedTuple) is gone, it's possible that is faster.

brettcannon · 2025-11-28T17:38:30Z

I was also under the impression, apparently incorrectly, that join would be faster. Because naively concatenating strings can be O(n^2) with regards to memory allocation operations, and I thought the .join method had some kind of optimization to handle that. Maybe this is just too few concatenations with too small strings where the memory allocations become a dominating factor.

There's an optimization in CPython specifically for += in a loop. So you're right that str.join() should be faster, but we cheated in CPython. 😁

henryiii force-pushed the henryiii/perf/str branch 2 times, most recently from 52925ec to 267b5d9 Compare November 27, 2025 15:31

perf: better string building

e584ec3

Signed-off-by: Henry Schreiner <[email protected]>

henryiii force-pushed the henryiii/perf/str branch from 267b5d9 to e584ec3 Compare November 27, 2025 16:14

chore: remove saving intermediate (no longer much faster)

92b9674

Signed-off-by: Henry Schreiner <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

perf: better string building #997

perf: better string building #997

henryiii commented Nov 27, 2025 •

edited

Loading

Uh oh!

henryiii commented Nov 27, 2025 •

edited

Loading

Uh oh!

brettcannon commented Nov 27, 2025

Uh oh!

notatallshaw commented Nov 27, 2025

Uh oh!

henryiii commented Nov 27, 2025

Uh oh!

brettcannon commented Nov 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

perf: better string building #997

Are you sure you want to change the base?

perf: better string building #997

Conversation

henryiii commented Nov 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

henryiii commented Nov 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

brettcannon commented Nov 27, 2025

Uh oh!

notatallshaw commented Nov 27, 2025

Uh oh!

henryiii commented Nov 27, 2025

Uh oh!

brettcannon commented Nov 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

henryiii commented Nov 27, 2025 •

edited

Loading

henryiii commented Nov 27, 2025 •

edited

Loading