`%read` in chunks for lseek()able files. by jpco · Pull Request #107 · wryun/es-shell

jpco · 2024-08-15T07:08:20Z

This is a performance optimization that's normally insignificant; occasionally good, in tight %read loops; and very rarely negative, when those tight %read loops work on a file with mostly zero-to-one character lines. Generally the performance improvement increases as line lengths increase, which shouldn't be a surprise. Files with pretty short lines, like /usr/share/dict/words, see a measurable improvement with tight %read loops, though.

In theory, it seems to me the overhead of the extra lseek() calls (which are the source of the negative effects, where they appear) could be reduced by caching the seekability of fds. Maybe just one such value, assuming performance-sensitive calls to %read are typically done repeatedly in a loop (and on a single fd).

This introduces an awkward performance asymmetry, sadly, between the cases of cat foo | the_script and the_script < foo , since stdin in the former case is non-seekable from a pipe, while in the latter case it's seekable.

I'm not married to this proposal. Just wanted to put it out there. I'm not very good at speedy code so there's very possibly a better way to do this than I've got here :)

jpco · 2025-04-12T15:38:19Z

Reopening this in light of having a more realistic "tight $&read loop" scenario with #178.

Still incomplete; failing test case illustrates the remaining bug

read in chunks for lseek()able files.

8f59efa

jpco closed this Oct 26, 2024

jpco reopened this Apr 12, 2025

jpco mentioned this pull request Apr 12, 2025

Up for discussion: User-provided commands for reading shell input #178

Draft

Merge branch 'master' into fastread

ca709f8

jpco force-pushed the fastread branch from 9469a2e to ca709f8 Compare April 12, 2025 16:20

jpco added 2 commits April 18, 2025 19:40

Merge remote-tracking branch 'upstream/master' into fastread

d0712d4

Add NUL awareness for seeking %read

69a38d2

Still incomplete; failing test case illustrates the remaining bug

jpco force-pushed the master branch 3 times, most recently from 64361a8 to a23a7a0 Compare September 19, 2025 00:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`%read` in chunks for lseek()able files.#107

`%read` in chunks for lseek()able files.#107
jpco wants to merge 4 commits intowryun:masterfrom
jpco:fastread

jpco commented Aug 15, 2024

Uh oh!

jpco commented Apr 12, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

jpco commented Aug 15, 2024

Uh oh!

jpco commented Apr 12, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant