pr: don't convert to String when storing lines to print by venoosoo · Pull Request #11327 · uutils/coreutils

venoosoo · 2026-03-14T15:44:10Z

Changed FileLine.line_content from String to Vec<u8> to avoid
unnecessary byte → String → byte conversions.

Changes:

FileLine.line_content is now Vec<u8>
apply_expand_tab now takes &mut Vec<u8> instead of &mut String
from_buf no longer needs to return Result since UTF-8 validation
is no longer needed
get_pages also no longer needs to return Result as a cascade effect
get_line_for_printing uses String::from_utf8_lossy only at the
formatting stage

Note: test_dd::test_iso8859_1_case_conversion fails but is pre-existing
and unrelated to this change.

github-actions · 2026-03-14T15:57:57Z

GNU testsuite comparison:

Note: The gnu test tests/tail/tail-n0f is now being skipped but was previously passing.

venoosoo · 2026-03-14T16:52:26Z

the error on the test seems infrastructure issue unrelated to this change — the VM fails at SSH connection. Style and lint checks passed

cakebaker · 2026-03-15T14:13:53Z

src/uu/pr/src/pr.rs

    let formatted_line_number = get_formatted_line_number(options, file_line.line_number, index);

-    let mut complete_line = format!("{formatted_line_number}{}", file_line.line_content);
+    let content = String::from_utf8_lossy(&file_line.line_content);


While this is an improvement to the previous code, it's not entirely correct. GNU pr also prints non-UTF8 characters.

You can see the difference with a tool like hexdump:

$ printf $'\xFFfoo' | cargo run -q pr | hexdump -C [snipped] * 00000040 20 20 20 20 50 61 67 65 20 31 0a 0a 0a ef bf bd | Page 1......| 00000050 66 6f 6f 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a |foo.............| 00000060 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a |................| * 00000090 $ printf $'\xFFfoo' | pr | hexdump -C [snipped] * 00000040 20 20 20 20 50 61 67 65 20 31 0a 0a 0a ff 66 6f | Page 1....fo| 00000050 6f 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a |o...............| 00000060 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a |................| * 00000080 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a 0a |..............| 0000008e

My suggestion is to add a todo that support for non-UTF8 is not implemented yet.

cakebaker · 2026-03-15T14:16:25Z

src/uu/pr/src/pr.rs

 /// # Errors
 ///
 /// Returns an error if the bytes are not a valid UTF-8 string.


A detail: this part of the comment is no longer valid and can be removed.

github-actions · 2026-03-15T17:06:15Z

GNU testsuite comparison:

GNU test failed: tests/misc/io-errors. tests/misc/io-errors is passing on 'main'. Maybe you have to rebase?
Skip an intermittent issue tests/cut/bounded-memory (fails in this run but passes in the 'main' branch)
Skip an intermittent issue tests/date/date-locale-hour (fails in this run but passes in the 'main' branch)
Skipping an intermittent issue tests/pr/bounded-memory (passes in this run but fails in the 'main' branch)
Note: The gnu test tests/rm/many-dir-entries-vs-OOM is now being skipped but was previously passing.

github-actions · 2026-03-15T23:06:01Z

GNU testsuite comparison:

Skip an intermittent issue tests/date/date-locale-hour (fails in this run but passes in the 'main' branch)

cakebaker · 2026-03-16T07:21:44Z

Thanks for your PR!

venoosoo added 2 commits March 14, 2026 17:38

pr: don't convert to String when storing lines to print

41affdf

pr: apply cargo fmt

d15ba7a

cakebaker reviewed Mar 15, 2026

View reviewed changes

pr: add todo for non-UTF-8 support

8a686cf

pr: cargo fmt fix

a14d17a

cakebaker merged commit bc58d3d into uutils:main Mar 16, 2026
160 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

pr: don't convert to String when storing lines to print#11327

pr: don't convert to String when storing lines to print#11327
cakebaker merged 4 commits intouutils:mainfrom
venoosoo:fix-pr-string-to-bytes

venoosoo commented Mar 14, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Mar 14, 2026

Uh oh!

venoosoo commented Mar 14, 2026

Uh oh!

cakebaker Mar 15, 2026

Uh oh!

cakebaker Mar 15, 2026

Uh oh!

github-actions bot commented Mar 15, 2026

Uh oh!

github-actions bot commented Mar 15, 2026

Uh oh!

Uh oh!

cakebaker commented Mar 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

venoosoo commented Mar 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Mar 14, 2026

Uh oh!

venoosoo commented Mar 14, 2026

Uh oh!

cakebaker Mar 15, 2026

Choose a reason for hiding this comment

Uh oh!

cakebaker Mar 15, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Mar 15, 2026

Uh oh!

github-actions bot commented Mar 15, 2026

Uh oh!

Uh oh!

cakebaker commented Mar 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

venoosoo commented Mar 14, 2026 •

edited

Loading