Change Utf8.encodedLength to just encode and check length on server (unsafeprocessor) #24363

copybara-service · 2025-11-07T22:37:26Z

Change Utf8.encodedLength to just encode and check length on server (unsafeprocessor)

Maintain the current loop behavior on mobile (safeprocessor).

Encoding conceptually does a lot more work (both computationally and an allocation) than needed to simply determine how many bytes it should take to encode in Utf8 string. However, JDK has privilege to access string internals as byte[], which enables it to implement the getBytes() method including encoding faster than any other way to determine the byte length that we are able to write via read loop.

Several alternatives were benchmarked, including various alternate loops, other JDK APIs, but this version benchmarks as 10x faster on ascii strings, and 2x faster on most latin1 and higher unicode codepoint strings (with some regression cases for mysterious reasons), the second best implementations benchmarked is what we have today, and other alternatives were slower than that.

…unsafeprocessor) Maintain the current loop behavior on mobile (safeprocessor). Encoding conceptually does a lot more work (both computationally and an allocation) than needed to simply determine how many bytes it should take to encode in Utf8 string. However, JDK has privilege to access string internals as byte[], which enables it to implement the getBytes() method including encoding faster than any other way to determine the byte length that we are able to write via read loop. Several alternatives were benchmarked, including various alternate loops, other JDK APIs, but this version benchmarks as 10x faster on ascii strings, and 2x faster on most latin1 and higher unicode codepoint strings (with some regression cases for mysterious reasons), the second best implementations benchmarked is what we have today, and other alternatives were slower than that. PiperOrigin-RevId: 825109983

github-actions · 2025-11-14T10:07:39Z

Auto-closing Copybara pull request

github-actions bot closed this Nov 14, 2025

github-actions bot deleted the test_825109983 branch November 14, 2025 10:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Change Utf8.encodedLength to just encode and check length on server (unsafeprocessor) #24363

Change Utf8.encodedLength to just encode and check length on server (unsafeprocessor) #24363

Uh oh!

copybara-service bot commented Nov 7, 2025

Uh oh!

github-actions bot commented Nov 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Change Utf8.encodedLength to just encode and check length on server (unsafeprocessor) #24363

Change Utf8.encodedLength to just encode and check length on server (unsafeprocessor) #24363

Uh oh!

Conversation

copybara-service bot commented Nov 7, 2025

Uh oh!

github-actions bot commented Nov 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants