fix(rs-dapi,sdk): decode base64 CBOR error messages and detect duplicate identity keys by lklimek · Pull Request #3350 · dashpay/platform

lklimek · 2026-03-16T15:32:51Z

Issue being fixed or feature implemented

When Tenderdash returns errors with message: "Internal error", rs-dapi falls through to the data field. Tenderdash wraps Drive's CBOR error payload in the JSON-RPC data field as a base64 string, but rs-dapi stored it raw — producing unintelligible gRPC status messages like oWdtZXNzYWdleMVzdG9yYWdl... on the wire.

Related: dashpay/dash-evo-tool#766 (DET workaround using message-text matching)

Evidence from DET production logs

144 gRPC errors across all DET logs. Four patterns:

Pattern	Count	gRPC Code	`drive-error-data-bin`	`dash-serialized-consensus-error-bin`	`consensus_error` in CBOR
Duplicate key (non-unique set)	22	Internal (13)	✅	❌	null
Duplicate key (unique set)	77	Internal (13)	✅	❌	null
Consensus violation (DPP 10530)	2	InvalidArgument	✅	✅	present
TCP timeout	43	Unavailable	❌	❌	N/A

The majority of errors (99/144) are GroveDB duplicate-key errors during identity create/update where the actual human-readable message was buried in double-encoded CBOR. The 2 consensus errors (DPP 10530) already work correctly via the existing dash-serialized-consensus-error-bin path.

What was done?

Added decode_data_message() helper that tries to decode the data field as base64 → CBOR → extract human-readable message text. If decoding fails (plain text data), the raw string is preserved — no regression for existing behavior.

Reuses existing base64_decode() and walk_cbor_for_key() helpers. 6 new tests including 2 real-world DET log fixtures.

How Has This Been Tested?

Automated

cargo test -p rs-dapi --lib error_mapping — all 35 tests pass (29 existing + 6 new)
cargo clippy -p rs-dapi — clean
cargo fmt -p rs-dapi — clean
Real-world test fixtures extracted from DET production logs

Manual integration test

To reproduce with a real network:

Start a local devnet or connect to testnet via DET
Create an identity with IdentityCreate state transition
Attempt to create another identity reusing the same public key hash
Before fix: SDK receives gRPC Internal error with message oWdtZXNzYWdleMVzdG9yYWdl... (base64 CBOR gibberish)
After fix: SDK receives gRPC Internal error with message storage: identity: a unique key with that hash already exists: the key already exists in the unique set [...]

Alternatively, use the raw Tenderdash JSON-RPC fixture:

{
  "code": 13,
  "message": "Internal error",
  "data": "oWdtZXNzYWdleMVzdG9yYWdlOiBpZGVudGl0eTogYSB1bmlxdWUga2V5IHdpdGggdGhhdCBoYXNoIGFscmVhZHkgZXhpc3RzOiB0aGUga2V5IGFscmVhZHkgZXhpc3RzIGluIHRoZSBub24gdW5pcXVlIHNldCBbMTM1LCAyMDIsIDE3MiwgNTMsIDE3NiwgNDUsIDE5MSwgMjcsIDUwLCAxMiwgNTAsIDIxNSwgNjUsIDEyNCwgMTQ3LCAzLCAyMDgsIDYsIDIyNiwgMTUxXQ=="
}

Breaking Changes

None. Plain text data fields are preserved as-is. Only base64-encoded CBOR with a message key is decoded.

Checklist:

I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have added or updated relevant unit/integration/functional/e2e tests
I have added "!" to the title and described breaking changes in the corresponding section if my code contains any
I have made corresponding changes to the documentation if needed (N/A)

For repository code-owners and collaborators only

I have assigned this pull request to a milestone

_{🤖 Co-authored by Claudius the Magnificent AI Agent}

Summary by CodeRabbit

Bug Fixes
- Better extraction of embedded error messages (including encoded payloads), preserving plain text when decoding fails for clearer, more accurate messages.
- Drive internal failures are now surfaced with specific, human-readable error text instead of generic failures; existing conflict detection remains intact.
Tests
- Added unit and integration tests covering extraction and classification of embedded error messages.

coderabbitai · 2026-03-16T15:32:59Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: bda13009-5201-4754-b6ba-ca4db245d09e

📥 Commits

Reviewing files that changed from the base of the PR and between c27a959 and e32635f.

📒 Files selected for processing (1)

packages/rs-sdk/src/error.rs

🚧 Files skipped from review as they are similar to previous changes (1)

packages/rs-sdk/src/error.rs

📝 Walkthrough

Walkthrough

Extracts embedded CBOR/base64 "message" fields from Tenderdash JSON data and gRPC drive-error-data-bin metadata; adds decoding helpers, updates error mapping to prefer decoded messages, and adds unit/integration tests.

Changes

Cohort / File(s)	Summary
Platform service error mapping `packages/rs-dapi/src/services/platform_service/error_mapping.rs`	Added private `decode_data_message(data: &str) -> Option<String>` to permissively base64-decode and CBOR-decode `data`, extracting a top-level `"message"`. Updated `impl From<serde_json::Value> for TenderdashStatus` to use decoded message when top-level `"message"` is missing/empty or equals `"Internal error"` (case-insensitive), falling back to raw `data` if decoding fails. Added unit and integration tests covering plain text, base64→CBOR with/without `"message"`, and real DET fixtures.
Dapi client gRPC error handling `packages/rs-sdk/src/error.rs`	Added enum variant `Error::DriveInternalError(String)`. Introduced private helper `extract_drive_error_message(bytes: &[u8]) -> Option<String>` to CBOR-decode `drive-error-data-bin` metadata and extract a non-empty `"message"`. Extended `From<DapiClientError> for Error` to inspect `Code::Internal` metadata and return `Error::DriveInternalError(message)` when present; otherwise preserved existing mapping and fallbacks.

Sequence Diagram(s)

sequenceDiagram
    participant Client
    participant DapiJSON as Tenderdash JSON handler
    participant DapiGRPC as DapiClient error converter
    participant Metadata as gRPC Metadata / JSON "data"
    participant Decoder as CBOR/Base64 Decoder
    participant ErrorEnum as Error Mapper

    Client->>DapiJSON: receive JSON error (contains "message" and/or "data")
    DapiJSON->>Metadata: inspect top-level "message" and "data"
    alt top-level "message" missing/empty or "Internal error"
        DapiJSON->>Decoder: attempt permissive base64 -> CBOR decode of "data"
        Decoder-->>DapiJSON: optional "message" string
        DapiJSON->>ErrorEnum: map using decoded message or fallback raw "data"
    else top-level "message" present
        DapiJSON->>ErrorEnum: map using top-level "message"
    end

    Client->>DapiGRPC: receive gRPC error (Status + metadata)
    DapiGRPC->>Metadata: inspect `drive-error-data-bin` when Code::Internal
    alt metadata present
        DapiGRPC->>Decoder: CBOR-decode metadata bytes
        Decoder-->>DapiGRPC: optional "message" string
        DapiGRPC->>ErrorEnum: return Error::DriveInternalError(message) if found
    else
        DapiGRPC->>ErrorEnum: continue existing mapping or fallback
    end

    ErrorEnum-->>Client: mapped Error

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Poem

🐰 I munched through padded Base64 and CBOR peel,

found tiny words the bytes would try to conceal.
Now errors hop out with a clear little tune,
guiding my paws by the light of the moon. 🥕

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Title check	⚠️ Warning	The PR title refers to decoding base64 CBOR error messages, which is the primary change in rs-dapi/error_mapping.rs. However, it also claims to 'detect duplicate identity keys', which is not mentioned in the raw_summary of actual changes.	Update the title to remove 'detect duplicate identity keys' or ensure this feature is documented in the change summary. The title should accurately reflect only the changes present in the PR.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch fix/sdk-cbor-security-hardening

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

Copilot

Pull request overview

Improves dash-sdk’s error conversion so callers can recover structured ConsensusError information even when DAPI does not provide the primary dash-serialized-consensus-error-bin metadata, by falling back to decoding drive-error-data-bin.

Changes:

Add a fallback path in From<DapiClientError> for Error to parse CBOR drive-error-data-bin and extract embedded consensus_error bytes.
Add explicit defensive limits (8 KiB max metadata size, CBOR recursion limit 16) before deserializing untrusted metadata.
Add unit tests covering the fallback behavior and precedence.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

packages/rs-sdk/src/error.rs

When Tenderdash returns an error with `message: "Internal error"`, rs-dapi falls through to the `data` field. Previously, the raw base64 string was stored as the message, producing unintelligible gRPC status messages like "oWdtZXNzYWdleMVzdG9yYWdl...". Now `decode_data_message()` tries to decode the `data` field as base64 → CBOR and extract the human-readable `message` text. If decoding fails (plain text data), the raw string is preserved as before. This fixes the 99% case from production DET logs: GroveDB duplicate-key errors (code 13) where the actual message "storage: identity: a unique key with that hash already exists..." was buried in double-encoded CBOR. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Copilot

Pull request overview

Improves rs-dapi’s Tenderdash JSON-RPC error mapping by decoding base64-encoded CBOR stored in the Tenderdash data field (when the top-level message is unhelpful like "Internal error"), so gRPC clients receive human-readable error messages instead of base64 gibberish.

Changes:

Added decode_data_message() helper to decode data as base64 → CBOR and extract the "message" field.
Updated TenderdashStatus::from(serde_json::Value) to prefer decoded data.message when appropriate, while preserving prior fallback behavior.
Added unit tests for decoding behavior plus real-world DET log fixtures.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

packages/rs-dapi/src/services/platform_service/error_mapping.rs

codecov · 2026-03-18T13:41:15Z

Codecov Report

❌ Patch coverage is 95.20000% with 6 lines in your changes missing coverage. Please review.
✅ Project coverage is 79.94%. Comparing base (e398cd7) to head (e32635f).
⚠️ Report is 3 commits behind head on v3.1-dev.

Files with missing lines	Patch %	Lines
...api/src/services/platform_service/error_mapping.rs	96.69%	4 Missing ⚠️
packages/rs-sdk/src/error.rs	50.00%	2 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff              @@
##           v3.1-dev    #3350      +/-   ##
============================================
+ Coverage     79.93%   79.94%   +0.01%     
============================================
  Files          2861     2861              
  Lines        280230   280352     +122     
============================================
+ Hits         223993   224121     +128     
+ Misses        56237    56231       -6

Components	Coverage Δ
dpp	`71.82% <ø> (ø)`
drive	`82.30% <ø> (ø)`
drive-abci	`86.67% <ø> (ø)`
sdk	`36.56% <50.00%> (+<0.01%)`	⬆️
dapi-client	`79.06% <ø> (ø)`
platform-version	`∅ <ø> (∅)`
platform-value	`83.36% <ø> (ø)`
platform-wallet	`76.55% <ø> (ø)`
drive-proof-verifier	`55.26% <ø> (ø)`

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

thepastaclaw

Code Review

Clean, well-scoped PR that adds base64→CBOR decoding for the Tenderdash error.data field, with good test coverage including real-world DET log fixtures. The new decode_data_message() function follows existing patterns in the module, reuses base64_decode() and walk_cbor_for_key(), and gracefully falls back on any decode failure. No blocking issues. Two suggestions worth considering: the pre-existing is_ascii() filter silently drops non-ASCII UTF-8 data strings (contradicting the 'raw string preserved' claim), and the other Tenderdash error ingestion paths (wait_for_state_transition_result) still pass data through without attempting CBOR decoding.

Reviewed commit: 7ae8846

🟡 3 suggestion(s)

1 additional finding

🟡 suggestion: Other Tenderdash error paths still pass `data` through without CBOR decoding

packages/rs-dapi/src/services/platform_service/wait_for_state_transition_result.rs (lines 151-161)

The new decode_data_message() is only invoked inside impl From<serde_json::Value>, which handles JSON-RPC error responses. The wait_for_state_transition_result path constructs TenderdashStatus::new() directly from ABCI tx_result.data (lines 151–161, 243) without attempting CBOR decoding.

This may be intentional — the ABCI responses could use a different encoding format than JSON-RPC errors. But if they can also carry the same base64-CBOR payloads, clients would still receive unreadable strings on those paths. Worth investigating whether TenderdashStatus::new() should optionally run the same decoding on its message parameter.

🤖 Prompt for all review comments with AI agents

These findings are from an automated code review. Verify each finding against the current code and only fix it if needed.

In `packages/rs-dapi/src/services/platform_service/error_mapping.rs`:
- [SUGGESTION] lines 313-314: Pre-existing `is_ascii()` filter silently discards non-ASCII UTF-8 data
  The `.filter(|s| s.is_ascii())` on line 313 (pre-existing, not introduced by this PR) means that if Tenderdash ever returns a valid UTF-8 `data` string containing non-ASCII characters (e.g. localized or user-provided error text), it gets silently dropped — `message` becomes `None` and later code falls back to a generic message instead of preserving the server detail.

`serde_json::Value::as_str()` already guarantees well-formed UTF-8, and `base64_decode()` will simply return `None` for non-base64 input, so the `is_ascii()` guard provides no safety benefit here. Removing it would make the "raw string preserved on decode failure" claim fully accurate.

Since this PR is already touching this exact code path, it would be a good opportunity to fix this.
- [SUGGESTION] lines 830-841: No integration test for valid-base64-but-not-CBOR data field
  The existing `from_json_value_preserves_plain_text_data` test uses text that fails at the base64 step. Adding a test for a `data` field that base64-decodes successfully but is NOT valid CBOR would confirm the end-to-end fallback through the full `From<serde_json::Value>` conversion, not just the unit function.

In `packages/rs-dapi/src/services/platform_service/wait_for_state_transition_result.rs`:
- [SUGGESTION] lines 151-161: Other Tenderdash error paths still pass `data` through without CBOR decoding
  The new `decode_data_message()` is only invoked inside `impl From<serde_json::Value>`, which handles JSON-RPC error responses. The `wait_for_state_transition_result` path constructs `TenderdashStatus::new()` directly from ABCI `tx_result.data` (lines 151–161, 243) without attempting CBOR decoding.

This may be intentional — the ABCI responses could use a different encoding format than JSON-RPC errors. But if they can also carry the same base64-CBOR payloads, clients would still receive unreadable strings on those paths. Worth investigating whether `TenderdashStatus::new()` should optionally run the same decoding on its `message` parameter.

packages/rs-dapi/src/services/platform_service/error_mapping.rs

- T4: Use non-logging base64 decode in decode_data_message() to avoid noisy debug logs when data field is plain text (expected failure) - T5: Fix misleading test comment — decode_data_message returns None, caller falls back to raw string - T6: Remove pre-existing .filter(|s| s.is_ascii()) that silently discarded valid non-ASCII UTF-8 data strings - T7: Add test for valid-base64-but-not-CBOR data field fallback Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

thepastaclaw

Incremental Review

Single-commit push (25de8d2) addressing 2 of 3 prior suggestions. Changes are minimal, correct, and well-tested:

is_ascii() filter removed ✅ — resolves the pre-existing filter that silently dropped non-ASCII UTF-8 data strings. The decode_data_message() fallback now correctly handles all valid UTF-8.
base64-non-CBOR integration test added ✅ — new from_json_value_preserves_base64_non_cbor_data confirms end-to-end fallback for data that is valid base64 but not CBOR.
Inline base64 engine — good call avoiding base64_decode() debug logging for expected failures on plain-text data fields. The inline GeneralPurpose engine keeps the same decode config.

All 36 tests pass (cargo test -p rs-dapi --lib error_mapping). No new findings.

Reviewed commit: 25de8d2

…-bin metadata When Drive returns a raw GroveDB storage error for a duplicate identity key (without a serialized ConsensusError), the SDK now checks the drive-error-data-bin gRPC metadata for CBOR-encoded error details. If the decoded message matches the identity key uniqueness pattern, the error is surfaced as Error::AlreadyExists instead of falling through to the opaque Error::DapiClientError. Also adds test fixtures for both the DAPI-level CBOR data field decoding and the SDK-level drive-error-data-bin handling. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

coderabbitai

🧹 Nitpick comments (1)

packages/rs-sdk/src/error.rs (1)

193-194: Make duplicate-key matching resilient to casing changes.

Line 193 and Line 194 use case-sensitive contains(...). If upstream wording changes capitalization, this mapping silently falls back to Error::DapiClientError(_).

♻️ Suggested patch

-                        if message.contains("unique key") && message.contains("already exists") {
+                        let normalized = message.to_ascii_lowercase();
+                        if normalized.contains("unique key")
+                            && normalized.contains("already exists")
+                        {
                             return Self::AlreadyExists(message);
                         }

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@packages/rs-sdk/src/error.rs` around lines 193 - 194, The duplicate-key
detection is using case-sensitive contains on the error message, so
capitalization changes cause falls back to DapiClientError; update the matching
in the function that returns Self::AlreadyExists (the branch that currently does
if message.contains("unique key") && message.contains("already exists")) to
perform case-insensitive matching — e.g. normalize the message with
message.to_lowercase() (or use a case-insensitive regex) and then check for
"unique key" and "already exists", ensuring the same Self::AlreadyExists variant
is returned when those substrings appear in any casing.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@packages/rs-sdk/src/error.rs`:
- Around line 193-194: The duplicate-key detection is using case-sensitive
contains on the error message, so capitalization changes cause falls back to
DapiClientError; update the matching in the function that returns
Self::AlreadyExists (the branch that currently does if message.contains("unique
key") && message.contains("already exists")) to perform case-insensitive
matching — e.g. normalize the message with message.to_lowercase() (or use a
case-insensitive regex) and then check for "unique key" and "already exists",
ensuring the same Self::AlreadyExists variant is returned when those substrings
appear in any casing.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 557e7cca-577e-488e-8ca0-2ba378c1e050

📥 Commits

Reviewing files that changed from the base of the PR and between 25de8d2 and b18c173.

📒 Files selected for processing (2)

packages/rs-dapi/src/services/platform_service/error_mapping.rs
packages/rs-sdk/src/error.rs

github-actions · 2026-03-20T09:57:20Z

✅ DashSDKFFI.xcframework built for this PR.

Workflow run: https://github.com/dashpay/platform/actions/runs/23336881782
Artifacts: DashSDKFFI.xcframework (folder), DashSDKFFI.xcframework.zip, xc_checksum.txt

SwiftPM (host the zip at a stable URL, then use):

.binaryTarget(
  name: "DashSDKFFI",
  url: "https://your.cdn.example/DashSDKFFI.xcframework.zip",
  checksum: "96246283cc6ebcde72308f6665fa756e3c8013a691f995e89692a97256b96a2b"
)

Xcode manual integration:

Download 'DashSDKFFI.xcframework' artifact from the run link above.
Drag it into your app target (Frameworks, Libraries & Embedded Content) and set Embed & Sign.
If using the Swift wrapper package, point its binaryTarget to the xcframework location or add the package and place the xcframework at the expected path.

thepastaclaw

Code Review

The new drive-error-data-bin detection code has a semantic conflict with existing recovery logic. When a duplicate identity key error (key belongs to a different identity) is mapped to Error::AlreadyExists, the Identity::wait_for_response() handler misinterprets it as 'this identity was already created', tries to fetch an identity that doesn't exist, and produces a misleading 'identity was proved to not exist but was said to exist' error. The helper function and tests are well-structured but this classification issue needs to be resolved before merging.

Reviewed commit: b18c173

🔴 1 blocking | 🟡 3 suggestion(s) | 💬 1 nitpick(s)

🤖 Prompt for all review comments with AI agents

These findings are from an automated code review. Verify each finding against the current code and only fix it if needed.

In `packages/rs-sdk/src/error.rs`:
- [BLOCKING] lines 190-198: Mapping duplicate-key errors to AlreadyExists triggers wrong recovery path in Identity::wait_for_response
  This new branch converts any `drive-error-data-bin` message containing 'unique key' and 'already exists' into `Error::AlreadyExists`. However, `Identity::wait_for_response()` in `waitable.rs:98-115` catches *every* `Error::AlreadyExists` and assumes the identity was previously created — it extracts the identity ID from the state transition and calls `Identity::fetch(sdk, identity_id)`. For a duplicate-key conflict the *key* belongs to a different identity, but the identity being created does not yet exist. The fetch returns `None` and the user receives the misleading error 'identity was proved to not exist but was said to exist', losing the actual duplicate-key cause entirely.

Either introduce a dedicated variant like `Error::DuplicateIdentityKey(String)` that is not caught by the existing `AlreadyExists` recovery path, or keep the error as `Error::DapiClientError` and let the caller inspect it. If `AlreadyExists` must be reused, update `waitable.rs` to distinguish between 'identity already exists' and 'key already exists' before attempting the fetch.
- [SUGGESTION] lines 193-195: Brittle string matching creates invisible coupling to Drive error format
  The pattern `message.contains("unique key") && message.contains("already exists")` matches against Drive/GroveDB's human-readable error text (`UniqueKeyAlreadyExists` display string). If GroveDB or Drive ever changes the wording — even a minor phrasing tweak — this detection silently breaks with no compile-time or test-time signal. A more durable approach would be a machine-readable error code in the CBOR payload (the `code` field is already present but not used for this classification).
- [SUGGESTION] lines 190-192: Silent error swallowing when to_bytes() fails — inconsistent with adjacent code
  When `drive_error_value.to_bytes()` returns `Err`, the failure is silently discarded via `if let Ok(bytes)`. The adjacent `consensus_error_value.to_bytes()` path at lines 181-185 logs at debug level and returns a `Generic` error. Adding a `tracing::debug!` log here would maintain consistency and aid debugging.
- [SUGGESTION] lines 190-198: Detection only covers UniqueKeyAlreadyExists, not IdentityKeyAlreadyExists
  Drive defines two duplicate-key error variants: `UniqueKeyAlreadyExists` (display: 'a unique key with that hash already exists: ...') and `IdentityKeyAlreadyExists` (display: 'identity key already exists for user error: ...'). The string heuristic only matches the first variant. The second variant has different semantics (same-identity duplicate vs. cross-identity conflict) so this may be intentional, but it's worth documenting the distinction.

packages/rs-sdk/src/error.rs

thepastaclaw · 2026-03-20T10:28:12Z

packages/rs-sdk/src/error.rs

+        #[test]
+        fn test_drive_error_data_bin_unrelated_message_falls_through() {
+            // When drive-error-data-bin contains a message that does NOT match
+            // the identity key pattern, the error should fall through to DapiClientError.
+            let mut cbor_buf = Vec::new();
+            ciborium::ser::into_writer(
+                &ciborium::Value::Map(vec![(
+                    ciborium::Value::Text("message".to_string()),
+                    ciborium::Value::Text("some other storage error".to_string()),
+                )]),
+                &mut cbor_buf,
+            )
+            .expect("CBOR serialize");
+
+            let mut metadata = MetadataMap::new();
+            metadata.insert_bin("drive-error-data-bin", MetadataValue::from_bytes(&cbor_buf));
+
+            let status =
+                dapi_grpc::tonic::Status::with_metadata(Code::Internal, "Internal error", metadata);
+
+            let error = DapiClientError::Transport(TransportError::Grpc(status));
+            let sdk_error = Error::from(error);
+
+            assert_matches!(sdk_error, Error::DapiClientError(_));
+        }


💬 Nitpick: No test for malformed CBOR in drive-error-data-bin

The code handles invalid CBOR gracefully (ciborium::de::from_reader returns Err, inspect_err logs, .ok()? returns None → falls through). An explicit test with random bytes in drive-error-data-bin would document this behavior and prevent regressions.

_{source: ['claude']}

…ror-data-bin metadata" This reverts commit b18c173.

…-bin metadata When a gRPC Internal status contains drive-error-data-bin metadata, extract the human-readable message from the CBOR payload and return Error::DriveInternalError instead of opaque Error::DapiClientError. This gives downstream consumers (like DET) a structured error variant for Drive-level failures that bypassed consensus validation. Related: #3396 (pre-validation gap for non-unique key set) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

thepastaclaw

Code Review

This push correctly addresses the prior blocking issue by replacing the error-prone AlreadyExists mapping with a dedicated DriveInternalError variant, which no longer triggers the wrong recovery path in waitable.rs. The rs-dapi decoding logic and test coverage are solid. One gap: the SDK-side DriveInternalError conversion path and extract_drive_error_message() helper have no unit tests — the prior push's tests were removed along with the string-matching logic.

Reviewed commit: c27a959

🟡 2 suggestion(s)

🤖 Prompt for all review comments with AI agents

These findings are from an automated code review. Verify each finding against the current code and only fix it if needed.

In `packages/rs-sdk/src/error.rs`:
- [SUGGESTION] lines 194-210: SDK-side DriveInternalError conversion has no unit tests
  The prior push had tests for the `drive-error-data-bin` handling path (`test_drive_error_data_bin_duplicate_identity_key` and `test_drive_error_data_bin_unrelated_message_falls_through`), but they were removed when the string-matching approach was replaced. The new `DriveInternalError` mapping and `extract_drive_error_message()` helper at lines 226-240 have zero test coverage in the SDK.

The rs-dapi side has thorough tests for `decode_data_message()`, but the SDK's `From<DapiClientError>` conversion through `drive-error-data-bin` metadata is a separate code path that deserves its own tests — particularly to verify that:
1. A gRPC `Internal` status with valid CBOR in `drive-error-data-bin` maps to `DriveInternalError`
2. A gRPC `Internal` status *without* `drive-error-data-bin` falls through to `DapiClientError`
3. Non-`Internal` status codes with `drive-error-data-bin` are NOT intercepted (the `Code::Internal` guard)
- [SUGGESTION] lines 198-204: Silent failure when to_bytes() fails on drive-error-data-bin metadata
  When `drive_error_value.to_bytes()` returns `Err`, the failure is silently discarded via `if let Ok(bytes)`. The adjacent consensus error path (lines 181-190) handles `to_bytes()` failure by logging at debug level and returning a `Generic` error. Adding a `tracing::debug!` log here would maintain consistency and aid debugging — even a `trace!` level would be better than complete silence.

packages/rs-sdk/src/error.rs

…ently discarding Addresses PR review feedback: the adjacent consensus error path logs at debug level on to_bytes() failure, but the drive-error-data-bin path silently swallowed the error. Now consistent. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Copilot

Pull request overview

Improves error propagation across rs-dapi → gRPC metadata → rs-sdk by decoding Tenderdash data payloads that are base64-wrapped CBOR and surfacing Drive internal error messages as a dedicated SDK error variant.

Changes:

Decode Tenderdash JSON-RPC data field as base64 → CBOR and extract a human-readable message when the top-level message is empty/“Internal error”.
Add DriveInternalError to the SDK error enum and parse drive-error-data-bin metadata on Internal gRPC responses to populate it.
Add unit tests in rs-dapi covering plain-text, base64-non-CBOR, and real-world fixtures.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

File	Description
packages/rs-sdk/src/error.rs	Adds `DriveInternalError` and extracts `message` from CBOR `drive-error-data-bin` metadata for `Internal` gRPC statuses.
packages/rs-dapi/src/services/platform_service/error_mapping.rs	Decodes Tenderdash `data` base64 CBOR to recover readable messages and adds tests for this behavior.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-23T09:58:36Z

packages/rs-sdk/src/error.rs

+    /// Drive returned an internal error that was not classified as a consensus
+    /// error. Contains the decoded human-readable message from the
+    /// `drive-error-data-bin` gRPC metadata (CBOR → message extraction).
+    ///
+    /// This typically indicates a storage-level error (e.g., GroveDB constraint
+    /// violation) that bypassed the consensus validation layer. If pre-validation
+    /// is working correctly, these should be rare.
+    #[error("Drive internal error: {0}")]
+    DriveInternalError(String),
+


Adding a new variant to the public Error enum (DriveInternalError) is a source-level breaking change for downstream crates that exhaustively match on dash_sdk::Error. If this crate is intended to preserve semver compatibility, consider marking the enum #[non_exhaustive] (and/or gating the new variant behind a feature) or updating release notes/versioning to reflect the breaking API change.

Copilot · 2026-03-23T09:58:36Z

packages/rs-sdk/src/error.rs

+            // Check drive-error-data-bin for decoded Drive error messages
+            if status.code() == Code::Internal {
+                if let Some(drive_error_value) = status.metadata().get_bin("drive-error-data-bin") {
+                    match drive_error_value.to_bytes() {
+                        Ok(bytes) => {
+                            if let Some(message) = extract_drive_error_message(&bytes) {
+                                return Self::DriveInternalError(message);
+                            }
+                        }
+                        Err(e) => {
+                            tracing::debug!(
+                                "Failed to decode drive-error-data-bin metadata: {}",
+                                e
+                            );
+                        }
+                    }
+                }
+            }


The new drive-error-data-bin parsing branch isn’t covered by unit tests in this module (there are already tests for consensus metadata and AlreadyExists). Please add a test that constructs a tonic::Status with Code::Internal plus drive-error-data-bin containing CBOR { "message": "..." }, and assert it maps to Error::DriveInternalError with the decoded message.

Copilot · 2026-03-23T09:58:37Z

packages/rs-dapi/src/services/platform_service/error_mapping.rs

+fn decode_data_message(data: &str) -> Option<String> {
+    // Silently try base64 — failure is expected for plain-text data fields,
+    // so we intentionally avoid `base64_decode()` which logs at debug level.
+    let decoded_bytes = engine::GeneralPurpose::new(
+        &base64::alphabet::STANDARD,
+        engine::GeneralPurposeConfig::new()
+            .with_decode_allow_trailing_bits(true)
+            .with_decode_padding_mode(engine::DecodePaddingMode::Indifferent),
+    )
+    .decode(data)
+    .ok()?;
+


decode_data_message duplicates the base64 engine configuration that already exists in base64_decode(). To avoid config drift and make future changes safer, consider reusing the same engine (e.g., a shared static/helper) while still keeping this path silent (no debug logging) for expected non-base64 inputs.

Suggested change

fn decode_data_message(data: &str) -> Option<String> {

// Silently try base64 — failure is expected for plain-text data fields,

// so we intentionally avoid `base64_decode()` which logs at debug level.

let decoded_bytes = engine::GeneralPurpose::new(

&base64::alphabet::STANDARD,

engine::GeneralPurposeConfig::new()

.with_decode_allow_trailing_bits(true)

.with_decode_padding_mode(engine::DecodePaddingMode::Indifferent),

)

.decode(data)

.ok()?;

/// Shared base64 engine configuration for Tenderdash error payloads.

/// This mirrors the configuration used by other base64 helpers, so that

/// future changes only need to be made in one place.

fn tenderdash_base64_engine() -> engine::GeneralPurpose {

engine::GeneralPurpose::new(

&base64::alphabet::STANDARD,

engine::GeneralPurposeConfig::new()

.with_decode_allow_trailing_bits(true)

.with_decode_padding_mode(engine::DecodePaddingMode::Indifferent),

)

}

fn decode_data_message(data: &str) -> Option<String> {

// Silently try base64 — failure is expected for plain-text data fields,

// so we intentionally avoid `base64_decode()` which logs at debug level.

let decoded_bytes = tenderdash_base64_engine().decode(data).ok()?;

github-actions bot added this to the v3.1.0 milestone Mar 16, 2026

lklimek requested a review from Copilot March 16, 2026 15:33

Copilot started reviewing on behalf of lklimek March 16, 2026 15:34 View session

Copilot AI reviewed Mar 16, 2026

View reviewed changes

packages/rs-sdk/src/error.rs Outdated Show resolved Hide resolved

packages/rs-sdk/src/error.rs Outdated Show resolved Hide resolved

packages/rs-sdk/src/error.rs Outdated Show resolved Hide resolved

lklimek force-pushed the fix/sdk-cbor-security-hardening branch from b173ca5 to 4aa4910 Compare March 16, 2026 18:09

lklimek changed the title ~~fix(rs-sdk): decode consensus errors from drive-error-data-bin metadata fallback~~ fix(rs-dapi): decode base64 CBOR message in Tenderdash error data field Mar 16, 2026

lklimek force-pushed the fix/sdk-cbor-security-hardening branch from 0fc9aee to bfba2ef Compare March 16, 2026 19:56

Merge branch 'v3.1-dev' into fix/sdk-cbor-security-hardening

7ae8846

lklimek requested a review from Copilot March 18, 2026 13:32

lklimek marked this pull request as ready for review March 18, 2026 13:32

lklimek requested a review from QuantumExplorer as a code owner March 18, 2026 13:32

Copilot started reviewing on behalf of lklimek March 18, 2026 13:33 View session

Copilot AI reviewed Mar 18, 2026

View reviewed changes

packages/rs-dapi/src/services/platform_service/error_mapping.rs Show resolved Hide resolved

packages/rs-dapi/src/services/platform_service/error_mapping.rs Outdated Show resolved Hide resolved

thepastaclaw reviewed Mar 18, 2026

View reviewed changes

packages/rs-dapi/src/services/platform_service/error_mapping.rs Outdated Show resolved Hide resolved

packages/rs-dapi/src/services/platform_service/error_mapping.rs Show resolved Hide resolved

lklimek added the ready for final review Ready for the final review. If AI was involved in producing this PR, it has already had a reviewer. label Mar 18, 2026

thepastaclaw approved these changes Mar 18, 2026

View reviewed changes

lklimek and others added 3 commits March 20, 2026 07:31

Merge branch 'v3.1-dev' into fix/sdk-cbor-security-hardening

1b38d74

Merge branch 'v3.1-dev' into fix/sdk-cbor-security-hardening

91daf4b

lklimek requested a review from shumkov as a code owner March 20, 2026 09:29

lklimek changed the title ~~fix(rs-dapi): decode base64 CBOR message in Tenderdash error data field~~ fix(rs-dapi,sdk): decode base64 CBOR error messages and detect duplicate identity keys Mar 20, 2026

coderabbitai bot reviewed Mar 20, 2026

View reviewed changes

thepastaclaw suggested changes Mar 20, 2026

View reviewed changes

Revert "feat(sdk): detect duplicate identity key errors from drive-er…

5e99219

…ror-data-bin metadata" This reverts commit b18c173.

lklimek mentioned this pull request Mar 20, 2026

fix(drive-abci): duplicate identity key bypasses pre-validation, returns internal error instead of ConsensusError #3396

Open

thepastaclaw reviewed Mar 20, 2026

View reviewed changes

packages/rs-sdk/src/error.rs Show resolved Hide resolved

packages/rs-sdk/src/error.rs Outdated Show resolved Hide resolved

lklimek and others added 2 commits March 23, 2026 10:41

Merge branch 'v3.1-dev' into fix/sdk-cbor-security-hardening

a9a9fb9

lklimek requested a review from Copilot March 23, 2026 09:54

Copilot started reviewing on behalf of lklimek March 23, 2026 09:54 View session

Copilot AI reviewed Mar 23, 2026

View reviewed changes

Conversation

lklimek commented Mar 16, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Issue being fixed or feature implemented

Evidence from DET production logs

What was done?

How Has This Been Tested?

Automated

Manual integration test

Breaking Changes

Checklist:

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Poem

❌ Failed checks (1 warning)

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

codecov bot commented Mar 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

thepastaclaw left a comment

Choose a reason for hiding this comment

Code Review

1 additional finding

Uh oh!

Uh oh!

Uh oh!

thepastaclaw left a comment

Choose a reason for hiding this comment

Incremental Review

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Mar 20, 2026

Uh oh!

thepastaclaw left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

thepastaclaw Mar 20, 2026

Choose a reason for hiding this comment

Uh oh!

thepastaclaw left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Mar 23, 2026

Choose a reason for hiding this comment

Uh oh!

lklimek commented Mar 16, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Mar 16, 2026 •

edited

Loading

codecov bot commented Mar 18, 2026 •

edited

Loading