Refactor connectionId, nodeId, clusterId by ArgusLi · Pull Request #375 · valkey-io/valkey-admin

ArgusLi · 2026-06-22T22:40:14Z

Description

Previously the metrics state was keyed by the db-suffixed connectionId (--db) even though the metrics process is unique per Valkey node. That mismatch caused state to be stored and looked up under inconsistent keys.

This branch establishes a single id space for metrics and re-keys all node-level state to the db-less nodeId (-).

Bug fixes

Fixed double hot-key response.
Fixed standalone hot-keys showing no result.

Tooling

Renamed eslint.config.js to eslint.config.mjs so the ESM-only config loads (the plugins are ESM-only and the root package is CommonJS).
Make npm run lint automatically fix.

Signed-off-by: Argus Li <argus@argusli.dev>

arseny-kostenko · 2026-06-23T23:55:50Z

-        return
+    if (typeof clusterId === "string") {
+      const nodeIds = Object.keys(clusterNodesRegistry[clusterId] ?? {}).filter((id) => metricsServerMap.has(id))
+      const responses = await Promise.all(


a node may be in a semi-alive state like when it's a failover or something, and it will fail the promise, so maybe worth switching to allSettled instead?

in other words, if one node is to be killed anyway — do we need to fail the operation that succeeded in all alive nodes?

I think that's a good change. For this PR, my focus was on retaining existing logic and just ensuring my changes don't break anything. Considering #373 is waiting for this PR so that the bug where errors are not propagated to the right ID is fixed, I would prefer to not add this and the additional handling in this PR, and add it in its own PR right after.

arseny-kostenko · 2026-06-23T23:58:35Z

+      const responses = await Promise.all(
+        nodeIds.map((nodeId) => postConfigToNode(metricsServerMap.get(nodeId)?.metricsURI, config)),
+      )
+      const firstFailure = responses.find((r) => !r.success)


subsequently, just getting the entire list of failing nodes instead of re-sending an update just to discover another failing node?

and then use filter instead of findFirst?

arseny-kostenko · 2026-06-23T23:59:52Z

+        sendUpdateError(ws, { clusterId }, firstFailure)
+      } else {
+        // All nodes responses are the same so we use the first.
+        sendUpdateFulfilled(ws, { clusterId }, responses[0] ?? { success: true, message: "", data: {} })


updating a config is a rarely used operation so we don't have to optimize for network traffic. I'd prefer to have an explicit node: status response

arseny-kostenko · 2026-06-24T00:00:43Z

    const nodes =
-      typeof clusterId === "string"
-        ? clusterNodesRegistry[clusterId]
+      typeof clusterId === "string" 


did lint add trailing spaces?

It appears to have.

ravjotbrar · 2026-06-24T16:19:08Z

+  // `targetId` keys node-level metrics state: `clusterId` for a cluster, else
+  // the db-less `nodeId`. (Connection-scoped state below still uses `id`, the
+  // db-suffixed connectionId.)
+  const nodeId = toNodeId(id!)


Does it make sense to just store nodeId in frontend state instead of stripping the db suffixed ID every time we need it?

We actually are using targetId as the key in the frontend state for node level metrics state. This can be seen in the following functions in the code i.e. selectCommandLogs()(), selectHotKeys()()...

The reason for keeping the connectionId key state is that we still have things that need to be scoped by db such as the connection details, keys etc.

I'm more so referring to storing nodeId in connection state, so components can read it directly instead of calling toNodeId every time. But this isn't a blocker if you don't think its worth the time.

Adding nodeId in connection state means that if the connectionId gets changed, we have to ensure nodeId stays consistent, which is an additional thing to manage. toNodeId() is also cheap operation so there's no real performance benefit from caching it directly in connection state.

So in my mind it's not worth gaining that really small performance gain in exchange for an additional state we have to manage.

Signed-off-by: Argus Li <argus@argusli.dev>

ArgusLi added 4 commits June 18, 2026 14:17

Rename connectionId to nodeId if not containing db.

2561ba5

Signed-off-by: Argus Li <argus@argusli.dev>

Split into explicit cluster/standalone branches and replies.

514581b

Signed-off-by: Argus Li <argus@argusli.dev>

Fix changes in the frontend.

711fa95

Signed-off-by: Argus Li <argus@argusli.dev>

Add configSlice test

ede6eda

Signed-off-by: Argus Li <argus@argusli.dev>

ArgusLi marked this pull request as draft June 22, 2026 22:41

ArgusLi added 5 commits June 23, 2026 13:21

Re-key all node level metrics states to nodeId

00d3843

Signed-off-by: Argus Li <argus@argusli.dev>

Fix hotkey standalone no result bug.

0c63e4e

Signed-off-by: Argus Li <argus@argusli.dev>

fix lint import issue

927a19c

Signed-off-by: Argus Li <argus@argusli.dev>

Fix lint

6a87f05

Signed-off-by: Argus Li <argus@argusli.dev>

Fix double hot key response.

da46eb6

Signed-off-by: Argus Li <argus@argusli.dev>

ArgusLi marked this pull request as ready for review June 23, 2026 22:23

ArgusLi requested review from arseny-kostenko and ravjotbrar June 23, 2026 22:24

arseny-kostenko reviewed Jun 23, 2026

View reviewed changes

arseny-kostenko reviewed Jun 24, 2026

View reviewed changes

ravjotbrar reviewed Jun 24, 2026

View reviewed changes

Comment thread apps/frontend/src/components/ui/monitor-warning-banner.tsx

ravjotbrar reviewed Jun 24, 2026

View reviewed changes

Comment thread apps/frontend/src/state/valkey-features/config/configSlice.ts

ravjotbrar reviewed Jun 24, 2026

View reviewed changes

Comment thread apps/server/src/actions/commandLogs.ts

Add ADR.

568d3d1

Signed-off-by: Argus Li <argus@argusli.dev>

ravjotbrar approved these changes Jun 24, 2026

View reviewed changes

ArgusLi merged commit 0a58993 into main Jun 24, 2026
8 checks passed

ArgusLi deleted the fix/connection-id-node-id branch June 24, 2026 17:48

Uh oh!

Conversation

ArgusLi commented Jun 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Bug fixes

Tooling

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ArgusLi commented Jun 22, 2026 •

edited

Loading