[GPURUN] new binding script uses numa info from: lscpu and rocm-smi #721

ronlieb · 2025-11-30T21:13:09Z

Supported flags:
-v, -vv, -vvv: verbosity levels
-taskset: bind with taskset
-numactl: bind with numactl (default)
-nobind: do not bind
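
To make the three binding modes concrete, here is a minimal Python sketch of how a launcher could turn a mode and a CPU list into a command prefix. It is not taken from the gpurun script. The function name bind_prefix is hypothetical, and using numactl --physcpubind (rather than, say, --cpunodebind) is an assumption; the PR does not say which numactl option the script passes.

```python
def bind_prefix(mode, cpus):
    """Return the argv prefix that pins a command to `cpus`.

    mode: "taskset", "numactl", or "nobind" (mirrors the flags above).
    cpus: CPU list string such as "0-7"; empty or None means nothing to bind to.
    """
    if mode == "nobind" or not cpus:
        return []
    if mode == "taskset":
        # taskset -c takes a CPU list such as 0-7 or 0,2,4
        return ["taskset", "-c", cpus]
    if mode == "numactl":
        # Assumption: bind by physical CPU list; the real script may differ.
        return ["numactl", f"--physcpubind={cpus}"]
    raise ValueError(f"unknown binding mode: {mode}")


if __name__ == "__main__":
    app = ["./my_app"]  # hypothetical application
    print(bind_prefix("numactl", "0-7") + app)
```

The prefix is returned as a list so it can be concatenated with the application's argv and handed to subprocess.run without shell quoting issues.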

lscpu numa info
NUMA node0 CPU(s): 0-7
NUMA node1 CPU(s): 8-15

rocm-smi --showtoponuma info
GPU[0] : (Topology) Numa Node: 6
GPU[0] : (Topology) Numa Affinity: 6
GPU[1] : (Topology) Numa Node: 6
GPU[1] : (Topology) Numa Affinity: 6
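
As an illustration of how the two outputs above can be combined, here is a hedged Python sketch that parses the lscpu lines and the rocm-smi --showtoponuma lines and looks up the CPU list for a GPU's NUMA node. The function names are hypothetical and the regular expressions only cover the line shapes shown in this PR. In the sample above the GPUs report node 6 while lscpu lists only node0 and node1, so the lookup returns None for that data; what the script does in that case is not stated here.

```python
import re


def parse_lscpu_numa(text):
    """Map NUMA node id -> CPU list string, from `lscpu` output."""
    nodes = {}
    for m in re.finditer(r"NUMA node(\d+) CPU\(s\):\s*(\S+)", text):
        nodes[int(m.group(1))] = m.group(2)
    return nodes


def parse_gpu_numa(text):
    """Map GPU index -> NUMA node, from `rocm-smi --showtoponuma` output."""
    gpus = {}
    pattern = r"GPU\[(\d+)\]\s*:\s*\(Topology\) Numa Node:\s*(-?\d+)"
    for m in re.finditer(pattern, text):
        gpus[int(m.group(1))] = int(m.group(2))
    return gpus


def cpus_for_gpu(gpu, gpu_nodes, node_cpus):
    """Return the CPU list for the GPU's NUMA node, or None if unknown."""
    return node_cpus.get(gpu_nodes.get(gpu))


if __name__ == "__main__":
    lscpu_out = "NUMA node0 CPU(s): 0-7\nNUMA node1 CPU(s): 8-15\n"
    smi_out = (
        "GPU[0] : (Topology) Numa Node: 6\n"
        "GPU[0] : (Topology) Numa Affinity: 6\n"
    )
    print(cpus_for_gpu(0, parse_gpu_numa(smi_out), parse_lscpu_numa(lscpu_out)))
```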

z1-cciauto · 2025-11-30T21:14:31Z

PSDB Link: https://compiler-ci.amd.com/job/compiler-psdb-amd-staging/3029

z1-cciauto · 2025-12-01T16:04:25Z

PSDB Link: https://compiler-ci.amd.com/job/compiler-psdb-amd-staging/3034

ronlieb · 2025-12-02T20:59:58Z

next round will move to using amd-smi

future options:
# -h        Print this help message and exit
# -md       Set number of desired devices for multi-device mode, default=1
# -s        suppress output, often useful in benchmarking
# -q        suppress output, quiet, alias of -s, same as GPURUN_VERBOSE=0
# -m        use numactl membind to CPUs in same NUMA domain. Note: Allocation
#           fails when not enough memory available on these nodes.
# -l        use numactl localalloc to CPUs in same NUMA domain. Note: If
#           memory cannot be allocated, alloc falls back to other nodes.
# -nr       use numactl ROCR_VISIBLE_DEVICES
# -nm       use numactl OMPI_COMM_WORLD_LOCAL_RANK
# --version Print version of gpurun and exit
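
The difference between the proposed -m and -l options comes down to which numactl memory policy is requested. A rough Python sketch follows, with a hypothetical helper name; mapping -m to --membind and -l to --localalloc is an assumption based on the option descriptions above, not on the script itself.

```python
def mem_policy_args(policy, node=None):
    """Return numactl arguments for a memory policy.

    policy: "membind" (strict, allocation fails if the node is out of memory),
            "localalloc" (prefer the local node, fall back to others),
            or None for no memory policy.
    node:   NUMA node id, required for "membind".
    """
    if policy is None:
        return []
    if policy == "membind":
        if node is None:
            raise ValueError("membind needs a NUMA node")
        return [f"--membind={node}"]
    if policy == "localalloc":
        return ["--localalloc"]
    raise ValueError(f"unknown memory policy: {policy}")


if __name__ == "__main__":
    # Example: strict binding to node 1 combined with a CPU binding.
    print(["numactl", "--physcpubind=8-15"] + mem_policy_args("membind", 1))
```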

z1-cciauto · 2025-12-06T17:22:18Z

PSDB Link: https://compiler-ci.amd.com/job/compiler-psdb-amd-staging/3146

z1-cciauto · 2025-12-06T22:24:51Z

PSDB Link: https://compiler-ci.amd.com/job/compiler-psdb-amd-staging/3149

ronlieb added the "testing only" and "Review" labels Nov 30, 2025

ronlieb requested review from carlobertolli, dhruvachak and gregrodgers November 30, 2025 21:14

ronlieb requested a review from lfmeadow November 30, 2025 21:25

carlobertolli approved these changes Nov 30, 2025

[BindNuma] fixes for cpx/tpx, add -dryrun option

d408b87

ronlieb added 4 commits December 6, 2025 11:04

rename gpurun to gpurun-old

c87a128

rename BindNuma to gpurun

2767b68

add copyright to new file

4bb22c2

ronlieb added 3 commits December 6, 2025 13:21

Merge branch 'amd-staging' into amd/dev/rlieberm/AltBind

dcc87b2

remove errant fi

725d16b

Add help text

5e2b64c

ronlieb removed the "testing only" and "Review" labels Dec 6, 2025
