Try to increase shared memory per block to accommodate the kernel #108

@eyalroz

Many NVIDIA GPUs have an adjustable maximum amount of shared memory per block, with the default limit being 48 KiB; a kernel which needs more than that must have the limit raised for it explicitly before launch. We currently do nothing to adjust this limit, and simply schedule the compiled kernel as-is. We should, instead, try to raise the limit, up to whatever the device supports, so that our kernels with larger shared memory use are accepted and scheduled rather than rejected.
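A minimal sketch of what the adjustment could look like, assuming the kernel is launched through the CUDA runtime API and uses dynamically-sized shared memory. `example_kernel`, `needs_opt_in` and `try_accommodate` are hypothetical names made up for illustration, not anything existing in this project; the CUDA calls themselves (`cudaDeviceGetAttribute`, `cudaFuncGetAttributes`, `cudaFuncSetAttribute`) are the standard runtime API ones.

```cuda
#include <cstddef>
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical stand-in for one of our compiled kernels. It uses dynamic
// shared memory, whose size is only fixed at launch time.
__global__ void example_kernel(float* out)
{
    extern __shared__ float scratch[];
    scratch[threadIdx.x] = static_cast<float>(threadIdx.x);
    __syncthreads();
    if (threadIdx.x == 0) { out[blockIdx.x] = scratch[blockDim.x - 1]; }
}

// Per-block limit which applies when no opt-in has been made.
constexpr std::size_t default_smem_limit = 48 * 1024;

constexpr bool needs_opt_in(std::size_t requested_bytes)
{
    return requested_bytes > default_smem_limit;
}

// Tries to make `kernel` launchable with `requested_bytes` of dynamic shared
// memory. Returns false if the device cannot provide that much, or if any
// of the CUDA calls fail.
bool try_accommodate(const void* kernel, std::size_t requested_bytes, int device)
{
    if (!needs_opt_in(requested_bytes)) { return true; }

    int optin_max = 0; // largest per-block amount the device allows with opt-in
    if (cudaDeviceGetAttribute(&optin_max,
            cudaDevAttrMaxSharedMemoryPerBlockOptin, device) != cudaSuccess) {
        return false;
    }

    // Statically-allocated shared memory counts against the same limit.
    cudaFuncAttributes attrs;
    if (cudaFuncGetAttributes(&attrs, kernel) != cudaSuccess) { return false; }
    if (requested_bytes + attrs.sharedSizeBytes > static_cast<std::size_t>(optin_max)) {
        return false;
    }

    return cudaFuncSetAttribute(kernel,
        cudaFuncAttributeMaxDynamicSharedMemorySize,
        static_cast<int>(requested_bytes)) == cudaSuccess;
}

int main()
{
    const int device = 0;
    if (cudaSetDevice(device) != cudaSuccess) {
        std::puts("no usable CUDA device");
        return 1;
    }

    const std::size_t requested = 64 * 1024; // above the 48 KiB default
    const void* kernel = reinterpret_cast<const void*>(&example_kernel);
    if (!try_accommodate(kernel, requested, device)) {
        std::puts("device cannot provide the requested shared memory");
        return 1;
    }

    float* out = nullptr;
    if (cudaMalloc(&out, sizeof(float)) != cudaSuccess) { return 1; }
    example_kernel<<<1, 256, requested>>>(out);
    cudaError_t status = cudaGetLastError();          // launch-time rejection
    if (status == cudaSuccess) { status = cudaDeviceSynchronize(); }
    std::printf("launch status: %s\n", cudaGetErrorString(status));
    cudaFree(out);
    return status == cudaSuccess ? 0 : 1;
}
```

On devices whose opt-in maximum is no larger than the default, `try_accommodate` returns false for the 64 KiB request, and the caller can report the problem up front instead of having the launch rejected. If the kernels are instead loaded through the driver API (for example after runtime compilation), the corresponding call would presumably be `cuFuncSetAttribute` with `CU_FUNC_ATTRIBUTE_MAX_DYNAMIC_SHARED_SIZE_BYTES`.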
