@niklebedenko

Fixes #347

For reference, see the relevant section of the PTX ISA: https://docs.nvidia.com/cuda/parallel-thread-execution/#parallel-synchronization-and-communication-instructions-atom

Note this quote:

> The bit-size operations are .and, .or, .xor, .cas (compare-and-swap), and .exch (exchange).
>
> The integer operations are .add, .inc, .dec, .min, .max. The .inc and .dec operations return a result in the range [0..b].
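
To make the distinction in that quote concrete, here is a rough host-side C++ sketch of the semantics, not code from this PR. The helper names `atomic_inc_wrap`, `atomic_dec_wrap` and `atomic_or_bits` are made up for illustration, and the wrap rules are my reading of the `atom` section linked above: `.inc` stores `(old >= b) ? 0 : old + 1`, `.dec` stores `(old == 0 || old > b) ? b : old - 1`, and `.or` only looks at the bit pattern, which is why it is listed with the bit-size operations rather than the integer ones.

```cpp
#include <atomic>
#include <cassert>
#include <cstdint>

// Hypothetical host-side model of PTX atom.inc (an assumption, not this
// library's code): stores (old >= b) ? 0 : old + 1 and returns the old value,
// so the stored value always stays in [0..b].
inline std::uint32_t atomic_inc_wrap(std::atomic<std::uint32_t>& cell, std::uint32_t b) {
    std::uint32_t old = cell.load();
    std::uint32_t next;
    do {
        next = (old >= b) ? 0u : old + 1u;
        // On failure, compare_exchange_weak reloads `old`, and we recompute.
    } while (!cell.compare_exchange_weak(old, next));
    return old;
}

// Hypothetical model of PTX atom.dec: stores (old == 0 || old > b) ? b : old - 1
// and returns the old value.
inline std::uint32_t atomic_dec_wrap(std::atomic<std::uint32_t>& cell, std::uint32_t b) {
    std::uint32_t old = cell.load();
    std::uint32_t next;
    do {
        next = (old == 0u || old > b) ? b : old - 1u;
    } while (!cell.compare_exchange_weak(old, next));
    return old;
}

// Model of atom.or: a pure bit-pattern operation, so signedness of the
// operand type does not matter. Returns the old value.
inline std::uint32_t atomic_or_bits(std::atomic<std::uint32_t>& cell, std::uint32_t mask) {
    return cell.fetch_or(mask);
}
```

For example, with `b = 3`, calling `atomic_inc_wrap` on a cell holding 3 returns 3 and leaves 0 in the cell, and calling `atomic_or_bits` with mask `0xA` on a cell holding `0x5` returns `0x5` and leaves `0xF`.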

@niklebedenko
Author

Oops, I updated the wrong file. Bear with me...

@niklebedenko
Author

All sorted :) It works on my machine for building my library, but I haven't run it through your test suite yet. Please let me know if anything needs changing.

Development

Successfully merging this pull request may close these issues.

Atomic or implementation is buggy
