Skip to content

Conversation

@stwhite91
Copy link

  • Add BF16 and Float16 datatype support.

  • Add support for duplicating communicators for each OpenMP thread.

  • Use OpenMP inside mpi_all_reduce to split msg across threads/comms.

  • If HANDOFF is defined to 1 at top of coll_mpi.cpp, do not use thread-split.

@stwhite91
Copy link
Author

No need to merge yet, but sharing code here.

@delock
Copy link
Owner

delock commented Nov 15, 2024

@stwhite91 when MPI backend create comms thread according to openmp threads, does it be active when communication is not happening? I mean will it interfere with compute ops before and after communican, which also use openmp threads?

@stwhite91
Copy link
Author

Sorry I didn't see your comment earlier. No the OpenMP threads I have added here are not active outside of the mpi_all_reduce, there is no problem using OpenMP threads elsewhere in the application.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants