🎯
Focusing

Organizations

@NVIDIA @NVIDIA-NeMo

Pinned

  1. Megatron-LM (Public)

    Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer language models at scale, including: BERT & GPT-2

    Python 1

  2. DAPPLE (Public)

    Forked from AlibabaPAI/DAPPLE

    An Efficient Pipelined Data Parallel Approach for Large Model Training

    Python 3

  3. grouped_gemm (Public)

    Forked from tgale96/grouped_gemm

    PyTorch bindings for CUTLASS grouped GEMM.

    Cuda 165 47

  4. TransformerEngine (Public)

    Forked from NVIDIA/TransformerEngine

    A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

    Python