Skip to content
View ZhijunLStudio's full-sized avatar
🌴
On vacation
🌴
On vacation

Block or report ZhijunLStudio

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ZhijunLStudio/README.md
Typing SVG

CSDN Gmail


$ cat /about.txt

I'm a researcher focused on making LLMs faster and smarter. Currently working on inference optimization, multimodal post-training (SFT/RLHF/DPO/GRPO), and efficient serving at scale.

Previously explored computer vision, now deep into the world of large language models and reinforcement learning.

$ cat /interests.txt

  • Inference Optimization (Quantization, KV-Cache, Speculative Decoding)
  • Post-training Alignment (SFT, RLHF, DPO, GRPO)
  • Multimodal LLMs (Vision-Language, Audio-Language)
  • Distributed Training & Efficient Serving (vLLM, TensorRT-LLM)

$ ls /skills/

PyTorch PaddlePaddle
vLLM FastDeploy
DeepSpeed Megatron-LM
LLaMA-Factory Python
C++ CUDA
Docker Linux

GitHub Stats

github contribution grid snake animation visitor counter

Pinned Loading

  1. PFCCLab/Camp PFCCLab/Camp Public

    飞桨护航计划集训营

    Mermaid 20 100

  2. FastDeploy FastDeploy Public

    Forked from PaddlePaddle/FastDeploy

    High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle

    Python

  3. LlamaFactory LlamaFactory Public

    Forked from hiyouga/LlamaFactory

    Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

    Python