Skip to content
View LoserCheems's full-sized avatar
🐶
I am loser cheems
🐶
I am loser cheems

Block or report LoserCheems

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
LoserCheems/README.md

Jingze Shi

news: I am looking for a engineering internship in the field of LLM. If you have any information, don't hesitate to get in touch with me. 📧

Experience 🐕

  • 2022.9-Present Undergraduate Student

Competition Awards 🏆

Publications 📝

  • Trainable Dynamic Mask Sparse Attention [Paper]
  • Concise Reasoning, Big Gains: Pruning Long Reasoning Trace with Difficulty-Aware Prompting [Paper]

Research Direction 🔭

  • Natural Language Processing
  • Large Language Models
  • Small Language Models
  • Foundation Models
  • Deep Reinforcement Learning
  • High Efficient Algorithm

Skills ⚒️

  • Natural Language: 简体中文, English
  • Programming Language: C++, Python
  • Typesetting Language: Markdown, LaTeX
  • Programming Framework: PyTorch, Transformers

Pinned Loading

  1. huggingface/transformers huggingface/transformers Public

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    Python 154k 31.5k

  2. pytorch/pytorch pytorch/pytorch Public

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Python 96k 26.3k

  3. huggingface/open-r1 huggingface/open-r1 Public

    Fully open reproduction of DeepSeek-R1

    Python 25.7k 2.4k

  4. huggingface/trl huggingface/trl Public

    Train transformer language models with reinforcement learning.

    Python 16.7k 2.4k

  5. flash-algo/flash-sparse-attention flash-algo/flash-sparse-attention Public

    Trainable fast and memory-efficient sparse attention

    Python 482 46

  6. flash-algo/kernel-course flash-algo/kernel-course Public

    Learn how to develop kernels

    Python 95 1