Hi there 👋, I'm Yang Jianxin

I'm an NLPer interested in large language models. I graduated from SYSU with a master's degree.

In my free time, I write technical blog posts on [WeChat Official Account: YeungNLP] and [Zhihu: 红雨瓢泼].

🔭 Experiences:

  • Shopee: building NLP algorithm capabilities for customer service (2022-04 to present).
  • Tencent: building NLP algorithm capabilities for product understanding (2021-06 to 2022-04).
  • Alibaba: internship (2020-06 to 2020-09).

⚙ Here are some of my public projects:

| Project | Description |
| --- | --- |
| Firefly | One-stop training for LLMs. Some achievements:<br>1. firefly-llama2-13b ranked 3rd among all 13B models on the Open LLM Leaderboard, only 0.5 points behind 1st place.<br>2. firefly-llama-30b, trained on a single V100, ranked 10th among all 30B models on the Open LLM Leaderboard.<br>3. firefly-baichuan-13b has over 1.63 million downloads.<br>4. firefly-qwen1.5-en-7b-dpo scores 7.21 points higher than the official chat model.<br>5. firefly-gemma-7b scores 9.37 points higher than the official chat model. |
| GPT2-chitchat | Chinese GPT2 for chitchat. |
| Firefly-LLaMA2-Chinese | Chinese Llama2 with an efficient and effective training method. |
| LongQLoRA | An efficient and effective method for extending the context length of Llama2 to 8192 on a single V100. Technical Report |
| CPM | Chinese composition model based on CPM. |
| CLIP-Chinese | Chinese CLIP model trained on 1.4 million image-text pairs. |
| ClipCap-Chinese | Chinese image captioning model based on CLIP and Mengzi. |
| OFA-Chinese | Chinese multi-modal unified pre-training model. |
| LLMPruner | Prunes the vocabulary of LLMs to save memory during training. |

📁 Here are some of my technical blogs:


  1. Firefly (Public)

    Firefly: a training tool for large language models, supporting Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

    Python · 5.1k stars · 469 forks

  2. GPT2-chitchat (Public)

    GPT2 for Chinese chitchat (implements the MMI idea from DialoGPT)

    Python · 2.9k stars · 679 forks

  3. CPM (Public)

    Easy-to-use CPM for Chinese text generation

    Python · 524 stars · 135 forks

  4. Firefly-LLaMA2-Chinese (Public)

    Python · 381 stars · 27 forks

  5. CLIP-Chinese (Public)

    Python · 357 stars · 56 forks

  6. LongQLoRA (Public)

    LongQLoRA: Extend Context Length of LLMs Efficiently

    Python · 148 stars · 11 forks