Jinpeng Zhang – Medium

Jinpeng Zhang

GPT Technical Evolutionary History (1)

GPT, which stands for Generative Pre-trained Transformer, is a series of large-scale language models developed by OpenAI. In order to understand…

Feb 15

Identify Top Talents in The Tech Interview

Background

Feb 9

DeepSeek-R1 Technical Analysis: Incentivizing Reasoning Capability in LLMs via Reinforcement…

In the previous five posts, I explained five key techniques the DeepSeek model uses to reduce training cost and improve model accuracy:

Feb 4

DeepSeek Technical Analysis — (5) FP8 Training

Background

Feb 2

DeepSeek Technical Analysis — (4)DualPipe

Background

Jan 31

DeepSeek Technical Analysis — (3) Multi-Token Prediction

Background

Jan 30

DeepSeek Technical Analysis — (2)MLA

Background

Jan 29

Key Techniques Behind DeepSeek Model’s 10x Efficiency — (1) MoE

Background

Jan 28

Transformer Clear Explanation: Attention Is All You Need! — 2017

Before the Transformer, RNNs (recurrent neural networks), LSTMs (long short-term memory, a variant of RNN), and gated RNNs had been firmly…

Jan 21

AlexNet: ImageNet Classification with Deep Convolutional Neural Networks — 2012

This work was published by Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton in 2012. AlexNet won the 2012 ImageNet Challenge, and…

Dec 31, 2024

Jinpeng Zhang

Director of Engineering @ TiDB, focused on building large-scale distributed systems and high-performance engineering teams.
