ChatGPT’s architecture
ChatGPT is an auto-regressor. It generates text by using a transformer classifier to output a probability distribution over possible next words, given a partial piece of text.
References
1. Why Does Diffusion Work Better than Auto-Regression? YouTube