Toasty_toaster

joined 1 year ago
[–] Toasty_toaster@alien.top 1 points 11 months ago

ChatGPT predicts the most probable next token, or the next token that yields the highest probability of a thumbs up, depending on whether you're talking about the semi-supervised learning or the reinforcement learning stage of training. That is the conceptual underpinning of how the parameter updates are calculated. It only achieves the ability to communicate because it was trained on text that successfully communicates.