• Toasty_toaster@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    10 months ago

    ChatGPT predicts the most probable next token, or the next token that yields the highest probability of a thumbs up, depending on whether you’re talking about the semi-supervised learning or the reinforcement learning stage of training. That is the conceptual underpinning of how the parameter updates are calculated. It only achieves the ability to communicate because it was trained on text that successfully communicates.