First, out of principle, because I don't want OpenAI to use my inputs for RLHF to then replace me.
Second, I want much more freedom for experimentation, and I can't have that with a cloud API where I have to constantly worry about how many tokens I consume, which translates to $$$
aGi HaS bEEn AchIEvEd InTErNallY!