All of these LLMs talk the same because:
- there's only one Internet
- Transformer is all you need
- everyone is doing pre-training → supervised fine-tuning → RL
- all the “secret sauce” leaks out between the big labs at SF restaurants