GPT model selection: 4o-mini vs 4o vs o1 vs GPT-5which to use when
GPT-4o-mini is the workhorse. At $0.15/M input tokens, it's the right default for classification, extraction, routing, simple chat, RAG synthesis on small context. If you don't have a strong reason to use something else, use this.
GPT-4o is the upgrade for quality-sensitive work, sales-facing chatbots, code suggestions in your app, anything where users will notice mistakes. ~$2.50/M input. Still fast (sub-second first token), still cheap enough for high-volume.
o1 / o1-mini are reasoning models, slow (10–60s per response), expensive ($15+/M), but solve multi-step problems mini and 4o get wrong. Use for math, complex code generation, strategic planning. Don't use for chat, users won't wait.
GPT-5 (when available in your tier) is the new top of the lineup. Webtwizz defaults the OPENAI_MODEL env var to gpt-5.4 if you don't set it. Override for cheaper/faster work.
Routing pattern: in any chatbot or AI feature, classify the input first with 4o-mini ("is this question simple or complex?"), then route to 4o or o1 accordingly. Webtwizz can scaffold the routing layer if you describe the rule.



