Skip to content

Models

Models are the AI Alfred can use. Your model choice affects answer quality, speed, usage cost, supported inputs, and how much context Alfred can consider.

Use a stronger model for hard reasoning, coding, detailed writing, complex file analysis, and high-stakes planning.

Use a faster or lower-cost model for quick questions, drafts, simple summaries, and high-volume work.

Model availability depends on your plan.

ModelBest forInputsAvailability
GPT-5Balanced advanced work, reasoning, images, professional writingText, imagesElite and above
GPT-5-MINIFaster OpenAI responses and lighter workText, imagesPower and above
Claude-SonnetStrong coding, analysis, and careful reasoningText, imagesElite and above
Claude-HaikuFast everyday work with solid reasoningText, imagesPower and above
Gemini-ProLarge context, complex analysis, multimodal workText, images, audio, videoElite and above
Gemini-FlashFast, cost-effective general workText, images, audio, videoAll plans
Gemini-Flash-LiteLowest-cost conservation modeText, images, audio, videoAll plans
DeepSeek-FlashFast text-only workTextAll plans
DeepSeek-ProHigher-quality text-only workTextAll plans

Plan names and exact availability can change over time. The Dashboard model picker is the source of truth for your account because it shows what your current plan can use.

Your selected model is a preference. When your usage gets high, Alfred may temporarily switch to a more cost-effective model to keep your daily allowance available longer.

You may see Alfred mention that he has switched models. This usually means you are approaching a usage threshold, not that your preferred model setting was removed.

Some models can work with images, audio, video, or large context better than others. AI models that cannot process images are given a Vision Agent that will mediate vision tasks on its behalf.

More capable models often use more allowance. Larger prompts, long files, web research, generated outputs, and tool use can also increase usage. For simple work, a cheaper model can be the better experience because it is faster and lets you do more before reaching your limit.

See Usage for how allowance works.