Back to deals
OnDevice LLM - Offline AI Chat

Utilities

Free

OnDevice LLM - Offline AI Chat

by Mustafa Ergisi

v1.16 48 MB Universal 4+

Description

OnDevice LLM runs powerful open-source AI models 100% on your iPhone — no internet, no cloud, no account.

Your conversations never leave the device. Every word you type, every reply you get, stays on your phone. There is no server to log it, no company to mine it, no account that can be banned.

NEW IN 1.3 — TERNARY BONSAI, HOME SCREEN, DAILY BRIEF
- Ternary Bonsai arrives: 1.7B / 4B / 8B (MLX) — 8B-class quality in just 1.75 GB, fully offline.
- Brand-new Home screen with Quick Actions: ask a document, ask a photo, brief my day, or start a chat.
- Brief my day: combine your Calendar + Reminders with the app's private memory into a morning briefing — fully on-device.
- Better DeepSeek R1 output (fixed chat template).
- Model info cards on every model — check size, fit, and format before loading.

WHAT'S INSIDE
- A curated catalog of top open models:
- llama.cpp / GGUF: Gemma 4 / Gemma 2, Llama 3.2, Qwen 2.5 / Qwen 3, Phi-4 Mini, SmolLM2, DeepSeek R1.
- MLX (Apple Silicon native): Ternary Bonsai 1.7B / 4B / 8B.
- Quick Start chips: tap a persona (Translator, Code Helper, Brainstorm, Email Writer, ELI5...) or a demo (Rock Paper Scissors, Click Counter, Snake, Calculator, Tic-Tac-Toe, Drawing Pad, Bouncing Ball, SVG illustration) to send the prompt with one tap.
- Run mini-apps in chat: ask for a game, calculator, drawing pad, or SVG and tap "Run" to use the result right inside the chat — fully offline.
- Markdown chat with code blocks, syntax highlighting, copy + share.
- Attach PDFs, images, or text documents — the AI reads them locally.
- Voice input via on-device speech recognition.
- Persistent chat history with search.
- Personal memory that learns useful facts about you (and stays on your phone).
- Share Extension: send text from any app straight to your AI.

PRIVACY BY DESIGN
- 100% offline inference. Airplane mode works. Cellular off works.
- No account. No email. No tracking SDKs. No analytics that report your prompts.
- Your chats, memories, and downloaded models live in your iPhone's app sandbox only.

WHO IT'S FOR
- Privacy-conscious professionals (legal, medical, journalism, therapy notes).
- Frequent travelers, pilots, mariners — anyone working without reliable internet.
- Users in countries where cloud AI services are blocked.
- Tinkerers and developers who want to load and compare open-source models on a phone.

PRO UPGRADE
- Free plan: 1 model loaded, 3 saved chats.
- Pro: unlimited models, unlimited chats, custom system prompts, custom HuggingFace model URLs.
- One-time Lifetime purchase — buy once, own forever.

Subscriptions auto-renew unless canceled at least 24 hours before the period ends. Manage or cancel any time in Settings - Apple ID - Subscriptions.

What's new (v1.16)

Bug fixes and stability improvements.