Retour aux promos
onLM — Offline AI Assistant

Utilitaires

Gratuit

onLM — Offline AI Assistant

par Alexander Kryukov

v1.4.2 43 Mo Universel 17+

Description

Chat with powerful language models, transcribe voice notes, and generate images — all running directly on your iPhone or iPad. No cloud. No servers. No accounts. Your data never leaves your device.

onLM is a native iOS app that brings state-of-the-art AI to a fully offline workspace. Every message, recording, and image is processed locally using your device's hardware, giving you a genuinely private AI assistant that works without an internet connection and without a subscription.

PRIVATE AI CHAT
Chat with open-source LLMs that run entirely on your device. Conversations are stored only on your iPhone or iPad — no servers, no sign-up, no telemetry. Pick a model that fits your task and switch between them anytime.

VOICE NOTES, TRANSCRIPTION & SUMMARIES
Record audio, transcribe it to text, and summarize long recordings into key points — all processed on-device. Useful for meeting notes, interviews, lectures, or spoken ideas. Recordings are auto-titled based on content for easy browsing.

ON-DEVICE IMAGE GENERATION
On supported devices, generate images from text prompts without sending anything to a remote server. Write your prompt in any supported language and onLM translates it locally before generation. Your images stay in a private gallery on your device.

CHOOSE FROM LEADING OPEN-SOURCE MODELS
- Gemma 4 E2B & E4B — Google's edge models with native audio and vision support
- Gemma 3 (4B, 12B) — strong multilingual capabilities from Google
- Qwen 3.5 (2B, 4B, 9B) — excellent all-around performance
- Llama 3 (3B, 8B) — reliable general-purpose models from Meta
- Phi 4 Mini — optimized for math, logic, and code from Microsoft
- Mistral 7B — versatile European-built model

All downloadable models are 4-bit quantized and run efficiently on mobile hardware via Apple's MLX framework.

APPLE INTELLIGENCE INTEGRATION
On supported devices, onLM can use Apple Intelligence as a built-in chat model with zero setup — no download, instant responses. Switch between Apple Intelligence and open-source models at any time.

BUILT FOR YOUR DEVICE
onLM detects your iPhone or iPad's capabilities and recommends the best models for your hardware. Quality ratings help you balance speed and intelligence, and features that require more memory are only surfaced on devices that can handle them — so you never hit a wall unexpectedly.

SEAMLESS EXPERIENCE
- Real-time streaming — watch responses appear word by word
- Background downloads — continue working while models load
- Smart memory management — stable performance on mobile hardware
- Conversation management — organize, search, and rename chats
- Stop and resume generation anytime
- Disk space checks before every download

NO SUBSCRIPTIONS. NO ADS. NO TRACKING.
Download a model once and use it as much as you want. There are no usage limits, no hidden costs, and no telemetry. onLM is a straightforward native app, not a wrapper around someone else's service.

Whether you need a private AI chatbot, an offline voice transcriber for meetings and ideas, or an on-device image generator — onLM gives you the power of modern AI without compromising your privacy.

Nouveautés (v1.4.2)

- 12 GB iPhone stability — capped MLX buffer pool at 20 MB; added per-model KV-cache quantization and output-token caps (Qwen 3.5 9B, Gemma 3/4, Llama 3.1, Mistral) to stop jetsam kills by turn 3.
- Image generation — fixed OOM on 12 GB iPhones (proper hw.memsize gating, quantized SDXL path) and fixed a crash on long or non-Latin prompts (CLIP token sequences now clamped to 77).
- Onboarding — Skip button on the final page once any chat-capable model is ready; downloads keep running in the background.
- Download banner — now shown across all tabs, not just Chat.