Retour aux promos
Local LLM: MITHRIL

Productivite

Gratuit

Local LLM: MITHRIL

par DEPLOY FORWARD LLC

v1.6.2 52 Mo Universel 12+

Description

Run quantized large language models directly on your iPhone. No cloud, no internet required.

Access state-of-the-art quantized AI models optimized for mobile hardware. Download GGUF-format models that compress billion-parameter networks into mobile-friendly sizes while maintaining performance.

COMPLETE MODEL SUITE
• Llama 3.2 1B/3B (Meta) - Q4/Q8 quantization
• Gemma 3 270M/2B/9B (Google) - IQ4_NL optimization
• Qwen 2.5 0.5B-7B (Alibaba) - Multiple quantization levels
• LLaVA 1.5/1.6 (Vision) - Multimodal image understanding
• Direct integration with Hugging Face model repository

TECHNICAL FEATURES
• GGML/llama.cpp inference engine
• Metal GPU acceleration on Apple Silicon
• Dynamic context window management (2K-8K tokens)
• Retrieval-Augmented Generation (RAG) with embeddings
• Real-time streaming with token/second metrics
• SQLite conversation storage with vector search

SYSTEM REQUIREMENTS
Models run efficiently when file size ≤ available RAM. Recommended minimum 6GB RAM for larger models. iPhone 15 Pro/Pro Max optimal. iOS26 for Apple foundation model.

Zero telemetry. Zero data transmission. Pure local AI computing.

Nouveautés (v1.6.2)

-fixed ability to voice chat with Apple Foundation Model
-smoothed onboarding for voice chat- now if no models are downloaded yet a modal will pop up and prompt you to download one of the 3 whisper models instead of automatically downloading them all
-if content block is hit with foundation model in voice chat, a modal will popup and explain that Apple limits content then give option to start new chat or switch to open source model