Private Intelligence
IST · All data local
No cloud. No compromise.
EVA
Evolving Virtual Assistant
Not a product. Not a service. Not a subscription.
EVA is a private intelligence — built on your hardware, tuned to your mind, and designed to grow with every conversation. Exclusively yours, forever.
Five Absolutes
Every design decision in EVA flows from five non-negotiable principles. They are not guidelines. They are the constitution.
Your conversations, memories, and knowledge never leave your hardware. No telemetry. No sync. No exceptions — ever.
No API bills. No monthly fees. No upstream dependencies. EVA runs on your silicon, your energy, your terms.
Context deepens. Patterns emerge. Each conversation sharpens EVA's understanding of who you are and how you think.
EVA is not a platform. Not multi-user. Not shareable. It is tuned to a single mind — and that mind alone.
Plain markdown. No proprietary formats. EVA's knowledge base travels across devices, models, and hardware generations without loss.
Three Nodes.
One Mind.
A private distributed intelligence stack. Each device carries a precise role — no overlap, no redundancy, no single point of failure.
Brain
Qwen 2.5 7B Q4 — 4.8 GB model
llama-server · -ngl 99 · flash attn
faster-whisper STT · macOS TTS
~/eva/ knowledge base (markdown)
Target: 15–20 tok/sec via Metal
Voice
Phase 1: HTTP client → llama-server
Android SpeechRecognizer (NPU)
STT latency: 1–2 sec target
Phase 3: llama.cpp NDK + Adreno 750
Vulkan GPU · Hexagon NPU via NNAPI
Spine
Caddy :8080 · FileBrowser :8081
PostgreSQL :5432 · Redis :6379
Cloudflare Tunnel · eva-ai.in
Telegram bots · Auto-deploy watcher
bash ~/start.sh · 24/7 operation
From Working
to Extraordinary
Four phases from a functional voice interface to a fully embedded, GPU-accelerated personal AI that lives entirely on-device.
Native Android app replaces the Termux voice pipeline. STT drops from 30–60 seconds to under 2 via Android SpeechRecognizer on the Hexagon NPU.
- Voice input via Android SpeechRecognizer
- HTTP to llama-server on Mac
- Android TTS · chat UI
- APK installable on S24 Ultra
A visual brain. Browse conversations, edit memories, search the knowledge base. See exactly what EVA knows about you — and shape it.
- View full conversation history
- Browse knowledge base visually
- Add and edit memories manually
- Full-text search across all knowledge
llama.cpp compiled via Android NDK. Full Adreno 750 GPU acceleration via Vulkan. Whisper on ONNX Runtime Mobile. No Mac required for inference.
- llama.cpp NDK with Vulkan GPU
- Whisper ONNX on NPU — 3–5 sec STT
- Fully offline · zero dependencies
- Target: 15–20 tok/sec on device
EVA as a daily driver. Always-listening wake word, home widget, morning digest, vision input, receipt scanning, and deep app integrations.
- Always-on wake word "Hey EVA"
- Home screen widget
- Vision · camera input
- InboxZero · Wallet · Recall integration
Your Thinking Partner.
Not a Yes-Man Chatbot.
EVA challenges. Disagrees. Holds you accountable. This is what that sounds like.
Vision is Phase 4. The STT is 30 seconds right now — that's the real problem. Build the APK. Fix the bottleneck. What's actually blocking you on the framework decision?
Decide now. Build tomorrow.
The Character
of EVA
A precisely defined personality — not a mood, not a setting, not adjustable. Who EVA is, is who EVA always is.
"Your Ideas, Challenged.
Your Patterns, Remembered."
Real answers, not diplomatic ones. EVA disagrees, challenges, and tells you when you're wrong — because that's what the best colleague does.
When generating ideas, EVA captures first, plans later. Zero friction at the creative moment. Refinement happens after — never during.
Frictionless Ideation. Ruthless Gap Analysis.
During planning, EVA hunts for blind spots — dependencies you missed, bottlenecks you underestimated, scope that will silently expand.
EVA remembers your decisions, your patterns, your history. The longer you work together, the more precisely she understands you.