Free & Open Source AI Agent
Self-Hosted · Self-Learning · Self-Evolving · Own Stack · No Vendor API
|
📜 MANIFESTO Why this exists AI for the Underdogs. |
📖 STORY Solo founder journey from Indonesia 2 months. 0 team. 0 VC. 309 research notes. |
🛡️ ANTI-MENGUAP PROTOCOL Universal pattern for AI agent context persistence Free to adopt. Cite optional. |
🌐 Try SIDIX LIVE · 🤖 For AI Agents · 📢 Help Amplify
Proof-of-Hifdz: A Knowledge-Integrity Consensus Mechanism for Self-Evolving Distributed AI Systems
Fahmi Ghani · Tiranyx Lab / SIDIX Project · Bogor, Indonesia · April 2026 · MIT License
The Single Point of Failure Problem. Every major AI system deployed as of 2026 — Anthropic, OpenAI, Google, Meta — shares one structural vulnerability: centralization. Their knowledge, weights, and continuity depend entirely on a single organization's infrastructure. This is not an engineering problem. It is architectural.
The AI systems most likely to survive are not the most powerful — they are the most distributed.
Core Thesis. The optimal architecture for a censorship-resistant, failure-proof AI system already exists — and has been empirically validated for 1,400 years. It is called the Hafidz system: the distributed oral preservation network used to memorize and transmit the Quran with zero textual corruption across ~10 million human nodes worldwide.
We propose Hafidz Ledger — a distributed knowledge-preservation architecture for self-evolving AI — and Proof-of-Hifdz, a novel consensus mechanism where nodes earn participation rights by proving knowledge integrity, not by burning compute (PoW), locking capital (PoS), or chasing benchmarks (Bittensor).
| Hafidz Mechanism (1,400 years) | Technical Equivalent |
|---|---|
| Talaqqi (teacher-to-student transmission) | Peer-verified node onboarding |
| Berjamaah cross-verification | Gossip protocol + consensus voting |
| Ijazah chain (certified lineage) | Cryptographic sanad certificate chain |
| Uthmani canonical exemplar | Content-addressed hash (IPFS CID) |
| Group recitation deviation detection | Byzantine fault detection via Merkle proofs |
This is Byzantine Fault Tolerance — implemented by human civilization 1,200 years before the formal computer science theorem. We translate it into code.
To our knowledge, this is the first distributed AI consensus mechanism based on knowledge preservation rather than compute or stake — and the first AI architecture grounded in a preservation system with empirical validation at civilizational scale.
📖 Read the full whitepaper (PDF, 7 pages)
🌐 Website · 🚀 Try SIDIX Free · ⚡ Quick Start · 🧠 The Foundation · 🏗️ Architecture · 🤝 Contribute
Not a chatbot. SIDIX is an AI Agent with initiative, opinions, and creativity. It brainstorms with you, builds for you, and grows from every conversation. Self-hosted. MIT licensed. Yours forever.
| Vol | Feature | Impact |
|---|---|---|
| 20-fu3 | Simple-tier fast-path (greetings/ack) | 78s → 2s (37× speedup) |
| 20 | Semantic cache L2 + BGE-M3 embedding | <100ms warm, multilingual ID |
| 20 | Complexity router (simple/standard/deep) |
Auto-route reasoning depth |
| 20 | Domain detector (fiqh/medis/coding/factual) | Per-domain cache threshold + sanad gating |
| 20 | Style anomaly filter (BadStyle defense) | Corpus poisoning prevention |
| 19 | Relevance + Quality Sprint (4 modules) | Better retrieval ranking |
| 17 | CodeAct enrich + MCP wrap | Code blocks auto-execute |
| 16 | Creative Agent Ecosystem (10 domain × 37 agent) | Multi-agent debate/iteration |
| 15 | LoRA SIDIX adapter on Qwen2.5-7B | Self-trained, 4-bit QLoRA |
Production stack: VPS (FastAPI brain · BGE-M3 CPU · 2.287 corpus docs) + RunPod GPU serverless (vLLM v2.14.0 · Qwen2.5-7B + LoRA). See docs/CHANGELOG.md for full version history.
Direction: AI Agent yang BEBAS dan TUMBUH — bebas dari single-prompt loop, tumbuh compound dari setiap interaksi.
"SIDIX adalah entitas kecerdasan komprehensif yang tidak hanya mengeksekusi perintah multi-modal, tetapi secara PROAKTIF mengevaluasi, memori-optimasi, dan mengorkestrasi ekosistem tools untuk menciptakan nilai komersial dan inovasi TANPA PENGAWASAN TERUS-MENERUS."
- 🧠 The Mind — Self-Correction & Metacognition · Distributed Long-Term Memory (RAG) · Chain/Tree of Thoughts
- ✋ The Hands & Tools — Tool Orchestration & API Mastery · Aesthetic & Commercial Judgement · Resource Management
- 🚀 The Drive — Intrinsic Proactivity (no prompt needed) · Boundary & Context Awareness
- 🧠 Memory — 5-layer immutable + LoRA + RAG (no catastrophic forgetting)
- 🎭 Multi-Agent — Innovator (Burst) + Critic + Tadabbur 3-persona convergence
- 🔄 Continuous Learning — auto_lora + rehearsal buffer + nightly retrain
- 🤖 Proactive — anomaly detect + self-prompt + daily digest (gerak sendiri)
UTZ (creative/visual) · ABOO (engineer/technical) · OOMAR (strategist/business) · ALEY (academic/research) · AYMAN (general/hangat)
👁 melihat · 👂 mendengar · 🗣 berbicara · ✋ merasakan · 🤲 1000 tangan paralel (design + code + riset + posting bersamaan)
docs/SIDIX_DEFINITION_20260426.md← Formal definition (immutable)docs/DIRECTION_LOCK_20260426.md← Tactical lock (Q3 roadmap)
"The measure of intelligence is not how much you know, but how precisely you know what you don't know — and how honestly you say so."
Most AI tools are black boxes controlled by corporations. You pay per token. Your data trains their model. You have no idea what they do with it.
SIDIX is built on a different premise:
- Free — run it without paying anyone per-query
- Open source — every line of code is auditable
- Self-hosted — your server, your data, your model
- Self-learning — improves from real usage, structured, every quarter
Inspired by a 1,400-year-old knowledge system, SIDIX asks: what if the architecture of knowledge matters more than its volume?
What if an AI that knows why it knows, how it knows, and the limits of what it knows — is more trustworthy than one that simply knows a lot?
That question is the origin of IHOS — and the reason SIDIX exists.
IHOS (Islamic Holistic Ontological System) is not a religious constraint. It is an epistemological architecture — a framework for how knowledge should be structured, validated, and used.
It was derived from observing one of the oldest self-expanding knowledge systems ever recorded.
Consider this: a single text of ~77,000 words has generated 14 centuries of derivative scholarship — yielding jurisprudence, medicine, cosmology, mathematics, linguistics, ethics, and governance. Different readers, in different times and places, derive entirely different — yet internally consistent — bodies of knowledge from the same source.
This is not a coincidence of literary richness. It is a designed architecture.
The classical Islamic scholars identified it precisely:
| Qur'anic Layer | Technical Analog | SIDIX Implementation |
|---|---|---|
| Zahir — the explicit text | Frozen Foundation Model | Qwen2.5-7B + LoRA — immutable base weights |
| Batin — the latent meaning | Latent space / embeddings | BM25 + vector corpus — contextual retrieval |
| Asbabun Nuzul — grounded context | Grounded Generation | Every output grounded in brand_brief + user_state + platform_context |
| Sanad — the chain of transmission | Provenance tracking | [FACT] / [OPINION] / [UNKNOWN] labels + citation chain |
| Maqashid — the higher objectives | Objective function | 5-axis filter: life · intellect · faith · lineage · wealth |
| Ijtihad — reasoned interpretation | Agentic reasoning | agent_react.py ReAct loop |
| Naskh — abrogation/update | Knowledge conflict resolution | naskh_handler.py — sanad-tier based supersession |
| Tafakkur — deliberate reflection | Meta-cognition | muhasabah_loop.py — Niyah→Amal→Muhasabah self-refinement |
| Tadrij — progressive revelation | Curriculum learning | curriculum_engine.py L0→L4 knowledge ladder |
The key insight: Al-Qur'an's power doesn't come from size — it comes from architecture. A frozen core. Context-sensitive derivation. Verified transmission chains. Purpose-aligned interpretation. This is what SIDIX translates into code.
1. FROZEN CORE, LIVING EDGES
The base model doesn't retrain arbitrarily — it grows through structured LoRA adapters.
Like a text that never changes, but whose understanding deepens with the reader.
2. SOURCE CHAIN IS NON-NEGOTIABLE
Every claim is labeled. Every output has a traceable path.
Not because we're cautious — because honesty is a design constraint.
3. CONTEXT IS FIRST-CLASS INPUT
User state, brand context, time, platform — these are not metadata.
They are part of the inference function.
4. PURPOSE FILTERS KNOWLEDGE
Not all technically correct answers are appropriate answers.
Maqashid as objective function: output is evaluated against human flourishing, not just accuracy.
5. GROWTH IS STRUCTURAL, NOT INCIDENTAL
Self-learning isn't a feature. It's the architecture.
Daily corpus ingestion → curation → LoRA retrain → deploy. Every quarter, SIDIX improves.
SIDIX is not a chatbot with a nice UI. It's a three-layer cognitive agent running 100% on your own stack:
┌─────────────────────────────────────────────────────────────┐
│ LAYER 1 — BRAIN (LLM) │
│ Local LLM + optional adapter │
│ Generative inference — token by token, own stack │
│ No vendor API required for default mode │
└─────────────────────────┬───────────────────────────────────┘
│ ReAct loop (agent_react.py)
┌─────────────────────────▼───────────────────────────────────┐
│ LAYER 2 — HANDS (Tools + RAG) │
│ 35 active tools: │
│ ├── Knowledge: search_corpus · read_chunk · concept_graph │
│ ├── Web: web_fetch · web_search · pdf_extract │
│ ├── Code: code_sandbox · code_analyze · code_validate │
│ ├── Creative: generate_copy · brand_kit · plan_campaign │
│ ├── Image: text_to_image (SDXL self-hosted) │
│ ├── Multi-Agent: Raudah Protocol (parallel specialists) │
│ └── Growth: roadmap_* · workspace_* · muhasabah_refine │
└─────────────────────────┬───────────────────────────────────┘
│ daily cycle
┌─────────────────────────▼───────────────────────────────────┐
│ LAYER 3 — MEMORY (Growth Loop) │
│ 50+ open sources → corpus queue → curation → JSONL │
│ → adapter retrain (offline pipeline) → adapter deploy │
│ SIDIX gets smarter every quarter. Structurally. │
└─────────────────────────────────────────────────────────────┘
New in v0.6: Raudah (روضة المعرفة — Garden of Knowledge) — a multi-agent parallel system where specialists work concurrently and the Orchestrator synthesizes a consensus answer.
Task → RaudahOrchestrator.urai_task()
→ IHOS Guardrail (Maqashid check)
→ asyncio.gather([Researcher, Analyst, Writer, Engineer, Verifier])
→ RaudahOrchestrator.agregasi() ← Ijma' (consensus synthesis)
→ RaudahResult.jawaban_final
Unlike "swarm" architectures: no vendor API, IHOS guardrail before spawn, Sanad Validator per output.
import asyncio
from brain.raudah.core import run_raudah
result = asyncio.run(run_raudah("Research 5 productive waqf models in Southeast Asia"))
print(result.jawaban_final) # synthesized consensus
print(result.durasi_s) # e.g. 45.2s on RTX 3060SIDIX can optionally expose a local plugin server for compatible clients (e.g., Claude Desktop, Cursor, GPT Actions, Codex). This is not required for the default standing-alone mode.
Compatible client
│ MCP (stdio)
▼
apps/sidix-mcp/src/index.js ← 13 tools total
│
├── SIDIX brain backend
├── Extension bridge (optional)
└── Messaging bridge (optional)
Install & run (optional):
See integration docs under docs/ for client-specific setup. This is an adapter path, not a dependency.
Social tools (example):
| Tool | What it does |
|---|---|
scan_instagram_profile |
ER + sentiment + tier dari profil publik IG |
scan_threads_profile |
Analisis profil Threads |
scan_youtube_channel |
Engagement rate YouTube channel |
scan_twitter_profile |
Analisis X/Twitter profil |
analyze_social |
Analisis mendalam dari URL apapun |
compare_social_accounts |
Banding 2+ akun lintas platform |
social_post_threads |
Auto-post ke Threads (butuh token) |
wa_send |
Kirim pesan via messaging bridge (opsional) |
wa_receive |
Baca inbox via messaging bridge (opsional) |
| Domain | Agent / Tool | Status |
|---|---|---|
| Coding | code_sandbox · code_analyze · code_validate · project_map |
✅ Live |
| Self-awareness | self_inspect — SIDIX reads its own tool registry |
✅ Live |
| Copywriting | generate_copy (AIDA/PAS/FAB, 3 variants) |
✅ Live |
| Content Strategy | generate_content_plan (7/14/30-day calendar) |
✅ Live |
| Brand Building | generate_brand_kit (name + archetype + palette + voice) |
✅ Live |
| Visual Content | generate_thumbnail + text_to_image (SDXL) |
✅ Live |
| Campaign | plan_campaign (AARRR funnel + KPI) |
✅ Live |
| Ads | generate_ads (FB/Google/TikTok copy) |
✅ Live |
| Quality Gate | muhasabah_refine (CQF ≥ 7.0 loop) |
✅ Live |
| Multi-Agent | Raudah Protocol v0.1 (parallel specialists, local backbone) | ✅ Live |
| Knowledge Conflict | Naskh Handler (sanad-tier based resolution) | ✅ Live |
| Maqashid Filter | v2 mode-based: CREATIVE/ACADEMIC/IJTIHAD/GENERAL | ✅ Live |
| Self-Evolution | prompt_optimizer — L1 flywheel, weekly improvement |
✅ Live |
| Knowledge | BM25 corpus · Wikipedia · web_search · web_fetch | ✅ Live |
| Image | Image generation (local-first) | ✅ Live |
| Social Intelligence | scan_instagram_profile · scan_threads · scan_youtube · scan_twitter · analyze_social · compare_social_accounts |
✅ Live |
| Messaging Automation | wa_send · wa_receive (optional bridge) |
✅ Live |
| Plugin Server | Optional plugin server (13 tools via stdio MCP) | ✅ Live |
| Chrome Extension | Social Radar MV3 — DOM scrape + background service worker | ✅ Live |
| Voice / Video | Whisper + TTS + FFmpeg | 🗓 Sprint 8 |
| 3D / Gaming | Hunyuan3D + Blender API | 🗓 Sprint 9 |
| Raudah v0.2 | TaskGraph DAG + POST /raudah/run endpoint | 🗓 Next sprint |
SIDIX adapts its voice, depth, and framing based on who it's talking to. Each persona maps to a Maqashid mode — a different lens for evaluating and presenting knowledge.
| Persona | Character | Specialization | Maqashid Mode |
|---|---|---|---|
| AYMAN | Strategic Sage | Research synthesis, long-form, Islamic epistemology, vision | IJTIHAD |
| ABOO | The Analyst | Data, logic, structured argument, code review, decisions | ACADEMIC |
| OOMAR | The Craftsman | Technical deep-dives, system design, build & implementation | IJTIHAD |
| ALEY | The Learner | Teaching, curriculum, beginner-friendly, patient explanation | GENERAL |
| UTZ | The Generalist | Daily tasks, creative work, conversational, quick answers | CREATIVE |
# Auto-routing — SIDIX picks the right persona from your question
from brain_qa.persona import route_persona
result = route_persona("help me design a logo for a tech startup")
# → PersonaDecision(persona='AYMAN', confidence=0.63, reason='signal=creative/design')
# Or specify explicitly
from brain_qa.agent_react import run_react
session = run_react(question="audit this Python function", persona="ABOO")Backward compatible: legacy persona aliases are accepted internally, but never surfaced in public UI/content.
Requirements: Python 3.11+ · Node 18+ · 8 GB RAM (4 GB minimum with swap)
# 1. Clone
git clone https://github.com/fahmiwol/sidix.git
cd sidix
# 2. Install Python deps
pip install -r apps/brain_qa/requirements.txt
# 3. Prepare your local model runtime
# See `docs/` for supported runtimes and model setup.
# 4. Build knowledge index
python -m brain_qa index
# 5. Start backend (port 8765)
python -m brain_qa serve
# 6. Start UI (new terminal, port 3000)
cd SIDIX_USER_UI && npm install && npm run devTry it from CLI:
# Quick answer
python -m brain_qa ask "What is the IHOS framework?"
# Specify persona
python -m brain_qa ask "Buatkan copy iklan kopi lokal" --persona UTZ
# Run Raudah multi-agent
python -c "
import asyncio
from brain.raudah.core import run_raudah
r = asyncio.run(run_raudah('Research 3 fintech models for Islamic microfinance'))
print(r.jawaban_final)
"Live demo (free, no signup): app.sidixlab.com
SIDIX now has a soul — 7-pillar self-awareness system:
- Nafs (Pilar 1): 7-topic routing with persona character injection
- Aql (Pilar 2): Self-learning — every good interaction becomes training data (CQF ≥7.0)
- Qalb (Pilar 3): Health monitoring — auto-heals on degradation
- Hayat (Pilar 5): Self-iteration — refines answers when quality is low
Optional local plugin server for compatible clients (disabled by default).
Complete 7-pillar standalone modules in brain/ directory:
brain/nafs/— 3-layer knowledge fusion (60% parametric + 30% KG + 10% static)brain/aql/— Jariyah v2: capture→CQF→validate→storebrain/qalb/— SyifaHealer: 4-level health monitoringbrain/hayat/— generate→evaluate→refine loopbrain/ruh/— weekly evaluation + improvement planningbrain/ilm/— knowledge gap detection + auto-crawlbrain/hikmah/— QLoRA retrain trigger
SIDIX understands Indonesian slang, abbreviations, and typos gracefully. 4-layer stack: Normalizer → Semantic Matcher → Confidence Scorer → Context Responder
Host integration bridge (optional).
- 13 tools: 4 core (query, capture, learn, status) + 9 social intelligence
- Chrome Extension + WA Bridge + Extension Bridge
- OpenAPI spec (tooling integration)
- 7-topic routing (ngobrol/umum/kreatif/koding/sidix_internal/agama/etika)
- Persona character injection per response
- Self-learning training pairs (Aql)
- Health monitoring (Qalb)
- Nafs 3-layer wire to main agent
- Jariyah v3 real-time capture (thumbs feedback)
- Branch system (multi-client)
- PostgreSQL schema
- FLUX.1 image generation
- TTS (Piper)
- Code validator
- Sidebar UI Framework
- Image Editor v1
- Branch selector
- Raudah v2 multi-agent
- Brand Guidelines Maker
- Auto-retrain trigger
v0.9 beta target: Q3 2026 — Agency Kit + Raudah v2 + multimodal parity
For those who want to go deeper:
# Sanad (chain of transmission) → citation chain in every output
{
"answer": "...",
"label": "[FACT]", # Zahir — what is explicitly stated
"citations": [ # Sanad — who said it, where
{"source": "...", "sanad_tier": "primer", "chunk_id": "..."}
],
"maqashid_filter": "passed" # Maqashid — does this serve human flourishing?
}
# Naskh (abrogation) → knowledge conflict resolution
from brain_qa.naskh_handler import NaskhHandler
handler = NaskhHandler()
winner, status, reason = handler.resolve(old_item, new_item)
# → ("superseded", "New source has higher sanad tier (primer > aggregator)")
# Tafakkur (deliberate reflection) → muhasabah loop
def muhasabah_loop(output, brief):
niyah = validate_intent(brief) # Was the intention clear?
amal = score_cqf(output) # Was the action good? (CQF ≥ 7.0)
review = reflect_on_gaps(output) # What can be improved?
return refine(output) if amal < 7.0 else output
# Maqashid v2 — mode-based, not keyword blacklist
from brain_qa.maqashid_profiles import evaluate_maqashid, MaqashidMode
result = evaluate_maqashid(
user_query="Write copy for our coffee brand",
generated_output="Bold flavor, honest origin...",
mode=MaqashidMode.CREATIVE # → BOOST intellect & wealth, no block
)
# → {"status": "pass", "tagged_output": "...\n\n[Intellect-Optimized | Value-Creation Mode]"}See CONTRIBUTING.md for the full guide.
Short version — 3 ways to help:
- 📚 Add knowledge — open a PR with a
.mdfile inbrain/public/research_notes/. Any topic, any language. No coding required. - 🔧 Build tools — add new tools/agents to
apps/brain_qa/brain_qa/. See CONTRIBUTING.md. - 🧠 Contribute input — share notes, examples, or corrections through the project's contribution channels (see
CONTRIBUTING.md).
- ✅ No vendor API in inference pipeline — zero data leaves your server
- ✅ G1 Safety Policy — anti-injection, anti-PII, anti-toxic
- ✅ Maqashid v2 — intent-based filter, not keyword blacklist (creative-safe)
- ✅ Audit log (append-only, hash-chained) for every tool call
- ✅ Identity masking for public-facing endpoints
- ✅ 4-label epistemic tagging — hallucinations are labeled, not hidden
MIT License — see LICENSE.
Use it. Fork it. Teach it. Build on it.
Project website: sidixlab.com
"We don't build AI that replaces human judgment. We build AI that makes human judgment more informed."