
feat(swiftbuddy): MemPalace v1, native macOS theming, HF model manage…#18

Open
solderzzc wants to merge 65 commits into main from
feature/swiftbuddy-mempalace-v1

Conversation

@solderzzc
Member

…ment, TDD harness

SwiftBuddy App:

  • Adaptive macOS theming via native NSColor semantics (Light/Dark mode)
  • Removed confusing segmented picker from ModelPickerView
  • Embedded HuggingFace search directly into ModelManagementView
  • Added 'Strict MLX Formatting Only' toggle for HF search filtering
  • Fixed ServerManager ghost isOnline state (deferred until NIO bind succeeds)
  • Fixed missing 'pickaxe' SF Symbol → 'hammer.fill'
  • Bulletproof JSON extraction via boundary scanning (cleanJSON)
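The boundary-scanning idea behind cleanJSON can be sketched as follows. This is an illustrative assumption, not the PR's actual implementation: scan to the first `{`, then walk forward tracking brace depth and string/escape state, and return the first balanced object, which discards any chatter an LLM wraps around its JSON.

```swift
import Foundation

/// Extract the first balanced JSON object from noisy LLM output.
/// (Sketch only; the PR's cleanJSON may differ in details.)
func cleanJSON(_ raw: String) -> String? {
    guard let start = raw.firstIndex(of: "{") else { return nil }
    var depth = 0
    var inString = false
    var escaped = false
    var index = start
    while index < raw.endIndex {
        let ch = raw[index]
        if escaped {
            escaped = false            // previous char was a backslash
        } else if ch == "\\" {
            escaped = true
        } else if ch == "\"" {
            inString.toggle()          // braces inside strings don't count
        } else if !inString {
            if ch == "{" { depth += 1 }
            if ch == "}" {
                depth -= 1
                if depth == 0 {
                    return String(raw[start...index])
                }
            }
        }
        index = raw.index(after: index)
    }
    return nil  // unbalanced input: no complete object found
}
```

Scanning for a balanced boundary is more robust than a regex here because JSON nests arbitrarily, and string-aware depth tracking survives braces embedded in values.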

MemPalace Core:

  • Wing → Room → MemoryEntry architecture (SwiftData)
  • Apple NLEmbedding vector search with cosine similarity
  • ExtractionService for LLM-based memory mining
  • 3 tool-calling schemas (save_fact, search, list_rooms)
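A minimal sketch of the vector-search step, assuming the straightforward pairing of Apple's `NLEmbedding` with a hand-rolled cosine similarity (the ranking loop and query text below are illustrative, not taken from the PR):

```swift
import NaturalLanguage

/// Cosine similarity between two equal-length embedding vectors.
func cosineSimilarity(_ a: [Double], _ b: [Double]) -> Double {
    guard a.count == b.count, !a.isEmpty else { return 0 }
    var dot = 0.0, normA = 0.0, normB = 0.0
    for i in a.indices {
        dot += a[i] * b[i]
        normA += a[i] * a[i]
        normB += b[i] * b[i]
    }
    let denom = normA.squareRoot() * normB.squareRoot()
    return denom == 0 ? 0 : dot / denom
}

// Rank stored memory texts against a query (hypothetical usage):
if let embedding = NLEmbedding.sentenceEmbedding(for: .english),
   let queryVec = embedding.vector(for: "where is the car parked?") {
    let memories = ["the car is on level 3", "dinner is at eight"]
    let ranked = memories
        .compactMap { text in
            embedding.vector(for: text).map { (text, cosineSimilarity(queryVec, $0)) }
        }
        .sorted { $0.1 > $1.1 }
    print(ranked.first?.0 ?? "no match")
}
```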

Testing Infrastructure:

  • SwiftBuddyTests target in Package.swift
  • ExtractionServiceTests (3 adversarial JSON parsing tests)
  • HFModelSearchTests (strict/loose MLX filter network tests)
  • Persistent TDD harness system at .agents/harness/
    • Level 1: Memory handling (9 features, 3 passing)
    • Level 2: Model management (10 features, 2 passing)
    • Level 3: MemPalace parity audit vs milla-jovovich/mempalace (34 features, ~22%)

Build:

  • generate_xcodeproj.py updated with Hummingbird + SwiftSoup deps
  • Network entitlements for HF API access
  • arm64-only architecture enforcement

solderzzc added 30 commits April 7, 2026 18:53
- Injected export pipeline guaranteeing MLX metal library initialization hooks bypass Github Action test environments natively
- Introduced currentWing target on ChatViewModel for persona routing
- Intercepted userText explicitly searching SwiftData native memories
- Prepended retrieved factual context invisibly inside system prompts so that even small, weak models retain stable context without added user-visible latency
…rference and make downloaded models directly tappable to load
…o prevent macOS layout recursion crashes resulting in blank models
…cursive background querying for HF Hub discovery
…skeleton constraints for HuggingFace Hub modal layout
…ng to RegistryService to trace GitHub API access drops
…hed persona.json and statically request known room txt files
… WAL transaction flooding during massive persona corpus ingestion
…oops by converting TextEditor blocks to vertical TextFields inside iOS/macOS active ScrollViews
…ine and introduce Native graphical Map hierarchy for memory rooms
…natively into ChatView toolbars for RAG identity mapping
…tly reflect the currently selected memory persona wing
…try and pivot root Navigation to a primary Friends List model
…ectures by forcefully prepending RAG variables linearly against raw User instructions rather than allocating hostile System Role bounds
solderzzc and others added 28 commits April 8, 2026 16:48
…cks and trap silent HF snapshot failures to guarantee observable developer console logs
…serve KV Prefix caching continuity across MLX generations, and patch RPG Thought UI aesthetics
…r to reject raw boilerplate text and prevent small parameter LLM line-by-line regurgitation
…ridging and append Persona deletion traps in UI
…ap to prevent multiline Persona RAG directives from leaking into user UI bubbles
…he ModelPicker sheets to display real-time global download speeds and ETA dynamically
… Qwen 3 and Qwen 3.5 exclusivity as requested
…mpalace-v1

# Conflicts:
#	.github/workflows/build.yml
#	scripts/profiling/profile_runner.py
… and DMG packaging pipeline via Github Actions
…d() calls during MoE streaming

MoE routing exhibits strong temporal locality — adjacent tokens frequently
route to the same experts (60-70% overlap). This cache stores recently-loaded
quantized expert weight matrices in a bounded LRU (default 2048 entries) keyed
by (safetensorsPath, tensorName, expertIndex).

On cache hits, the entire pread() → allocator::malloc → eval cycle is skipped,
yielding zero I/O latency for repeated expert accesses. Cache hit/miss metrics
are logged to stderr every 10 seconds alongside existing SSD stream stats.

The cache is automatically cleared on model unload to prevent stale weights
and free unified memory.
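The cache described above can be sketched as a bounded LRU keyed by (safetensorsPath, tensorName, expertIndex). The key shape, default capacity of 2048, and clear-on-unload behavior follow the PR text; the Swift types and the simple array-backed recency list are illustrative assumptions (the real cache presumably lives alongside the MLX/Metal runtime code):

```swift
import Foundation

/// Cache key for one quantized expert weight matrix, per the PR description.
struct ExpertKey: Hashable {
    let safetensorsPath: String
    let tensorName: String
    let expertIndex: Int
}

/// Bounded LRU: on a hit, the pread() -> malloc -> eval cycle is skipped.
final class HotExpertCache<Value> {
    private let capacity: Int
    private var storage: [ExpertKey: Value] = [:]
    private var order: [ExpertKey] = []   // front = least recently used
    private(set) var hits = 0
    private(set) var misses = 0

    init(capacity: Int = 2048) { self.capacity = capacity }

    func value(for key: ExpertKey, load: () -> Value) -> Value {
        if let cached = storage[key] {
            hits += 1
            order.removeAll { $0 == key }  // O(n); a linked list would make this O(1)
            order.append(key)
            return cached
        }
        misses += 1
        let loaded = load()                // stands in for pread() + dequantization
        storage[key] = loaded
        order.append(key)
        if storage.count > capacity {
            let evicted = order.removeFirst()
            storage.removeValue(forKey: evicted)
        }
        return loaded
    }

    /// Called on model unload to drop stale weights and free unified memory.
    func removeAll() {
        storage.removeAll()
        order.removeAll()
    }
}
```

With 60-70% of adjacent tokens routing to the same experts, most lookups take the hit path, which is why warm-run throughput improves so sharply over cold SSD streaming.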
…enchmark

Results with Hot Expert LRU Cache active:
- SSD + 16-Worker Prefetch: 3.8 tok/s, 5.95s TTFT, 34.9 GB GPU
- SSD + TurboQuant: 3.0 tok/s, 9.46s TTFT, 34.9 GB GPU
- SSD Stream (cold): 0.01 tok/s, 299.66s TTFT, 88.2 GB GPU

The expert cache eliminates ~60-70% of redundant pread() calls on warm
runs, delivering a 300x+ improvement over cold SSD streaming.
@solderzzc force-pushed the feature/swiftbuddy-mempalace-v1 branch from 102ef78 to 278ea04 on April 10, 2026 03:55