Friday 7/4/25
-- "DeepSeek-V3 on M4 Mac: Blazing Fast Inference on Apple Silicon", DigiAlps
Of course, this impressive achievement required a considerable amount of code … 😔
- Buy a Mac Mini, the "server", with M4 chip and at least 16 GB ram, 256 GB solid state storage (but extensible via external SSD).
- Deploy a Mac desktop or laptop as the "workstation"
- Buy/obtain inexpensive a monitor, mouse, keyboard for the Mini. These devices will rarely be used, mainly when configuring hardware and software; otherwise the Mini will be “headless”. All queries to the models on the Mini will be done remotely from the Mac.
- Buy a Thunderbolt cable. Warning: Be sure you buy a Thunderbolt cable, not a USB-C cable). While they look alike and will physically fit the same ports, only a proper Thunderbolt cable will enable the Thunderbolt bridge connection between the two Macs.
- Connect monitor, mouse, and keyboard to the Mini.
- Designate a Mac (desktop or laptop) as the client "workstation" for the Mini.
- Connect a Mac Thunderbolt port to a Mini Thunderbolt port with the Thunderbolt cable.
- Apple > System Settings > Network
- Click … "Three Dots ..." on right side of screen below the "Ethernet" label
- Click "Add Service"
- Select "Thunderbolt Bridge" > "Create"
- Click "Thunderbolt Bridge" > “Details” > “TCP/IP”
- Wait until address and mask appear in the IP address and Mask boxes.
- Click "OK"
- System Settings > General > “Sharing”
- Turn on “Screen Sharing”
- Turn on “File Sharing”
- System Settings > Energy
- Turn on “Prevent automatic sleeping when display is off”
- Turn on “Wake for network access”
- Apple > System Settings > Network
- Click three dots “..." on right side of screen below "Ethernet" label.
- Click "Add Service"
- Select "Thunderbolt Bridge" > "Create"
- Click "Thunderbolt Bridge" > “Details” > “TCP/IP”.
- Wait until address and mask appear in IP address and Mask boxes.
- Click "OK".
Use Finder to check your connection. - Finder > Go > Network ... the Mac name should appear.
- Click the name to display list of folders at lowest level on the Mini.
- Open Finder > Go > Network
- Double-click your Mac mini’s name when it appears.
- Click the “Share Screen” button in far right upper corner.
- Click "Continue" button.
- A full copy of what's currently on the Mini screen will be displayed in a big window on the Mac.
1. Configure the Notes app on the Mini with local pages where you will securely store prompts and responses locally, a/k/a a "vault" that is inaccessible to iCloud … and Apple.
- On the Mac start a remote session on your Mini.
(See step *** A. 12, above) - Notes > Settings > Enable On my Mac account
- Set Locked Notes to "On My Mac"
- Set a strong password for the locked notes
- Suggestion: Always include various #hashtags in your notes to make each one easier to find via Notes "Search". There are no folders in the locked notes storage.
- Continue the remote session on the Mini
or start a new session (See step *** A. 12, above) - Safari > Settings > "Safari is not your default browser"
- Select "Safari"
- Continue the remote session on the Mini
or start a new session (See step *** A. 12, above) - Go to the App Store.
- Download DuckDuckGo to the Mini.
- Safari > Settings > Search ... SearchEngine ... Select "DuckDuckGo".
Note: This selection may try to set DuckDuckGo as the default browser. If so, clear this selection. Safari offers many privacy protections beyond those offered by DuckDuckGo.
that hosts the models and a chatbot that provides user access to the models.
- On the Mac start a remote session on the Mini.
(See step *** A. 12, above) - Open Safari
Go to: https://lmstudio.ai
- Click “Download for macOS” button on the homepage.
-- Wait for the .dmg file to download
-- The file is named something like LM-Studio.dmg.
-- It will appear in your Downloads folder. - Double-click file name in the Downloads folder
- Drag LM Studio icon into the Applications folder
- Open Launchpad > Double-click LM Studio icon to launch it.
-- If macOS asks if you’re sure you want to open it, click Open.
-- Allow Full Disk Access (Optional but recommended) - *** Search for and download models from the LM Server home page.
Note: Click anywhere on the image of the LM Server home page (below) to make it larger and more readable.
-- Click on the Search icon 🔍 in the left sidebar. This opens the model catalog.
-- In the search bar at the top of the app window, type the full name of the model.
-- Click the Download button next to it.
-- Wait for download … The download will start and show progress … Once complete, the model will appear in the “My Models” tab.
-- When download is complete, click the "Load model" button.
Here are my [Claude's] recommendations with the exact GGUF specifications for your Mac Mini M4 with 16GB RAM: ... [GGUF stands for “GPT-Generated Unified Format” - it’s a file format specifically designed for storing and running large language models efficiently.]
*** Best General Foundation Model:
-- mistral-7b-instruct-v0.3.Q5_K_M.gguf (~5GB)
-- Your reliable daily driver with excellent quality/performance balance.
An Alternative General Foundation Model:
-- Qwen2.5-7B-Instruct.Q5_K_M.gguf
*** Specialized Models Worth Exploring:
For Medical/Health Questions:
• meditron-7b.Q5_K_M.gguf (~5GB) - Built on Llama 2, trained on medical literature
• biomistral-7b.Q5_K_M.gguf (~5GB) - Mistral fine-tuned for biomedical tasks
For Coding Help/Tech Explanations:
• codellama-7b-instruct.Q5_K_M.gguf (~5GB) - Excellent for explaining code and technical concepts
• starcoder-7b.Q5_K_M.gguf (~5GB) - Good alternative for multiple programming languages
For Creative Writing/Content:
• zephyr-7b-beta.Q5_K_M.gguf (~5GB) - Llama 2-based, excellent for conversational and creative tasks
• openhermes-2.5-mistral-7b.Q5_K_M.gguf (~5GB) - Great for storytelling and creative writing
For Research/Analysis:
• nous-hermes-2-mixtral-8x7b.Q4_K_M.gguf (~26GB) - Complex reasoning, uses most of your RAM
• wizardlm-7b.Q5_K_M.gguf (~5GB) - Good for following complex instructions
For Legal/Professional Writing:
• saul-7b.Q5_K_M.gguf (~5GB) - Trained on legal documents (not legal advice)
*** Setup Strategy:
1. Start with mistral-7b-instruct-v0.3.Q5_K_M.gguf
2. Add 2-3 specialized models based on your interests
Each model name is exactly what you’ll search for when downloading!
Note: None of these models remember previous prompts and responses. So a "prompt engineering" strategy is recommended.
-- 1. Compose and carefully edit your prompts on the Mac using Notes in iCloud.
-- 2. Copy the edited prompt during a remote session from Notes in iCloud on the Mini into the LM Server chatbot's prompt box.
-- 3. Copy the prompt and the model's response into the Notes "On my Mac" vault, adding appropriate #hashtags for easier search and retrieval.
-- 4. Delete the original prompt from Notes in iCloud.
- "The failure of Apple's leadership to comprehend the impact of the generative AI revolution on Internet search" 5/16/25
- "Apple's dilemma: Pleasing its iPhone users AND its long term investors", 6/12/25
- "Powerful AI data centers in the home/office using Apple technology, Part 1 -- Greatly reduced carbon footprints", 6/28/25
No comments:
Post a Comment
Your comments will be greatly appreciated ... Or just click the "Like" button above the comments section if you enjoyed this blog note.