MONDAY MODEL DROP — April 20, 2026
Welcome to the first-ever Monday Model Drop. Every Monday, I break down one model worth your attention — with install commands, benchmarks, and a real-world prompt you can copy/paste. No fluff. No hype. Just what works.

THIS WEEK'S MODEL: Llama 3.1 8B (Q4_K_M quantization)

WHAT IT'S FOR: Your first local AI workhorse — general conversation, summarization, document drafting.

WHY IT MATTERS: This model proves you don't need a $3,000 GPU to run serious AI locally. 8B parameters, 4-bit quantized, runs on almost anything with 8GB+ VRAM or a modern Mac.

INSTALL (copy-paste this):
ollama pull llama3.1:8b

RUN IT:
ollama run llama3.1:8b

TRY THIS PROMPT (copy-paste this):
"You are a financial analyst. Summarize the following quarterly earnings data into a 3-paragraph executive briefing. Focus on revenue trends, margin changes, and one forward-looking risk. Keep it under 250 words."

Then paste in any earnings data, financial report, or even a news article. Watch what happens.

HARDWARE REQUIREMENTS:
Minimum: 8GB VRAM (RTX 3060, 4060) or 16GB unified memory (M1 Pro+)
Recommended: 16GB VRAM or 32GB unified
Speed on RTX 4060 Ti 16GB: ~45 tokens/sec
Speed on M4 Pro 48GB: ~35 tokens/sec
Speed on RTX 3060 12GB: ~28 tokens/sec

ERIC'S TAKE:
Llama 3.1 8B is your baseline. If you can only run one model, run this one. It handles 80% of business use cases well enough that you'll question why you were paying for API calls. For complex reasoning, step up to 70B or use a cloud model — but for drafting, summarizing, Q&A, and routine document work, this is the right first move.

The goal this week: pull the model, run the prompt above, and post your results (speed + quality) in the comments. Let's see what your rigs can do.

— Eric
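P.S. for the scripters among you: once the model is pulled, you can run this week's prompt programmatically instead of typing it into the REPL. Ollama serves a local REST API at http://localhost:11434 by default, and its /api/generate response includes eval_count and eval_duration (nanoseconds), which is exactly the tokens/sec number I'm asking you to post. The sketch below is a rough starting point, not an official Ollama client: the build_payload and summarize names are mine, and the sample earnings line in the usage comment is made up. It assumes a local Ollama server is already running.

```python
import json
import urllib.request

# Ollama's default local endpoint (assumption: default install, default port).
OLLAMA_URL = "http://localhost:11434/api/generate"

# This week's prompt from the newsletter; your data gets appended after it.
PROMPT = (
    "You are a financial analyst. Summarize the following quarterly "
    "earnings data into a 3-paragraph executive briefing. Focus on revenue "
    "trends, margin changes, and one forward-looking risk. Keep it under "
    "250 words.\n\n"
)

def build_payload(data: str, model: str = "llama3.1:8b") -> dict:
    """Assemble the JSON body for one non-streaming generation request."""
    return {"model": model, "prompt": PROMPT + data, "stream": False}

def summarize(data: str) -> tuple[str, float]:
    """Send the prompt + data to Ollama; return (summary, tokens/sec)."""
    body = json.dumps(build_payload(data)).encode()
    req = urllib.request.Request(
        OLLAMA_URL, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        out = json.load(resp)
    # eval_duration is reported in nanoseconds, so convert before dividing.
    tok_per_sec = out["eval_count"] / (out["eval_duration"] / 1e9)
    return out["response"], tok_per_sec

# Example usage (needs a running Ollama server; data below is invented):
#   summary, speed = summarize("Q3 revenue: $4.2M, up 12% QoQ; gross margin 61%")
#   print(summary)
#   print(f"~{speed:.0f} tokens/sec")
```

Swap in your own data string and you get the briefing plus your rig's speed in one shot — easy to paste straight into the comments.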