User
Write something
🔒 Q&A w/ Nate is happening in 6 days
Pinned
🎉 AIS+ Just Won the Skool Games
AI Automation Society Plus just won Q3 Skool Games, back-to-back 1st place finished and top 5 in all three quarters of 2025. This wouldn't be possible without each of you helping us make AI Automation Society a space for everyone. Truly grateful for this amazing community we're building together. Quick heads up: Plus pricing increases in Q4. If you've been thinking about joining, now's the time. We'll give everyone a full week's notice before any price changes. Check it out here. Thank you all for the incredible support. You guys are the best! Cheers, Nate
🎉 AIS+ Just Won the Skool Games
Pinned
🚀New Video: How to Automate ANY Content with Poppy and n8n (no code)
In this video, I show how to use Poppy AI with n8n to completely automate your content creation process. $25 OFF Poppy AI HERE Poppy lets you build specialized AI assistants that can be trained on almost anything, YouTube videos, TikToks, Instagram posts, websites, or even your own local files. Just drag and drop the media, and your Poppy assistant can use it to get smarter. From there, you can connect Poppy with n8n to automate research, content generation, or any part of your creative workflow. The best part is that it’s all no-code, making it simple and powerful for anyone to start building AI-driven content systems.
Pinned
Please Read | Rules and Guidelines 📜
1) 🚫 No Business Promotions → NO “DM me for…” or "Comment 'Automation'" posts. 2) 🔗 No Linking Your Own Community/YouTube Videos 3) 🏷️ Title Specifically 4) 🔍 Search for Help First (searchbar) 5) 🙌 Stay Respectful 6) ❌ Enforced Clean‑Up Posts that break these rules will be removed without warning. If you ever have questions, feel free to ask. Let’s make this the best AI Automation community out there by sharing, collaborating, and learning together. 🚀
How to Fix Data Ingestion Updates in RAG
One of the main problems with RAG systems is keeping the knowledge base fresh and accurate. - Internal documents are constantly updated in Google Drive. - Public information on the website (blog, product pages, docs) changes regularly. - Old files need to be removed, otherwise the AI risks retrieving outdated data. I just built an ingestion workflow for a SaaS client that solves these issues. Here’s how it works: 1. Continuous monitoring - Google Drive triggers for file creation, updates, and deletions. - Monthly website scraping with Firecrawl to refresh all key URLs. 2. Smart updates - Each document is hashed. If hash unchanged → skip. - If changed → old embeddings are deleted from Postgres/PGVector and replaced with new ones. - Deleted files in Drive also delete their vectors automatically. 3. Metadata for better retrieval - GPT-4.1 classifies every document as **internal** or **external** and generates a one-sentence summary. - Metadata like `file_id`, `doc_type`, and `summary` ensures more precise retrieval. 4. Vectorization pipeline - Content is normalized, split into chunks with overlap. - OpenAI embeddings are created and stored in **PGVector**. - A record manager table tracks file IDs + hashes. Result: The RAG agent always has access to the latest, cleaned, and properly categorized knowledge, both from internal docs and external web pages. No stale data, no duplicates, no hallucinations from outdated sources. If you’re building RAG systems, I’d argue this ingestion & update layer is the real bottleneck for accuracy, not just the retrieval model itself. Hope that helps!
How to Fix Data Ingestion Updates in RAG
INTRODUCTION
I'm just joining this community , hope I'm welcome
1-30 of 7,263
AI Automation Society
skool.com/ai-automation-society
A community for mastering AI-driven automation and AI agents. Learn, collaborate, and optimize your workflows!
Leaderboard (30-day)
Powered by