Long-Term Memory (RAG) for your Mac

Fluent just got a massive brain upgrade. Save, recall, reference and synthesize your knowledge instantly. Write like Shakespeare with a single shortcut.

Fluent 1.7: Introducing Long-Term Memory for your Mac

Knowledge is King

Fluent 1.7 introduces Memory, a RAG (Retrieval-augmented generation) feature that lets you build personal local knowledge base from your documents and files. Memory in Fluent is model-agnostic and dynamic – meaning AI can decide on its own when it's appropriate to use it. Of course, you can always manually pick data you need.

Whether you're researching, writing documentation, or study materials, Memory's semantic search enables you to instantly find the data you need and feed it into AI for more grounded, fine-grained results. This data is stored locally on your device and can be used by any model with tool support.

On top of that, it's a surprisingly excellent feature for writing tasks. You can now replicate yourself or anybody else with almost 100% precision. Write blog and social media posts with ease, never worrying about whether your content "sounds like AI". Augment your writing – not create artificial "persona".

Overview of Memory groups in Fluent

Synthesize – Generate

The perfect use case of Memory knowledge comes from blending it. Imagine having an Email that you don't have time or mood to reply, but have to. Let's say your SaaS customer urgently asks you to provide information related to batch discounts.

Great. Now, the most important part is – Fluent knows your writing style, your voice and all of the nuances, because you provided examples of it in the past. In addition, Fluent contains extensive documentation around your SaaS, including pricing details.

Let's say you have an action with Auto Insert option enabled and you trigger it with your custom shortcut. Here's the action prompt:

text
Analyze this Email from my customer. Using {{ @"My Email Messages" }} as examples of my previous replies to customer letters, generate a reply that is valuable and concise.

The reply should be between 1 to 3 paragraphs, each 1-2 sentences. It should be easy to perceive, not over-intelligent (understandable by average college student), do not mix ideas. Vocabulary should not be over-intelligent, instead it should be slightly limited so a non-native English speaker can understand.

Make your best to 100% replicate my writing style, so that it’s not possible to identify that was written by anyone else.

I encourage you to use {{ @"Fluent Documentation" }} and {{ @"Future Plans" }} to identify information which it would be reasonable to include in the letter.

Return only the resulting reply, no thoughts, no explanations.

With this prompt Fluent on-par with AI will synthesize all the necessary information and generate a perfect response. Use SOTA models for the result that sounds exactly like you would have written. Simple like that. Here's the video showcasing this example:

Don't forget to enable app and/or browser context, so it's automatically attached, and Fluent knows on which message it should reply.

🔥 Hot Tip: if model struggles to format the response, refine your prompt.
For example, add stricter requirements, like:
Critical: FORMAT the resulting reply. Apply newlines, use paragraphs, etc.

Use Cases

While semantic search is the most wanted use case of a knowledge base, writing texts that do not sound like a bot/AI, and are not detectable as written by AI – is a huge advantage in content creation.

Writing social media and blog posts, chats and Email messages, essays and even generating satire and humor is possible with Fluent's Memory engine with a proper context, data and prompt.

Besides of writing, here're some ideas where Memory could be a great supercharger:

  • Knowledge base search
  • Research and data analysis
  • Troubleshooting
  • Document analysis

How Memory Works?

Memory in Fluent is designed based on the hybrid RAG architecture. It's equally performant for both semantic search and writing tasks.

In Fluent, you create a Memory Group to store your data. Memory Group is a set of files and dynamic notes (we'll discuss it a bit later), which are efficiently indexed and stored locally on your Mac. Fluent is supplied with 3 default groups: My Profile, My Writing Style and My Projects. Feel free to update them accordingly to supply some initial source of knowledge

In order for Memory feature to function, you would also need to explicitly enable it under Settings > Integrations – as Memory is a built-in MCP integration.

Organizing Your Memory

Here's couple of recommendations and tricks to follow when organizing your local knowledge base. Core principle is to keep your memory groups lean and domain-oriented. It means: do not push every single message, PDF, Excel sheet into a single basket. And do not organize by file types – that's absolutely unnecessary.

Instead, separate your knowledge groups by topic, like:

  • My Profile (About Me)
  • My Email Replies
  • My Reddit Posts
  • My Discord Messages
  • Invoices
  • Favorite Quotes
  • Steam Wishlist
  • ... and so on

This way Fluent will search through vast amounts of your knowledge efficiently, utilizing only the relevant data.

Keeping Memory Up-to-Date

Fluent currently does not automatically reindex your memory. If you update the files you have added to Fluent, you will need to reindex them manually by going into Settings > Memory, opening the group of interest and clicking "Reindex".

💡 Dynamic notes do not need to be reindexed – Fluent already handles it for you.

Background reindexing is already close in the roadmap.

Getting Started

Fluent 1.7 is available now. Existing users can update directly, and new users can download the app to experience the power of AI transformation on Mac.

Excited? Give it a try!

Download Fluent for Free

What’s Next

We will now focus on bringing further smaller features and improvements based on your feedback, e.g. Background Actions, OpenAI/Gemini subscription support. Additionally, there are lots of exciting things related to Scheduled Actions coming soon.

Stay tuned for more updates, and as always, we'd love to hear your feedback and feature requests in our Discord server.