    How to Build a Self-Organizing Agent Memory System for Long-Term AI Reasoning 

    By Naveed Ahmad | February 15, 2026


    In this tutorial, we build a self-organizing memory system for an agent that goes beyond storing raw conversation history and instead structures interactions into persistent, meaningful knowledge units. We design the system so that reasoning and memory management are clearly separated, allowing a dedicated component to extract, compress, and organize information. At the same time, the main agent focuses on responding to the user. We use structured storage with SQLite, scene-based grouping, and summary consolidation, and we show how an agent can maintain useful context over long horizons without relying on opaque vector-only retrieval.

    import sqlite3
    import json
    import re
    from datetime import datetime
    from typing import List, Dict
    from getpass import getpass
    from openai import OpenAI
    
    
    OPENAI_API_KEY = getpass("Enter your OpenAI API key: ").strip()
    client = OpenAI(api_key=OPENAI_API_KEY)
    
    
    def llm(prompt, temperature=0.1, max_tokens=500):
       return client.chat.completions.create(
           model="gpt-4o-mini",
           messages=[{"role": "user", "content": prompt}],
           temperature=temperature,
           max_tokens=max_tokens
       ).choices[0].message.content.strip()

    We set up the core runtime by importing all required libraries and securely collecting the API key at execution time. We initialize the language model client and define a single helper function that standardizes all model calls. We ensure that every downstream component relies on this shared interface for consistent generation behavior.
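    One optional hardening step, not part of the original setup, is to wrap this shared helper in a simple retry loop so transient API errors do not interrupt a long-running session; the retry count and backoff values below are illustrative assumptions.

    import time


    def llm_with_retry(prompt, temperature=0.1, max_tokens=500, retries=3, backoff=2.0):
        # Retry the shared llm() helper with exponential backoff on transient errors.
        # The retries and backoff defaults are illustrative, not values from the tutorial.
        for attempt in range(retries):
            try:
                return llm(prompt, temperature=temperature, max_tokens=max_tokens)
            except Exception:
                if attempt == retries - 1:
                    raise
                time.sleep(backoff * (2 ** attempt))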

    class MemoryDB:
       def __init__(self):
           self.db = sqlite3.connect(":memory:")
           self.db.row_factory = sqlite3.Row
           self._init_schema()
    
    
       def _init_schema(self):
           self.db.execute("""
           CREATE TABLE mem_cells (
               id INTEGER PRIMARY KEY,
               scene TEXT,
               cell_type TEXT,
               salience REAL,
               content TEXT,
               created_at TEXT
           )
           """)
    
    
           self.db.execute("""
           CREATE TABLE mem_scenes (
               scene TEXT PRIMARY KEY,
               summary TEXT,
               updated_at TEXT
           )
           """)
    
    
           self.db.execute("""
           CREATE VIRTUAL TABLE mem_cells_fts
           USING fts5(content, scene, cell_type)
           """)
    
    
       def insert_cell(self, cell):
           self.db.execute(
               "INSERT INTO mem_cells VALUES(NULL,?,?,?,?,?)",
               (
                   cell["scene"],
                   cell["cell_type"],
                   cell["salience"],
                   json.dumps(cell["content"]),
                   datetime.utcnow().isoformat()
               )
           )
           self.db.execute(
               "INSERT INTO mem_cells_fts VALUES(?,?,?)",
               (
                   json.dumps(cell["content"]),
                   cell["scene"],
                   cell["cell_type"]
               )
           )
           self.db.commit()

    We define a structured memory database that persists information across interactions. We create tables for atomic memory units, higher-level scenes, and a full-text search index to enable symbolic retrieval. We also implement the logic to insert new memory entries in a normalized and queryable form.

        def get_scene(self, scene):
           return self.db.execute(
               "SELECT * FROM mem_scenes WHERE scene=?", (scene,)
           ).fetchone()
    
    
       def upsert_scene(self, scene, summary):
           self.db.execute("""
           INSERT INTO mem_scenes VALUES(?,?,?)
           ON CONFLICT(scene) DO UPDATE SET
               summary=excluded.summary,
               updated_at=excluded.updated_at
           """, (scene, summary, datetime.utcnow().isoformat()))
           self.db.commit()
    
    
       def retrieve_scene_context(self, query, limit=6):
           tokens = re.findall(r"[a-zA-Z0-9]+", query)
           if not tokens:
               return []
    
    
           fts_query = " OR ".join(tokens)
    
    
           rows = self.db.execute("""
           SELECT scene, content FROM mem_cells_fts
           WHERE mem_cells_fts MATCH ?
           LIMIT ?
           """, (fts_query, limit)).fetchall()
    
    
           if not rows:
               rows = self.db.execute("""
               SELECT scene, content FROM mem_cells
               ORDER BY salience DESC
               LIMIT ?
               """, (limit,)).fetchall()
    
    
           return rows
    
    
       def retrieve_scene_summary(self, scene):
           row = self.get_scene(scene)
           return row["summary"] if row else ""

    We focus on memory retrieval and scene maintenance logic. We implement safe full-text search by sanitizing user queries and adding a fallback strategy when no lexical matches are found. We also expose helper methods to fetch consolidated scene summaries for long-horizon context building.
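    To sanity-check the storage and retrieval path without any model calls, a hand-written cell can be inserted and queried directly; the cell contents and queries below are illustrative and assume the MemoryDB class defined above.

    demo_db = MemoryDB()
    demo_db.insert_cell({
        "scene": "project_alpha",
        "cell_type": "fact",
        "salience": 0.8,
        "content": "The agent must retain project context across sessions."
    })

    # Lexical hit: "sessions" appears in the stored content, so FTS returns the cell.
    for row in demo_db.retrieve_scene_context("remember across sessions"):
        print(row["scene"], json.loads(row["content"]))

    # No lexical overlap: the method falls back to the highest-salience cells.
    for row in demo_db.retrieve_scene_context("completely different topic"):
        print(row["scene"], json.loads(row["content"]))

    The first query exercises the full-text index, while the second exercises the salience-ranked fallback.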

    class MemoryManager:
       def __init__(self, db: MemoryDB):
           self.db = db
    
    
       def extract_cells(self, user, assistant) -> List[Dict]:
           prompt = f"""
    Convert this interaction into structured memory cells.
    
    
    Return JSON array with objects containing:
    - scene
    - cell_type (fact, plan, preference, decision, task, risk)
    - salience (0-1)
    - content (compressed, factual)
    
    
    User: {user}
    Assistant: {assistant}
    """
           raw = llm(prompt)
           raw = re.sub(r"```json|```", "", raw)
    
    
           try:
               cells = json.loads(raw)
               return cells if isinstance(cells, list) else []
           except Exception:
               return []
    
    
       def consolidate_scene(self, scene):
           rows = self.db.db.execute(
               "SELECT content FROM mem_cells WHERE scene=? ORDER BY salience DESC",
               (scene,)
           ).fetchall()
    
    
           if not rows:
               return
    
    
           cells = [json.loads(r["content"]) for r in rows]
    
    
           prompt = f"""
    Summarize this memory scene in under 100 words.
    Keep it stable and reusable for future reasoning.
    
    
    Cells:
    {cells}
    """
           summary = llm(prompt, temperature=0.05)
           self.db.upsert_scene(scene, summary)
    
    
       def update(self, user, assistant):
           cells = self.extract_cells(user, assistant)
    
    
           for cell in cells:
               self.db.insert_cell(cell)
    
    
           for scene in set(c["scene"] for c in cells):
               self.consolidate_scene(scene)

    We implement the dedicated memory management component responsible for structuring experience. We extract compact memory representations from interactions, store them, and periodically consolidate them into stable scene summaries. We ensure that memory evolves incrementally without interfering with the agent’s response flow.
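    Because extract_cells depends on the model returning well-formed JSON objects, a defensive variant can drop entries that are missing required fields before insertion; this validation helper is an added assumption for robustness rather than part of the original pipeline.

    REQUIRED_KEYS = {"scene", "cell_type", "salience", "content"}


    def validate_cells(cells):
        # Keep only dict entries that carry every required field and a numeric salience.
        valid = []
        for cell in cells:
            if not (isinstance(cell, dict) and REQUIRED_KEYS.issubset(cell)):
                continue
            try:
                cell["salience"] = float(cell["salience"])
            except (TypeError, ValueError):
                continue
            valid.append(cell)
        return valid

    Calling validate_cells on the output of extract_cells inside update would prevent a single malformed entry from raising a KeyError during insertion.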

    class WorkerAgent:
       def __init__(self, db: MemoryDB, mem_manager: MemoryManager):
           self.db = db
           self.mem_manager = mem_manager
    
    
       def answer(self, user_input):
           recalled = self.db.retrieve_scene_context(user_input)
           scenes = set(r["scene"] for r in recalled)
    
    
           summaries = "\n".join(
               f"[{scene}]\n{self.db.retrieve_scene_summary(scene)}"
               for scene in scenes
           )
    
    
           prompt = f"""
    You are an intelligent agent with long-term memory.
    
    
    Relevant memory:
    {summaries}
    
    
    User: {user_input}
    """
           assistant_reply = llm(prompt)
           self.mem_manager.update(user_input, assistant_reply)
           return assistant_reply
    
    
    
    
    db = MemoryDB()
    memory_manager = MemoryManager(db)
    agent = WorkerAgent(db, memory_manager)
    
    
    print(agent.answer("We are building an agent that remembers projects long term."))
    print(agent.answer("It should organize conversations into topics automatically."))
    print(agent.answer("This memory system should support future reasoning."))
    
    
    for row in db.db.execute("SELECT * FROM mem_scenes"):
       print(dict(row))

    We define the worker agent that performs reasoning while remaining memory-aware. We retrieve relevant scenes, assemble contextual summaries, and generate responses grounded in long-term knowledge. We then close the loop by passing the interaction back to the memory manager so the system continuously improves over time.
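    Since the tutorial keeps SQLite in memory, one natural variation is to point the connection at a file so memories survive process restarts; the subclass and file path below are illustrative assumptions.

    class PersistentMemoryDB(MemoryDB):
        # Same schema as MemoryDB, but stored on disk so scenes persist across runs.
        def __init__(self, path="agent_memory.db"):
            self.db = sqlite3.connect(path)
            self.db.row_factory = sqlite3.Row
            try:
                self._init_schema()
            except sqlite3.OperationalError:
                # Tables already exist from a previous run; reuse them as-is.
                pass


    # db = PersistentMemoryDB()
    # agent = WorkerAgent(db, MemoryManager(db))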

    In this tutorial, we demonstrated how an agent can actively curate its own memory and turn past interactions into stable, reusable knowledge rather than ephemeral chat logs. We enabled memory to evolve through consolidation and selective recall, which supports more consistent and grounded reasoning across sessions. This approach provides a practical foundation for building long-lived agentic systems, and it can be naturally extended with mechanisms for forgetting, richer relational memory, or graph-based orchestration as the system grows in complexity.
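    As one concrete direction for the forgetting extension mentioned above, salience can be decayed over time and low-value cells pruned; the decay factor and threshold here are illustrative assumptions, and a fuller version would also mirror deletions into the full-text index.

    def decay_and_prune(db: MemoryDB, decay=0.9, threshold=0.15):
        # Shrink every cell's salience, then drop cells that fall below the threshold.
        # Scene summaries and the FTS index are left untouched in this sketch.
        db.db.execute("UPDATE mem_cells SET salience = salience * ?", (decay,))
        db.db.execute("DELETE FROM mem_cells WHERE salience < ?", (threshold,))
        db.db.commit()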

