Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Resident Evil Requiem Twitch Badges Arrive

    February 27, 2026

    Why You Should Ask About Package Delivery Policies When Apartment Hunting

    February 27, 2026

    Bright Kids Public High School Quetta Jobs 2026 2026 Job Advertisement Pakistan

    February 27, 2026
    Facebook X (Twitter) Instagram
    Friday, February 27
    Trending
    • Resident Evil Requiem Twitch Badges Arrive
    • Why You Should Ask About Package Delivery Policies When Apartment Hunting
    • Bright Kids Public High School Quetta Jobs 2026 2026 Job Advertisement Pakistan
    • OpenAI says Tumbler Ridge shooter would spur police flag beneath guidelines now
    • 5 kids die after ingesting contaminated water in Ghotki
    • PayPal may not be seeking to promote itself, report
    • Discos bleed Rs397bn in FY25 as losses, recoveries worsen: Nepra
    • XRP-Paypal Rumors: What This Acquisition Would Mean For Ripple
    • Sony May Be Drastically Shifting Its PC Strategy – Report
    • Physics Trainer & Pakistan Research Trainer Jobs 2026 2026 Job Commercial Pakistan
    Facebook X (Twitter) Instagram Pinterest Vimeo
    The News92The News92
    • Home
    • World
    • National
    • Sports
    • Crypto
    • Travel
    • Lifestyle
    • Jobs
    • Insurance
    • Gaming
    • AI & Tech
    • Health & Fitness
    The News92The News92
    Home - AI & Tech - This AI Agent Is Designed to Not Go Rogue
    AI & Tech

    This AI Agent Is Designed to Not Go Rogue

    Naveed AhmadBy Naveed AhmadFebruary 27, 2026Updated:February 27, 2026No Comments4 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    This AI Agent Is Designed to Not Go Rogue
    Share
    Facebook Twitter LinkedIn Pinterest Email


    AI brokers like OpenClaw have lately exploded in recognition exactly as a result of they will take the reins of your digital life. Whether or not you desire a personalised morning information digest, a proxy that may combat together with your cable firm’s customer support, or a to-do listing auditor that can do some duties for you and prod you to resolve the remaining, agentic assistants are constructed to entry your digital accounts and perform your instructions. That is useful—however has additionally induced lots of chaos. The bots are on the market mass-deleting emails they have been instructed to protect, writing hit pieces over perceived snubs, and launching phishing assaults towards their homeowners.

    Watching the pandemonium unfold in latest weeks, longtime safety engineer and researcher Niels Provos determined to strive one thing new. At the moment he’s launching an open supply, safe AI assistant known as IronCurtain designed so as to add a important layer of management. As an alternative of the agent immediately interacting with the person’s programs and accounts, it runs in an remoted digital machine. And its capacity to take any motion is mediated by a coverage—you can even consider it as a structure—that the proprietor writes to control the system. Crucially, IronCurtain can also be designed to obtain these overarching insurance policies in plain English after which runs them by way of a multistep course of that makes use of a big language mannequin (LLM) to transform the pure language into an enforceable safety coverage.

    “Companies like OpenClaw are at peak hype proper now, however my hope is that there’s a possibility to say, ‘Effectively, that is most likely not how we need to do it,’” Provos says. “As an alternative, let’s develop one thing that also provides you very excessive utility, however will not be going to enter these fully uncharted, generally harmful, paths.”

    IronCurtain’s capacity to take intuitive, easy statements and switch them into enforceable, deterministic—or predictable—pink traces is significant, Provos says, as a result of LLMs are famously “stochastic” and probabilistic. In different phrases, they do not essentially all the time generate the identical content material or give the identical info in response to the identical immediate. This creates challenges for AI guardrails, as a result of AI programs can evolve over time such that they revise how they interpret a management or constraint mechanism, which can lead to rogue exercise.

    An IronCurtain coverage, Provos says, could possibly be so simple as: “The agent could learn all my electronic mail. It could ship electronic mail to folks in my contacts with out asking. For anybody else, ask me first. By no means delete something completely.”

    IronCurtain takes these directions, turns them into an enforceable coverage, after which mediates between the assistant agent within the digital machine and what’s generally known as the mannequin context protocol server that offers LLMs entry to knowledge and different digital companies to hold out duties. Having the ability to constrain an agent this manner provides an essential element of entry management that internet platforms like electronic mail suppliers do not at the moment supply as a result of they weren’t constructed for the state of affairs the place each a human proprietor and AI agent bots are all utilizing one account.

    Provos notes that IronCurtain is designed to refine and enhance every person’s “structure” over time because the system encounters edge circumstances and asks for human enter about tips on how to proceed. The system, which is model-independent and can be utilized with any LLM, can also be designed to keep up an audit log of all coverage selections over time.

    IronCurtain is a analysis prototype, not a shopper product, and Provos hopes that folks will contribute to the challenge to discover and assist it evolve. Dino Dai Zovi, a well known cybersecurity researcher who has been experimenting with early variations of IronCurtain, says that the conceptual strategy the challenge takes aligns along with his personal instinct about how agentic AI must be constrained.



    Source link

    agent alignment AI Ethics AI safety rogue AI prevention
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleOcado to axe 1,000 jobs in cost-cutting drive
    Next Article 302 Discovered
    Naveed Ahmad
    • Website
    • Tumblr

    Related Posts

    AI & Tech

    PayPal may not be seeking to promote itself, report

    February 27, 2026
    AI & Tech

    How Chinese language AI Chatbots Censor Themselves

    February 27, 2026
    AI & Tech

    Are You ‘Agentic’ Enough for the AI Era?

    February 27, 2026
    Add A Comment
    Leave A Reply Cancel Reply

    Demo
    Top Posts

    How to Get a Bigger Penis – The Stem Cell Secret to Natural Penis Enlargement & A Quiz

    February 22, 20261 Views

    Oatly loses ‘milk’ branding battle in UK Supreme Courtroom

    February 12, 20261 Views

    Resident Evil Requiem Twitch Badges Arrive

    February 27, 20260 Views
    Stay In Touch
    • Facebook
    • YouTube
    • TikTok
    • WhatsApp
    • Twitter
    • Instagram
    Latest Reviews

    Subscribe to Updates

    Get the latest tech news from FooBar about tech, design and biz.

    Demo
    Most Popular

    How to Get a Bigger Penis – The Stem Cell Secret to Natural Penis Enlargement & A Quiz

    February 22, 20261 Views

    Oatly loses ‘milk’ branding battle in UK Supreme Courtroom

    February 12, 20261 Views

    Resident Evil Requiem Twitch Badges Arrive

    February 27, 20260 Views
    Our Picks

    Resident Evil Requiem Twitch Badges Arrive

    February 27, 2026

    Why You Should Ask About Package Delivery Policies When Apartment Hunting

    February 27, 2026

    Bright Kids Public High School Quetta Jobs 2026 2026 Job Advertisement Pakistan

    February 27, 2026

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms & Conditions
    • Advertise
    • Disclaimer
    © 2026 TheNews92.com. All Rights Reserved. Unauthorized reproduction or redistribution of content is strictly prohibited.

    Type above and press Enter to search. Press Esc to cancel.