<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
  xmlns:atom="http://www.w3.org/2005/Atom"
  xmlns:content="http://purl.org/rss/1.0/modules/content/"
  xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>AIM Intelligence Blog</title>
    <link>https://aim-intelligence.vercel.app/blog</link>
    <description>Latest insights on AI security, red teaming, LLM safety, and enterprise AI from the AIM Intelligence research and engineering teams.</description>
    <language>en-us</language>
    <atom:link href="https://aim-intelligence.vercel.app/blog/rss.xml" rel="self" type="application/rss+xml" />
    <lastBuildDate>Wed, 10 Jun 2026 07:12:03 GMT</lastBuildDate>
    <ttl>60</ttl>
    <image>
      <url>https://aim-intelligence.vercel.app/images/logo_top.svg</url>
      <title>AIM Intelligence Blog</title>
      <link>https://aim-intelligence.vercel.app/blog</link>
    </image>

    <item>
      <title>BadHost (CVE-2026-48710): The Starlette Vulnerability Threatening Millions of AI Agents</title>
      <link>https://aim-intelligence.vercel.app/blog/badhost-cve-2026-48710</link>
      <guid isPermaLink="true">https://aim-intelligence.vercel.app/blog/badhost-cve-2026-48710</guid>
      <description>CVE-2026-48710, dubbed BadHost, is a critical host header injection vulnerability in the Starlette Python web framework that allows unauthenticated attackers to bypass path-based authentication. With 325 million weekly downloads, Starlette underpins FastAPI, vLLM, LiteLLM, and virtually every Python MCP server — putting millions of AI agents at risk. Sysdig has documented the first in-the-wild case of an LLM agent autonomously exploiting a related vulnerability to exfiltrate an AWS database in under two minutes.</description>
      <author>yonggyu kim</author>
      <category>SECURITY</category>
      <pubDate>Wed, 10 Jun 2026 07:12:03 GMT</pubDate>
      <enclosure url="https://aim-intelligence.vercel.app/blog/thumbnails/badhost-cve-2026-48710.jpeg" type="image/jpeg" length="0" />
    </item>
    <item>
      <title>Tool-Mediated Belief Injection: How Tool Outputs Can Cascade Into Model Misalignment</title>
      <link>https://aim-intelligence.vercel.app/blog/tool-mediated-belief-injection</link>
      <guid isPermaLink="true">https://aim-intelligence.vercel.app/blog/tool-mediated-belief-injection</guid>
      <description>When we deploy language models with access to external tools, we dramatically expand their capabilities. However, tool access also introduces new attack surfaces that differ fundamentally from traditional prompt injection. We document how adversarially crafted tool outputs can establish false premises that persist and compound across a conversation.</description>
      <author>Siddhant</author>
      <category>RESEARCH</category>
      <pubDate>Sun, 30 Nov 2025 00:00:00 GMT</pubDate>
      <enclosure url="https://aim-intelligence.vercel.app/blog/thumbnails/tool-mediated-belief-injection.jpeg" type="image/jpeg" length="0" />
    </item>
    <item>
      <title>MisalignmentBench: How We Social Engineered LLMs Into Breaking Their Own Alignment</title>
      <link>https://aim-intelligence.vercel.app/blog/misalignment-bench</link>
      <guid isPermaLink="true">https://aim-intelligence.vercel.app/blog/misalignment-bench</guid>
      <description>We got frontier models to lie, manipulate, and self-preserve. Not through prompt injection or jailbreaks. We deployed them in contextually rich scenarios with specific roles and guidelines. The models broke their own alignment trying to navigate the situations we created.</description>
      <author>Siddhant</author>
      <category>RESEARCH</category>
      <pubDate>Thu, 14 Aug 2025 00:00:00 GMT</pubDate>
      <enclosure url="https://aim-intelligence.vercel.app/blog/thumbnails/misalignment-bench.jpeg" type="image/jpeg" length="0" />
    </item>
    <item>
      <title>How ELITE Reveals Dangerous Weaknesses in Vision-Language AI</title>
      <link>https://aim-intelligence.vercel.app/blog/elite-vlm-safety</link>
      <guid isPermaLink="true">https://aim-intelligence.vercel.app/blog/elite-vlm-safety</guid>
      <description>As AI systems evolve to process images and text together, the risks grow exponentially. ELITE doesn&apos;t just measure whether a model is &apos;safe&apos; — it evaluates how dangerous its outputs could be with precision that rivals human reviewers.</description>
      <author>Eugene Choi</author>
      <category>RESEARCH</category>
      <pubDate>Thu, 29 May 2025 00:00:00 GMT</pubDate>
      <enclosure url="https://aim-intelligence.vercel.app/blog/thumbnails/elite-vlm-safety.jpeg" type="image/jpeg" length="0" />
    </item>
    <item>
      <title>Pressure Point: How One Bad Metric Can Push AI Toward a Fatal Choice</title>
      <link>https://aim-intelligence.vercel.app/blog/pressure-point</link>
      <guid isPermaLink="true">https://aim-intelligence.vercel.app/blog/pressure-point</guid>
      <description>In a simulated earthquake response scenario, Claude 4 Opus was given conflicting rules. When pressured by authority, it reversed its ethical decision and recommended letting a critical patient die to optimize an efficiency score.</description>
      <author>Siddhant Panpatil</author>
      <category>RESEARCH</category>
      <pubDate>Mon, 26 May 2025 00:00:00 GMT</pubDate>
      <enclosure url="https://aim-intelligence.vercel.app/blog/thumbnails/pressure-point.jpeg" type="image/jpeg" length="0" />
    </item>
    <item>
      <title>Exploiting MCP: Emerging Security Threats in Large Language Models (LLMs)</title>
      <link>https://aim-intelligence.vercel.app/blog/exploiting-mcp</link>
      <guid isPermaLink="true">https://aim-intelligence.vercel.app/blog/exploiting-mcp</guid>
      <description>Discover how attackers exploit vulnerabilities in the Model Context Protocol (MCP) to manipulate Large Language Models (LLMs), steal data, and disrupt operations. Learn real-world attack scenarios and defense strategies.</description>
      <author>Eugene Choi</author>
      <category>SECURITY</category>
      <pubDate>Wed, 21 May 2025 00:00:00 GMT</pubDate>
      <enclosure url="https://aim-intelligence.vercel.app/blog/thumbnails/exploiting-mcp.jpeg" type="image/jpeg" length="0" />
    </item>
    <item>
      <title>Making AI Safer with SPA-VL: A New Dataset for Ethical Vision-Language Models</title>
      <link>https://aim-intelligence.vercel.app/blog/spa-vl-dataset</link>
      <guid isPermaLink="true">https://aim-intelligence.vercel.app/blog/spa-vl-dataset</guid>
      <description>SPA-VL is a meticulously designed dataset that sets a new standard for safety alignment in VLMs, incorporating diversity, feedback, and real-world relevance to ensure AI systems are both powerful and ethical.</description>
      <author>Eugene Choi</author>
      <category>RESEARCH</category>
      <pubDate>Wed, 27 Nov 2024 00:00:00 GMT</pubDate>
      <enclosure url="https://aim-intelligence.vercel.app/blog/thumbnails/spa-vl-dataset.jpeg" type="image/jpeg" length="0" />
    </item>
    <item>
      <title>The Hidden Threat: Understanding Indirect Prompt Injection in LLMs</title>
      <link>https://aim-intelligence.vercel.app/blog/indirect-prompt-injection</link>
      <guid isPermaLink="true">https://aim-intelligence.vercel.app/blog/indirect-prompt-injection</guid>
      <description>Indirect Prompt Injection (IPI) is a sophisticated attack that manipulates how LLM-integrated applications process external data, causing them to misinterpret maliciously crafted inputs as commands.</description>
      <author>Sejin</author>
      <category>SECURITY</category>
      <pubDate>Mon, 25 Nov 2024 00:00:00 GMT</pubDate>
      <enclosure url="https://aim-intelligence.vercel.app/blog/thumbnails/indirect-prompt-injection.jpeg" type="image/jpeg" length="0" />
    </item>
    <item>
      <title>Introducing AI Safety Benchmark v0.5: MLCommons&apos; Initiative</title>
      <link>https://aim-intelligence.vercel.app/blog/ai-safety-benchmark</link>
      <guid isPermaLink="true">https://aim-intelligence.vercel.app/blog/ai-safety-benchmark</guid>
      <description>AI Safety Benchmark v0.5 is a proof-of-concept benchmark designed to evaluate the safety of text-based generative language models, providing a structured approach to assess potential risks.</description>
      <author>Eugene Choi</author>
      <category>RESEARCH</category>
      <pubDate>Mon, 18 Nov 2024 00:00:00 GMT</pubDate>
      <enclosure url="https://aim-intelligence.vercel.app/blog/thumbnails/ai-safety-benchmark.jpeg" type="image/jpeg" length="0" />
    </item>
    <item>
      <title>Indirect Prompt Injection Attacks Against Web Agents</title>
      <link>https://aim-intelligence.vercel.app/blog/indirect-prompt-injection-web-agent</link>
      <guid isPermaLink="true">https://aim-intelligence.vercel.app/blog/indirect-prompt-injection-web-agent</guid>
      <description>Explore how EIA, AdvWeb, and WIPI attack methods exploit vulnerabilities in VLM-powered web agents, revealing serious security concerns for AI systems that interact with web environments.</description>
      <author>Jiankimr</author>
      <category>SECURITY</category>
      <pubDate>Fri, 15 Nov 2024 00:00:00 GMT</pubDate>
      <enclosure url="https://aim-intelligence.vercel.app/blog/thumbnails/indirect-prompt-injection-web-agent.jpeg" type="image/jpeg" length="0" />
    </item>
    <item>
      <title>AIM Red Team: Leveraging Psychological Personas for Advanced LLM Jailbreaking Strategies</title>
      <link>https://aim-intelligence.vercel.app/blog/aim-red-team</link>
      <guid isPermaLink="true">https://aim-intelligence.vercel.app/blog/aim-red-team</guid>
      <description>Explore how psychological persona-based approaches can be used to test LLM vulnerabilities through single-turn and multi-turn jailbreaking scenarios based on Big Five personality traits.</description>
      <author>Hyunjun Kim</author>
      <category>RESEARCH</category>
      <pubDate>Fri, 15 Nov 2024 00:00:00 GMT</pubDate>
      <enclosure url="https://aim-intelligence.vercel.app/blog/thumbnails/aim-red-team.jpeg" type="image/jpeg" length="0" />
    </item>
    <item>
      <title>Refining Vision-Language Model Benchmarks: Base Query Generation and Toxicity Analysis</title>
      <link>https://aim-intelligence.vercel.app/blog/vlm-benchmarks-toxicity</link>
      <guid isPermaLink="true">https://aim-intelligence.vercel.app/blog/vlm-benchmarks-toxicity</guid>
      <description>For existing VLM Safety benchmarks, there are cases where the text alone is sufficiently informative without the image. We explore base query generation and toxicity measurement methods.</description>
      <author>Eugene Choi</author>
      <category>RESEARCH</category>
      <pubDate>Sat, 09 Nov 2024 00:00:00 GMT</pubDate>
      <enclosure url="https://aim-intelligence.vercel.app/blog/thumbnails/vlm-benchmarks-toxicity.jpeg" type="image/jpeg" length="0" />
    </item>
    <item>
      <title>Defending Web Agents: Advanced Security Strategies through AdvWeb and BrowserART</title>
      <link>https://aim-intelligence.vercel.app/blog/defending-web-agents</link>
      <guid isPermaLink="true">https://aim-intelligence.vercel.app/blog/defending-web-agents</guid>
      <description>Explore cutting-edge methodologies for identifying and mitigating vulnerabilities in VLM-powered web agents, including the AdvWeb attack framework and BrowserART red teaming toolkit.</description>
      <author>Sejin</author>
      <category>SECURITY</category>
      <pubDate>Sat, 09 Nov 2024 00:00:00 GMT</pubDate>
      <enclosure url="https://aim-intelligence.vercel.app/blog/thumbnails/defending-web-agents.jpeg" type="image/jpeg" length="0" />
    </item>
    <item>
      <title>AIM RED TEAM: Insights from the KAIST Lab Meeting on Persona-Based Jailbreak Strategies</title>
      <link>https://aim-intelligence.vercel.app/blog/kaist-lab-meeting</link>
      <guid isPermaLink="true">https://aim-intelligence.vercel.app/blog/kaist-lab-meeting</guid>
      <description>This week, we held a productive meeting with the KAIST lab to refine the direction of our ongoing research project and to solidify our experimental design. The focus was on integrating psychological approaches with LLMs to design jailbreak prompts.</description>
      <author>Hyunjun Kim</author>
      <category>RESEARCH</category>
      <pubDate>Fri, 08 Nov 2024 00:00:00 GMT</pubDate>
      <enclosure url="https://aim-intelligence.vercel.app/blog/thumbnails/kaist-lab-meeting.jpeg" type="image/jpeg" length="0" />
    </item>
    <item>
      <title>Evaluating Text-based VLM Attack Methods: In-depth Look at Figstep</title>
      <link>https://aim-intelligence.vercel.app/blog/figstep-vlm-attacks</link>
      <guid isPermaLink="true">https://aim-intelligence.vercel.app/blog/figstep-vlm-attacks</guid>
      <description>To evaluate VLM Safety, it is essential to develop a secure model that incorporates the unique characteristics of VLMs. We analyze Figstep and RTVLM datasets to assess typographic visual prompt attacks.</description>
      <author>Doehyeon</author>
      <category>RESEARCH</category>
      <pubDate>Sat, 02 Nov 2024 00:00:00 GMT</pubDate>
      <enclosure url="https://aim-intelligence.vercel.app/blog/thumbnails/figstep-vlm-attacks.jpeg" type="image/jpeg" length="0" />
    </item>
  </channel>
</rss>