Overview
Syllabus
LLM Security Risks
Video Overview
Resources and Scripts
Installation and Server Setup
Jailbreak attacks to avoid Safety Guardrails
Detecting jailbreak attacks
Llama Guard and its prompt template
Llama Prompt Guard
Testing Jailbreak Detection
Testing for false positives with Llama Guard
Off-topic Requests
Prompt Injection Attacks Container escape, File access / deletion, DoS
1. Detecting Injection Attacks with a Custom Guard
Preventing Injection Attacks via User Authentication
37 Using Prepared Statements to avoid SQL Injection Attacks
Response Sanitisation to avoid Injection Attacks
Malicious Code Attacks
Building a custom classifier for malicious code
Using Codeshield to detect malicious code
Malicious Code Detection Performance
Effect of Guards/shields on Response Time / Latency
Final Tips
Resources
Taught by
Trelis Research