Overview: Large Language Models (LLMs) face two notable challenges: aligning their responses with human values to prevent harmful outputs, and multilingual capabilities that attackers are exploiting. Malicious users have […]
Sandwich attack: Multi-language Mixture Adaptive Attack on LLMs
- April 12, 2024