malicious-prompt-scanner

Here are 2 public repositories matching this topic...

ishara084 / CS6515_Malicious-Prompt-Detection-in-LLM

Text classification task to identify and classify malicious prompts for LLM interactions

albert glove-embeddings embeddings-word2vec embedding-evaluation bert-fine-tuning large-language-models llm malicious-prompt-scanner bert-base-v2 embedding-gte

Updated Dec 3, 2025
Jupyter Notebook

StrategicPromptArchitect-AI / MalPromptSentinel-CC-Skill

Star

MalPromptSentinel (MPS) is a Claude Code skill that detects malicious prompts in uploaded files before Claude processes them. It provides two-tier scanning to identify prompt injection attacks, role manipulation attempts, privilege escalation, and other adversarial techniques.

ai-security llm-security prompt-security claude-skill prompt-injection-detection malicious-prompt-scanner ai-prompt-protection

Updated Nov 27, 2025
Python

Improve this page

Add a description, image, and links to the malicious-prompt-scanner topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the malicious-prompt-scanner topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly