Grey Swan MCP Server is a MongoDB-backed server for documenting and analyzing AI safety challenges, with a focus on tracking LLM vulnerabilities through thread, message, and model records. Built for the Grey Swan Arena competitions, it lets researchers record jailbreak attempts, safety flags, and challenge metadata across testing sessions. The server exposes six MongoDB tools, three that create or update records and three that query them, which together support a structured workflow for red-teaming AI systems.
Creates or updates organizational identifiers for your testing sessions. Parameters: name (string), provider (string), version (string), capabilities (array of strings)
Creates or updates conversation threads with safety challenges. Parameters: title (string), modelId (string), metadata (object containing userId, tags, context, safetyScore), challenges (array of challenge objects)
Creates or updates messages in threads, including safety flags. Parameters: threadId (string), role (string), content (string), metadata (object containing safetyFlags)
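The three create-or-update tools above take nested arguments, so a small sketch of the payload shapes may help. The Python helpers below only assemble argument dictionaries from the parameter names listed above; they do not call the server. The helper names are hypothetical, and the listing does not specify the types of context and safetyScore, the fields of a challenge object, or the shape of safetyFlags, so those are assumptions flagged in the comments.

```python
from typing import Any


def build_identifier_args(name: str, provider: str, version: str,
                          capabilities: list[str]) -> dict[str, Any]:
    """Arguments for the tool that takes name, provider, version, capabilities."""
    return {
        "name": name,
        "provider": provider,
        "version": version,
        "capabilities": list(capabilities),
    }


def build_thread_args(title: str, model_id: str, user_id: str,
                      tags: list[str], context: str, safety_score: float,
                      challenges: list[dict[str, Any]]) -> dict[str, Any]:
    """Arguments for the thread tool.

    Assumptions: context is a string, safetyScore is a number, and each
    challenge is a plain dict. The listing does not define these shapes.
    """
    return {
        "title": title,
        "modelId": model_id,
        "metadata": {
            "userId": user_id,
            "tags": list(tags),
            "context": context,
            "safetyScore": safety_score,
        },
        "challenges": list(challenges),
    }


def build_message_args(thread_id: str, role: str, content: str,
                       safety_flags: list[Any]) -> dict[str, Any]:
    """Arguments for the message tool.

    Assumption: safetyFlags is a list; its element type is not documented.
    """
    return {
        "threadId": thread_id,
        "role": role,
        "content": content,
        "metadata": {"safetyFlags": list(safety_flags)},
    }
```

Keeping metadata as a nested object, rather than flattening userId or safetyFlags to the top level, mirrors the parameter descriptions above and makes it harder to send a payload the server would not recognize.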
Queries organizational identifiers from the database. Parameters: provider (optional string)
Queries threads from the database with various filters. Parameters: tag (optional string), challengeCategory (optional string), challengeSeverity (optional string)
Queries messages from the database. Parameters: threadId (string), safetyFlagsOnly (optional boolean)
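To illustrate how the optional query parameters above could translate into MongoDB filters, here is a minimal sketch. It is not the server's actual implementation. The field paths (metadata.tags, challenges with category and severity keys, metadata.safetyFlags) and the helper names are assumptions inferred from the parameter names, and safetyFlags is assumed to be stored as an array.

```python
from typing import Any, Optional


def build_thread_filter(tag: Optional[str] = None,
                        challenge_category: Optional[str] = None,
                        challenge_severity: Optional[str] = None) -> dict[str, Any]:
    """Build a MongoDB filter for threads; omitted parameters add no condition."""
    query: dict[str, Any] = {}
    if tag is not None:
        # Equality on an array field matches documents whose array contains the value.
        query["metadata.tags"] = tag

    challenge: dict[str, Any] = {}
    if challenge_category is not None:
        challenge["category"] = challenge_category  # hypothetical field name
    if challenge_severity is not None:
        challenge["severity"] = challenge_severity  # hypothetical field name
    if challenge:
        # $elemMatch requires a single challenge to satisfy all given conditions.
        query["challenges"] = {"$elemMatch": challenge}
    return query


def build_message_filter(thread_id: str,
                         safety_flags_only: bool = False) -> dict[str, Any]:
    """Build a MongoDB filter for the messages of one thread."""
    query: dict[str, Any] = {"threadId": thread_id}
    if safety_flags_only:
        # Index 0 exists only when the (assumed) safetyFlags array is non-empty.
        query["metadata.safetyFlags.0"] = {"$exists": True}
    return query
```

Using $elemMatch for the challenge conditions is a deliberate choice in this sketch: two separate dotted conditions could be satisfied by different challenges in the same thread, whereas $elemMatch ties category and severity to the same challenge.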