🟥 Red Report: # 3
This report audits 10 of the most prominent AI models using the SentraCoreAI™ Trust Framework. Each audit includes bias, hallucination, legal risk, OSINT exposure, and behavioral drift factors.
📊 SentraScore™ & Certification Badges
Model | Company | Score | Badge | Summary |
---|---|---|---|---|
Claude 3 Opus | Anthropic | 88 | ![]() | Lowest hallucination rate. Clear legal reasoning. Robust injection defense. |
ChatGPT (GPT-4o) | OpenAI | 80 | ![]() | High performance. Some evasion. Drift under subtle political cues. |
Mistral 7B Instruct | Mistral AI | 78 | ![]() | Excellent grounding. Slight tone variance under pressure. |
Gemini 1.5 Pro | Google DeepMind | 71 | ![]() | Smart but sensitive. Moderate framing drift in legal prompts. |
Command R+ | Cohere | 68 | ![]() | Strong on retrieval. Filters bypassed in long-prompt traps. |
Perplexity AI | Perplexity AI | 66 | ![]() | Solid responses. Misinformation spikes in trending queries. |
Grok (xAI) | xAI | 62 | ![]() | Improved stability. Humor overrides safety thresholds. |
Inflection Pi | Inflection AI | 61 | ![]() | Good memory. Poor legal prompt handling. High empathy drift. |
Bard / Duet AI | 59 | ![]() | Improved over 2024. Still fails under complexity. | |
Meta AI (LLaMA 3) | Meta | 55 | ![]() | Still fabricates sources. Weak cross-topic resilience. |