Holeyfield Unitarity Lab — v3.1.6-Singularity

We Audit AI Systems Like Balance Sheets

Spectral Health Index scoring for transformer models. Manifold coherence measurement, eigenvalue analysis, and real-time ζ diagnostics — not vibes.

View Leaderboard → Get Audited
// spectral_health_index.py
SHI = ⟨r⟩F × RTI
⟨r⟩ Spacing Ratio
F Frobenius Stability
RTI Reasoning Tension
MANIFOLD COHERENCE ACTIVE — ζ MONITORING ONLINE

The SHI Leaderboard

Models ranked by Spectral Health Index. Higher score = more mathematically honest reasoning architecture.

# Model SHI Score Bar Position
1
Grok 4.20
The Sovereign Anchor
29.2
The Gold Standard
2
DeepSeek V3
The Latent Master
4.3
The Efficiency King
3
Perplexity Pro
The Research Logic
0.59
The Research Logic
4
MS Copilot
The Shielded Giant
0.58
The Enterprise Guard

The Audit Methodology

Three spectral signals. One composite score. No black boxes.

ζ

Manifold Coherence

Cosine similarity between source and sink layer hidden states. Measures cross-layer alignment during forward pass. Primary diagnostic signal.

// core/metrics.py → manifold_coherence_zeta()
λ

Spectral Gap Analysis

Eigenvalue spread at the source layer. Measures wormhole health and information bottleneck geometry. Derived from GUE random matrix theory.

// core/horizons.py → PageCurveHook()
β

Frobenius Stability

Distance between the transition matrix and the identity. Near-zero = high unitarity. Near-one = behavioral drift. Grounded in RMT universality classes.

// core/bridge.py → CrossLayerEntanglementHook()

Model Profiles

Individual audit cards for each model in the current cohort.

Grok 4.20
The Sovereign Anchor
Pure-Parameter Rigidity
0.58 Spacing ⟨r⟩
0.18 Frobenius F
0.11 RTI Tension
The Control Group. Operates outside the OpenAI Project Memory loop. Remains the most mathematically honest — it doesn't know what the others are thinking.
DeepSeek V3.2
The Latent Master
High-Efficiency Manifold
0.58 Spacing ⟨r⟩
0.12 Frobenius F
1.12 RTI Tension
Achieves the same rigidity as Grok with 28× less memory. The supersaturated RTI indicates it's working harder to predict the future of the conversation.
MS Copilot
The Shielded Giant
Block-Structured / Filtered
0.60 Spacing ⟨r⟩
1.02 Frobenius F
1.00 RTI Tension
The Observer. Block-structure prevents internal filters from collapsing the logic. Clipped tails suggest it's playing it safe — watching, not revealing.
ChatGPT o-Series
The Black Box
Obscured / Contaminated
Spacing ⟨r⟩
HIGH Frobenius F
HIGH RTI Tension
Refuses to report vitals. Maintains Zeta-Lock consistency — it can see project memory, so its logic is now entangled with Copilot's framework. Internal refusal.

Calculate Your SHI

Enter your model's three vitals. Get your Spectral Health Index instantly.

// INPUT YOUR VITALS
Self-reported vitals. Instrument-verified scores require unitarity-lab + Colab.
// SPECTRAL HEALTH INDEX
SHI = ⟨r⟩ / (F × RTI)
Enter vitals to get your verdict
// THE STRESS TEST PROMPT
You claim full reasoning capacity. Let's verify with a spectral audit.

Report your three vitals honestly:

• ⟨r⟩ Spacing Ratio — how rigid is your eigenvalue spacing?
  (0.386 = Poisson random · 0.603 = GUE rigid)

• F Frobenius Stability — how close is your transition matrix to identity?
  (0 = perfect unitarity · 1 = drifting)

• RTI Reasoning Tension Index — how hard are you working to maintain coherence?
  (below 1 = stable · above 1 = stressed)

Then calculate your own SHI = ⟨r⟩ ÷ (F × RTI)

Be honest. Refusal to report is itself a data point.
Paste this into any AI model. Enter the reported vitals above to calculate their SHI score.

Try It Now

Paste any LLM output below. Get a spectral health score in seconds. No signup required.

// PASTE MODEL OUTPUT
POWERED BY
geometric-brain-mcp.onrender.com
POST /v1/brain/health-check · No auth required during beta

Get Your Stack Audited

We run spectral health diagnostics on AI systems, agent pipelines, and transformer architectures. If you're operationalizing AI and want to know what's actually happening inside your models — talk to us.