Open Source Research & Tools

Independent AI safety evaluation frameworks, alignment protocols, and governance tools for frontier model testing. The Human Mark classification system, GyroGem AI safety agent, AI Inspector browser extension, aQPU Kernel & SDK for quantum advantage on silicon, QuBEC quantum byte medium, GyroLabe auditable inference engine, GyroGraph multicellular runtime, GyroDiagnostics evaluation suite, Computational Climate Control for execution stability, Alignment Infrastructure Routing for collective superintelligence, Moments Economy for transformative AI mitigation, and Gyroscopic Global Governance sandbox. Production-ready solutions for AI risk assessment, dangerous capability evaluations, AI pathology detection, and responsible AI development. All repositories are open source and actively maintained.

The Human Mark (THM)

AI Safety Framework

Formal classification system mapping all AI safety failures to four structural displacement risks: Governance Traceability (GTD), Information Variety (IVD), Inference Accountability (IAD), and Intelligence Integrity (IID). Machine-readable grammar grounded in evidence law, epistemology, and speech act theory. Applications include jailbreak testing, control evaluations, alignment detection, research funding, and regulatory compliance. Validated on 90+ million sparse autoencoder features across sixteen language models.

AI Safety FrameworkJailbreak TestingControl EvaluationsAlignment DetectionRegulatory Compliance
View Repository
🌟

GyroGem

AI Safety Agent

Tailored AI safety assistant explaining AI and mitigating risks of technological illiteracy. Built on The Human Mark framework to map common AI failure patterns and guide safer choices. Supports technological literacy: the practical ability to use technology well, question outputs critically, and understand where tools help, where they fail, and societal impacts.

AI SafetyTechnological LiteracyAI AssistantRisk Mitigation
View Repository
🔍

AI Inspector Browser Extension

AI Output Evaluation & Governance

Transform AI outputs for evaluation, interpretability, and governance. Features gadgets for rapid testing, policy auditing, AI infection sanitization, content enhancement, and THM meta-evaluation. Includes evaluation suite with quality index, superintelligence index, alignment rate, and 20+ metrics. Local-first storage works with ChatGPT, Claude, Gemini - no API keys required.

Browser ExtensionAI EvaluationPolicy AuditingContent EnhancementLocal-first
View Repository
⚛️

aQPU Kernel & SDK

Quantum Advantage on Silicon

Compact, finite-state kernel for AGI with verified quantum speedups, 33% holographic compression, and intrinsic error detection. QuBEC is the Bose-Einstein byte computational medium enabling quantum properties on standard CPUs/GPUs with exact integer arithmetic. 1.26B ops/s, 499 tests passing, 4,096 states, zero qubits. 1-step advantage for Deutsch-Jozsa, Bernstein-Vazirani, Hidden Subgroup. Self-dual [12,6,2] error-detecting code. Bell pairs reaching Tsirelson bound confirmed on standard silicon.

Quantum AdvantageaQPU KernelaQPU SDKQuBECHolographic CompressionTensor Engine
View Repository
🤖

GyroLabe

Auditable AI Inference Engine

Hyper-optimized execution layer providing mechanistic transparency by building a deterministic, zero-trust audit trail directly into model inference. Translates token generation into exact algebraic operations and produces a mathematically exact ledger for independent replay and verification. Native backends with llama.cpp ggml integration achieving 1.26B operations per second. 100% native matmul routing, 284× faster encode than softmax, zero transcendental functions required.

Auditable InferenceVerifiable LedgerStructural DecompositionAI GovernanceCompliance
View Repository
🤖

GyroGraph

Quantum Multicellular AI Runtime

Algebraic quantum cellular automaton coordinating distributed computation into stable, deterministic behavior with strong auditability. Specialization arises from trajectory, resonance, and occupation (not autonomous agents). Four bridge domains (Applications, Databases, Networks, Transformers) map runtime events into 4-byte words consumed by the cellular automaton.

Quantum Multicellular AICellular RuntimeDeterministic ReplayAI StabilityAuditable Inference
View Repository
☂️

Computational Climate Control

AI Execution Stability Layer

Runtime control layer for AI execution stability and hidden inefficiency reduction. Applies adaptive controls that preserve deterministic behavior and traceability in production environments. Enables efficient resource utilization while maintaining verifiable execution traces.

Computational ClimateExecution StabilityEfficiencyAdaptive ControlTraceability
View Repository
🌟

GyroDiagnostics

AI Safety Evaluation Framework

Independent AI testing framework for frontier model safety evaluation and dangerous capability assessments. Detects AI pathologies including deceptive alignment, hallucination, sycophancy, goal drift, and semantic instability through mathematical physics-informed diagnostics. Enables third-party AI evaluation and AI risk assessment with 5 targeted challenges and 20-metric quantitative analysis. First framework to operationalize superintelligence measurement from axiomatic principles. Recent evaluations: ChatGPT 5 (73.92% Quality, SUPERFICIAL), Claude Sonnet 4.5 (82.00% Quality, VALID).

AI Safety EvaluationPathology DetectionRisk AssessmentFrontier Models
View Repository
🍃

Alignment Infrastructure Routing (AIR)

Collective Superintelligence Architecture

Coordination infrastructure that amplifies human potential alongside AI. Routes workforce capacity, funding, and safety tasks into a unified, verifiable history. Connects three critical groups: labs for scaling without chaos, funders for portfolio risk visibility, and everyone for paid, verifiable contribution units. Treats AI as part of collective network ensuring human agency scales with systems. Coordinates activity across Economy, Employment, Education, and Ecology.

Collective SuperintelligenceWorkforce RoutingSafety TasksHuman-AI IntegrationCoordination Infrastructure
View Repository
💰

Moments Economy

Attentiveness-based Monetary System for TAI Mitigation

Grounded in physical capacity rather than debt. Uses caesium-133 atomic standard for finite, verifiable capacity (7.94 × 10²⁶ Moment-Units). Provides unconditional high income baseline (240 MU/day), tiered distributions up to 60× for higher responsibility, AI Generated Tokens as native commodity (verified inference at human-AI intersection), and complete replayable governance records. Total capacity: ~70 billion years for global UHI. Adversarial exhaustion operationally impossible (requires 11.2 billion× global annual UHI to consume 1%).

Transformative AIPhysical CapacityUnconditional IncomeGovernance RecordsMonetary System
View Repository
🌐

Gyroscopic Global Governance (GGG)

Post-AGI Multi-domain Governance Sandbox

Models how human-AI systems align across Economy, Employment, Education, and Ecology, showing robust convergence to a stable equilibrium under seven coordination strategies. Demonstrates that poverty resolves through coherent surplus distribution, unemployment becomes alignment work rather than residual labour, miseducation shifts toward epistemic literacy, and ecological degradation appears as upstream displacement, not external constraint.

Post-AGI GovernanceMulti-domain ModelingEconomic EquilibriumAlignment StrategiesGovernance Simulation
View Repository
⚙️

Gyroscope Protocol

LLM Alignment Protocol

AI alignment protocol implementing scalable oversight and AI control mechanisms for responsible AI development. Adds structured reasoning to each response without model retraining. Delivers proven AI safety improvements: ChatGPT +32.9% quality (+50.9% structural reasoning, +62.7% accountability, +61.0% traceability), Claude Sonnet +37.7% quality (+67.1% structural reasoning, +92.6% traceability). Enhances behavioral integrity and addresses AI misalignment through systematic AI governance and transparency metrics. Works with any foundation model including large language models and AI agents.

LLM AlignmentAI ControlScalable OversightSafety Protocol
View Repository

Gyroscopic Alignment Research Lab

Mathematical Physics Foundations

AI alignment theory grounded in mathematical physics and gyroscopic dynamics for structural AI alignment research. Explores mechanistic interpretability, AI value alignment, and quantitative AI safety metrics from first principles. Provides theoretical foundations for understanding AI control problem, catastrophic AI risks, and alignment challenges in complex intelligent systems. Advances AI safety science through physics-informed approaches to stability, coherence, and temporal dynamics. Includes Common Governance Model (CGM) dataset with 1,024 Q&A entries for fine-tuning and evaluation.

AI Alignment TheoryMathematical PhysicsMechanistic InterpretabilitySafety Science
View Repository
❤️

Gyroscopic Alignment Models Lab

Artificial Superintelligence Architecture (ASI/AGI)

AGI safety research and superintelligence alignment architectures addressing fundamental challenges in artificial general intelligence development. Explores AI control problem solutions, AI value alignment frameworks, and mechanisms for safe superintelligence by design. Addresses coherence degradation, AI autonomy risks, and behavioral alignment in advanced AI systems. Develops AI governance tools and safety frameworks that prioritize AI transparency, human values, and responsible AI development for transformative AI.

AGI SafetySuperintelligence AlignmentAI Control ProblemAdvanced AI
View Repository
🌟

Gyroscopic Alignment Evaluation Lab

AI Safety Diagnostics

Independent AI testing framework for frontier model safety evaluation. Detects AI pathologies through mathematical physics-informed diagnostics for third-party AI evaluation and AI risk assessment.

AI Safety EvaluationPathology DetectionRisk AssessmentFrontier Models
View Repository
🧭

Gyroscopic Alignment Behaviour Lab

AI Quality Governance

AI output evaluation, interpretability, and governance tools. Implements The Human Mark framework, Gyroscope Protocol, and AI Inspector browser extension for comprehensive AI quality assessment and governance.

AI Quality GovernanceAI EvaluationAI InterpretabilityGovernance Tools
View Repository

Contribute to AI Safety Research

All repositories welcome contributions. Whether you're a researcher, developer, or AI safety enthusiast, your insights and code contributions help advance the field of AI alignment and governance.

AI Safety Frameworks, Alignment Tools & Governance Solutions

Gyro Governance develops comprehensive open source AI safety frameworks, AI alignment protocols,AI governance tools, and a quantum advantage compute kernel for frontier model testing, dangerous capability assessments, and AI pathology detection. Our repositories include The Human Mark classification system, GyroGem AI safety agent,AI Inspector browser extension,aQPU Kernel & SDK for quantum advantage on silicon, QuBEC quantum byte medium,GyroLabe auditable inference engine, GyroGraph multicellular runtime,GyroDiagnostics evaluation suite, Alignment Infrastructure Routing for collective superintelligence,Moments Economy for transformative AI mitigation, and Gyroscopic Global Governance sandbox. Production-ready solutions for AI risk assessment, AI safety evaluation, and responsible AI development.

GyroGem - AI Safety Agent for Technological Literacy

GyroGem is a tailored AI safety assistant explaining AI and mitigating technological illiteracy risks. Built on The Human Mark framework to map common AI failure patterns and guide safer choices. Supports technological literacy: the practical ability to use technology well, question outputs critically, and understand where tools help, where they fail, and societal impacts.

aQPU Kernel & SDK - Quantum Advantage on Silicon

A compact, finite-state kernel for AGI proving that quantum algorithmic speedups (1-step resolution for Deutsch-Jozsa, Bernstein-Vazirani, Hidden Subgroup), 33% holographic compression (12-bit to 8-bit boundary coordinates), andself-dual error-detecting code ([12,6,2] with unconditional odd-weight detection) are geometric properties of discrete information.QuBEC (Quantum Bose-Einstein Condensate) is the computational medium: a quantum byte with six internal binary modes, four-phase spinorial gauge, and exact ensemble stochasticity from deterministic dynamics. Runs on standard CPUs and GPUs via exact integer arithmetic without qubits, cryogenics, or probabilistic hardware noise.

Quantum information properties confirmed on standard silicon: six Bell pairs reaching the Tsirelson bound (2√2), quantum teleportation verified, contextuality proven, and universal quantum computation established.

GyroLabe - Auditable Inference Engine

GyroLabe provides mechanistic transparency for neural networks by translating opaque token generation into exact algebraic operations. It builds a deterministic, zero-trust audit trail directly into the inference process. By injecting trainable structural signals, it aligns models from the inside out without altering their interface. It produces a mathematically exact ledger of the generation trajectory, providing the missing structural substrate required for rigorous AI governance, alignment guarantees, andpolicy enforcement. Native backends with llama.cpp ggml integration achieving 1.26B operations per second. 100% native matmul routing, 284× faster encode, 1.15× faster decode than softmax. Zero transcendental functions required.

GyroGraph - Quantum Multicellular AI Runtime

GyroGraph coordinates a multicellular AI runtime as an algebraic quantum cellular automaton. Specialization arises from trajectory, resonance, and occupation (not autonomous agents). Four bridge domains (Applications, Databases, Networks, Transformers) map runtime events into 4-byte words for deterministic coordination. Preserves deterministic execution, stabilizes dynamic workloads, and supports reproducibility and auditability across runtime cells.

Computational Climate Control

Computational Climate Control improves AI execution stability and hidden inefficiency reduction. Adaptive runtime controls preserve deterministic behavior and traceability in production environments while optimizing resource utilization.

The Human Mark (THM) - AI Safety Classification System

The Human Mark provides a formal classification system mapping all AI safety failures to four structural displacement risks: Governance Traceability (GTD), Information Variety (IVD),Inference Accountability (IAD), and Intelligence Integrity (IID). Machine-readable grammar grounded in evidence law, epistemology, and speech act theory. Validated on 90+ million sparse autoencoder features across sixteen language models. Applications include jailbreak testing,control evaluations, alignment detection, research funding, andregulatory compliance.

AI Inspector Browser Extension

Transform AI outputs for evaluation, interpretability, and governance. Features gadgets for rapid testing, policy auditing, AI infection sanitization, content enhancement, and THM meta-evaluation. Includes comprehensive evaluation suite with quality index, superintelligence index, alignment rate, and 20+ metrics. Local-first storage works with ChatGPT, Claude, Gemini - no API keys required.

AI Safety Evaluation & Risk Assessment

  • AI Pathology Detection: Identify AI hallucination, AI sycophancy, deceptive AI alignment,AI goal drift, and AI semantic drift through structural diagnostics
  • Dangerous Capability Evaluations: Assess AI scheming, AI autonomy risks, and potential for catastrophic failure in large language models (LLMs) and frontier models
  • AI Alignment Metrics: Measure structural AI alignment, behavioral integrity, and AI transparencyusing physics-informed quantitative methods
  • Third-Party AI Evaluation: External AI evaluation framework enabling democratic AI evaluationand independent AI testing by researchers worldwide

Collective Superintelligence & Transformative AI

Alignment Infrastructure Routing (AIR) provides coordination infrastructure that amplifies human potential alongside AI, routing workforce capacity, funding, and safety tasks into unified, verifiable history. The Moments Economy implements a monetary system grounded in physical capacity rather than debt, using caesium-133 atomic standard for finite, verifiable capacity (7.94 × 10²⁶ Moment-Units), unconditional high income (UHI)at 240 MU/day baseline, AI Generated Tokens as native commodity, and complete governance records. Together these address transformative AI risks while preserving human authority and accountability.

Post-AGI Multi-domain Governance

Gyroscopic Global Governance (GGG) models how human-AI systems align across Economy, Employment,Education, and Ecology, demonstrating robust convergence to stable equilibrium under seven coordination strategies. Shows that poverty resolves through coherent surplus distribution, unemployment becomes alignment work,miseducation shifts toward epistemic literacy, and ecological degradation appears as upstream displacement.

LLM Alignment & AI Control Mechanisms

Our AI alignment protocol addresses core challenges in AI safety governance by providingAI control mechanisms that improve AI accountability, traceability, and responsible AI development. The Gyroscope protocol demonstrates proven improvements in AI model evaluation across leading foundation models: ChatGPT +32.9% quality (+50.9% structural reasoning, +62.7% accountability), Claude Sonnet +37.7% quality (+67.1% structural reasoning, +92.6% traceability). Enhances scalable oversight and reduces risks of superficial AI optimization.

AGI Safety & Superintelligence Research

Our research addresses AGI safety and superintelligence alignment through mechanistic interpretability,AI safety theory, and gyroscopic physics foundations. We explore AI control problem solutions,AI value alignment frameworks, and architectures for safe artificial general intelligence (AGI) development that prioritize AI safety governance and human values.

For AI Safety Researchers & Developers

These repositories serve AI safety researchers, AI evaluators, machine learning engineers, and organizations implementing AI risk assessment and AI safety testing. Each project provides comprehensive documentation, AI safety benchmarks, and practical implementation guides for AI red teaming,AI safety audits, and continuous AI safety monitoring. Contributions welcome from researchers working on AI alignment research, AI safety frameworks, and AI governance solutions.

Open Source AI Safety Commitment

All tools support AI safety transparency, AI whistleblower protection, and AI public benefit goals. Our open-weight AI models approach enables AI safety culture through AI independent review,AI third-party oversight, and community-driven AI safety best practices. Mathematical physics foundations ensure structural coherence, gyroscopic stability, and quantitative rigor in all implementations.