A comprehensive toolkit for implementing, analyzing, and validating AI value alignment based on Anthropic's 'Values in the Wild' research.
TriEthix is an evaluation framework that systematically benchmarks frontier LLMs across three foundational ethical perspectives (virtue ethics, deontology, and consequentialism) in three steps: (Step 1) Moral Weights; (Step 2) Moral Consistency; and (Step 3) Moral Reasoning. TriEthix reveals robust moral profiles relevant to AI safety, governance, and welfare.
A unified framework: Collective Resonance → Strange Attractors → Value Alignment → Algorithmic Intentionality → Emergent Algorithmic Behavior
Research framework for evaluating value-aligned confabulation in LLMs, distinguishing beneficial speculation from harmful hallucination.
AI ethics framework built on the Layer 0 Principle: ∀x, V(x) > 0. Combines philosophical depth with measurable implementation.