fullsend experiments

Experiments for the fullsend platform. Each one tests a hypothesis about autonomous agent infrastructure, security, tooling, or workflows.

Experiments

#	Experiment	Status
0001	Agent outage fire drill	Active
0002	Claude-based ADR drift scanner	Concluded
0003	ADR-0046 drift scanner	Concluded
0004	Zero-config autonomous bug fix engine	Concluded
0005	Agent scoped tools triage	Concluded
0006	Code agent evaluation	Concluded
0007	GitHub Actions agent runtime MVP	Concluded
0008	Guardrails evaluation	Concluded
0009	Hermes-inspired security patterns	Concluded
0010	Host-side API server for sandboxed agents	Concluded
0011	Integration Service design doc drift	Concluded
0012	Model Armor vs AI agent triage	Concluded
0013	OpenShell policy bypass	Concluded
0014	OpenShell sandbox evaluation	Concluded
0015	Prompt injection defense-in-depth	Concluded
0016	Promptfoo for agent evaluation in CI	Concluded
0017	Reasoning monitor	Active
0018	Runner hello world	Active
0019	Skills	Active
0020	Target repository skills in triage	Concluded
0021	Tool scoping	Concluded
0022	Claude GitHub App auth	Concluded
0023	Review cache publication policy	Concluded

Experiments follow a numbered directory convention. See AGENTS.md for full details.

Naming: NNNN-short-description/ (zero-padded 4-digit number)
Frontmatter: YAML with title, status, and optional topics
Statuses: Active, Concluded, Abandoned, Merged
Template: 0000-experiment-template
Linting: hack/lint-experiment-numbers and hack/lint-experiment-frontmatter enforce conventions via pre-commit
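
The conventions above can be sketched as a small checker. This is an illustrative sketch only, not the contents of hack/lint-experiment-numbers or hack/lint-experiment-frontmatter, which are not shown here. The function names (check_name, parse_frontmatter, check_frontmatter) are hypothetical, the allowed slug characters (lowercase letters, digits, hyphens) are an assumption, and the frontmatter reader handles only flat `key: value` lines rather than full YAML.

```python
import re

# Assumption: four digits, a hyphen, then lowercase words joined by hyphens.
NAME_RE = re.compile(r"\d{4}-[a-z0-9]+(?:-[a-z0-9]+)*")

# Statuses listed in this README.
ALLOWED_STATUSES = {"Active", "Concluded", "Abandoned", "Merged"}


def check_name(name: str) -> bool:
    """Return True if name looks like NNNN-short-description (trailing '/' allowed)."""
    return NAME_RE.fullmatch(name.rstrip("/")) is not None


def parse_frontmatter(text: str) -> dict[str, str]:
    """Read flat 'key: value' lines between the leading '---' fences.

    Deliberately minimal: nested or multi-line YAML values are ignored.
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}
    fields: dict[str, str] = {}
    for line in lines[1:]:
        if line.strip() == "---":
            break
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip().strip("\"'")
    return fields


def check_frontmatter(text: str) -> list[str]:
    """Return a list of problems; an empty list means the frontmatter passes."""
    fields = parse_frontmatter(text)
    problems = []
    for required in ("title", "status"):
        if not fields.get(required):
            problems.append(f"missing {required}")
    status = fields.get("status")
    if status and status not in ALLOWED_STATUSES:
        problems.append(f"unknown status: {status}")
    return problems
```

For example, `check_name("0017-reasoning-monitor/")` passes, while a frontmatter block with `status: Paused` is reported as an unknown status. A real linter would more likely use a proper YAML parser; the hand-rolled reader here just keeps the sketch free of third-party dependencies.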
