Prompt Engines Lab Notes

Real experiments. Published findings. No filler.

We test AI systems in production and write about what holds up. Security, architecture, model evaluation, and tooling decisions from active builds.

Latest

7 notes

Feb 25, 2026 · Security

Openclaw: security and deployment best practices with Docker

Air-gapped Docker containers, hardened networking, pinned images, and zero outbound access for agent deployments that handle real data.

Feb 25, 2026 · Architecture

Tutor architecture: how we structure AI tutoring systems

Session handling, knowledge routing, adaptive difficulty, and the feedback loop.

Feb 24, 2026 · Agents

Nanoclaw vs Openclaw: which one to deploy

Deployment framework for choosing the right agent runtime by team shape and operational constraints.

Feb 23, 2026 · Tooling

Opencode: an open alternative to Claude Code

Where open coding assistants work today, and where managed tools still win.

Feb 22, 2026 · Evaluation

ChatGPT Codex 5.3 vs Claude Opus 4.6 for core build

Edit accuracy, recovery speed, and architecture-level task comparison.

Feb 21, 2026 · RAG

RAG without hallucination

Guardrails for retrieval-only responses that stay faithful to approved documentation.

Feb 20, 2026 · Image Models

Aesthetic scoring for early childhood

Scoring rubric for generated visuals intended for young children.

Archive

Feb 25 · 9 min · Security

Openclaw: security and deployment best practices with Docker

Feb 25 · 10 min · Architecture

Tutor architecture: how we structure AI tutoring systems

Feb 24 · 7 min · Agents

Nanoclaw vs Openclaw: which one should you deploy for your agent system

Feb 23 · 6 min · Tooling

Opencode: an open alternative to Claude Code

Feb 22 · 8 min · Evaluation

Reviewing ChatGPT Codex 5.3 vs Claude Opus 4.6 for core build

Feb 21 · 7 min · RAG

RAG without hallucination: answer from official documents only

Feb 20 · 6 min · Image Models

Aesthetic scoring for early childhood: image models and prompts reviewed