Testing¶

What it is / when to use it¶

This page documents the contributor testing workflow for PenguiFlow:

Use it before opening a PR and when debugging CI failures.

This page does not define product-level QA or benchmarking.
It does not cover integration tests requiring real network access or external credentials (avoid those in CI).

CI runs on:

Checks enforced before merge:

See .github/workflows/ci.yml.

uv run pytest

uv run ruff check .
uv run mypy

uv run pytest --cov=penguiflow --cov-report=term --cov-report=xml --cov-fail-under=84.5

uv run python scripts/check_md_links.py
uv run mkdocs build --strict

./scripts/validate_frontend.sh

Flaky tests: isolate the failing test, remove timing assumptions, and ensure no external network calls.
Coverage regression: add at least one negative/error-path test for new behavior (policy target is ≥85%).
Docs build fails under --strict: fix broken links or missing pages; avoid linking from curated docs to excluded/internal docs.
Frontend validation fails: rebuild UI assets under penguiflow/cli/playground_ui and re-run.

Use pytest -vv -s for more output during debugging.
Prefer targeted test runs (single file) before full suite when iterating locally.

Never run tests that require real API keys in CI.
Avoid recording real tool payloads in golden snapshots; keep fixtures synthetic and redacted.

If CI fails but local passes:
verify Python version,
run the exact CI commands above,
confirm you installed the same dependency groups (.[dev], .[docs] when needed).