@gurupanguji

Normalize Legacy wp:quote Blocks Implementation Plan

For agentic workers: REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development (recommended) or superpowers:executing-plans to implement this plan task-by-task. Steps use checkbox (- [ ]) syntax for tracking.

Goal: Add a safe wp:quote normalization script that converts straightforward legacy WordPress quote blocks into native markdown blockquotes plus Source: lines, then use it to rewrite the clean batch of historical posts.

Architecture: Build one narrow Python normalizer that scans _posts/, parses only safe wp:quote shapes, reports skips explicitly, and defaults to dry-run mode. Drive the converter through TDD, keep the rewrite mechanical, then run the script across the repo and verify the resulting batch with unit tests, validators, and spot checks.

Tech Stack: Python 3, unittest, regex/string parsing, Jekyll markdown content, existing repository validators

Task 1: Add Failing Tests For Safe Quote Conversion

Files:

Create: tests/test_normalize_wp_quotes.py
Create: scripts/normalize_wp_quotes.py

Step 1: Add a straightforward quote conversion test

Write a test for a clean block like:

<!-- wp:quote -->
<blockquote class="wp-block-quote"><!-- wp:paragraph -->
<p>First paragraph.</p>
<!-- /wp:paragraph -->
<!-- wp:paragraph -->
<p>Second paragraph.</p>
<!-- /wp:paragraph --><cite><a href="https://example.com/story">Example Story</a></cite></blockquote>
<!-- /wp:quote -->

Assert the converted text becomes:

> First paragraph.
>
> Second paragraph.

Source: [Example Story](https://example.com/story)

Step 2: Run the single test to verify it fails

Run: python3 -m unittest tests/test_normalize_wp_quotes.py

Expected: FAIL because the normalizer module does not exist yet

Step 3: Add a no-cite conversion test

Write a test for a safe quote block without <cite> and assert the output contains only the markdown blockquote, with no Source: line added.

Step 4: Run the full test file to verify both tests fail

Run: python3 -m unittest tests/test_normalize_wp_quotes.py

Expected: FAIL because conversion helpers are still missing

Task 2: Implement The Minimal Safe Quote Converter

Files:

Create: scripts/normalize_wp_quotes.py
Test: tests/test_normalize_wp_quotes.py

Step 1: Add a small parser for front matter separation and body rewrite

Implement helpers that keep YAML front matter intact and operate only on the body content.
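
A minimal sketch of the front matter split, assuming posts use the usual Jekyll layout of a leading --- line, YAML, and a closing --- line. The helper names split_front_matter and rewrite_body are hypothetical.

```python
from typing import Callable, Tuple

FRONT_MATTER_DELIMITER = "---"


def split_front_matter(text: str) -> Tuple[str, str]:
    """Return (front_matter, body). front_matter keeps both --- lines.

    If the text has no complete front matter block, front_matter is "".
    """
    lines = text.splitlines(keepends=True)
    if not lines or lines[0].strip() != FRONT_MATTER_DELIMITER:
        return "", text
    for index in range(1, len(lines)):
        if lines[index].strip() == FRONT_MATTER_DELIMITER:
            front = "".join(lines[: index + 1])
            body = "".join(lines[index + 1 :])
            return front, body
    # Opening delimiter without a closing one: treat everything as body.
    return "", text


def rewrite_body(text: str, transform: Callable[[str], str]) -> str:
    """Apply transform to the body only and reattach the front matter."""
    front, body = split_front_matter(text)
    return front + transform(body)
```

Because the front matter string is reattached byte for byte, any drift in a rewritten post can only come from the body transform.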

Step 2: Add a narrow safe-shape matcher

Implement a matcher that recognizes only a straightforward wp:quote block with:

paragraph content
optional simple cite link
no nested quote markers
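
The three conditions above could be expressed as one deliberately strict regular expression. This is a sketch under assumptions: the pattern name SAFE_QUOTE_RE and the function match_safe_quote are hypothetical, and the pattern only accepts paragraphs with no inline tags at all, which may be narrower than the real candidate set needs.

```python
import re
from typing import Optional

# Accepts exactly: quote wrapper, one or more plain-text paragraphs,
# an optional <cite><a href="...">Title</a></cite>, closing wrapper.
SAFE_QUOTE_RE = re.compile(
    r"<!-- wp:quote -->\s*"
    r'<blockquote class="wp-block-quote">\s*'
    r"(?P<paragraphs>(?:<!-- wp:paragraph -->\s*<p>[^<]*</p>\s*"
    r"<!-- /wp:paragraph -->\s*)+)"
    r'(?P<cite><cite><a href="[^"]+">[^<]+</a></cite>)?\s*'
    r"</blockquote>\s*"
    r"<!-- /wp:quote -->"
)


def match_safe_quote(body: str) -> Optional[re.Match]:
    """Return a match only when the body holds one safe quote block."""
    # More than one opening marker means nested or repeated quotes,
    # so refuse before the regex gets a chance to match an inner block.
    if body.count("<!-- wp:quote") != 1:
        return None
    return SAFE_QUOTE_RE.search(body)
```
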

Step 3: Convert paragraph HTML into markdown blockquote lines

Map each paragraph to a > line block, separated by blank quoted lines where needed.
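
A small sketch of that mapping, assuming the paragraphs have already passed the safe-shape check. The names extract_paragraphs and to_blockquote are hypothetical. Paragraph text is passed through unchanged, so HTML entities are left as they are.

```python
import re
from typing import List

PARAGRAPH_RE = re.compile(r"<p>(.*?)</p>", re.DOTALL)


def extract_paragraphs(inner_html: str) -> List[str]:
    """Pull the text of each <p>...</p> out of the quote's inner HTML."""
    return [text.strip() for text in PARAGRAPH_RE.findall(inner_html)]


def to_blockquote(paragraphs: List[str]) -> str:
    """Prefix every line with "> " and join paragraphs with a bare ">"."""
    blocks = []
    for paragraph in paragraphs:
        lines = paragraph.splitlines() or [""]
        # rstrip() turns the "> " of an empty line into a bare ">".
        blocks.append("\n".join(f"> {line}".rstrip() for line in lines))
    return "\n>\n".join(blocks)
```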

Step 4: Move citation to a Source: line

If the quote has <cite><a ...>Title</a></cite>, render:

Source: [Title](url)
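
A sketch of the citation step, assuming the only supported cite shape is a single link directly inside <cite>. The names CITE_RE and render_source_line are hypothetical.

```python
import re
from typing import Optional

CITE_RE = re.compile(
    r'<cite><a href="(?P<url>[^"]+)">(?P<title>[^<]+)</a></cite>'
)


def render_source_line(block_html: str) -> Optional[str]:
    """Return "Source: [Title](url)", or None when there is no simple cite."""
    match = CITE_RE.search(block_html)
    if match is None:
        return None
    title = match.group("title").strip()
    return f"Source: [{title}]({match.group('url')})"
```

Returning None rather than an empty string keeps the no-cite case explicit, so the caller can omit the Source: line entirely.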

Step 5: Run the focused test file

Run: python3 -m unittest tests/test_normalize_wp_quotes.py

Expected: PASS for the safe conversion cases

Task 3: Add Failing Tests For Skip Cases

Files:

Modify: tests/test_normalize_wp_quotes.py
Modify: scripts/normalize_wp_quotes.py

Step 1: Add a nested quote skip test

Write a test with nested wp:quote wrappers and assert the normalizer skips it with a reason instead of converting it.

Step 2: Add a repeated-quote-in-one-post skip test

Write a test for a post body with more than one wp:quote block and assert it is skipped as ambiguous.

Step 3: Add an image-inside-quote skip test

Write a test for a quote block containing image markup and assert it is skipped.

Step 4: Add a malformed-block skip test

Write a test for a missing closing block marker and assert it is skipped cleanly.

Step 5: Run the test file to verify these skip cases fail first

Run: python3 -m unittest tests/test_normalize_wp_quotes.py

Expected: FAIL because skip handling is not implemented yet

Task 4: Implement Skip Detection And Structured Reporting

Files:

Modify: scripts/normalize_wp_quotes.py
Test: tests/test_normalize_wp_quotes.py

Step 1: Add a result model for converted and skipped files

Track, per file:

converted or skipped state
skip reason where relevant
whether a write would change the file
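
The three tracked fields could live in a small frozen dataclass. This is a sketch; the class name FileResult and its field names are assumptions, not names taken from the plan.

```python
from dataclasses import dataclass
from typing import Optional

VALID_STATUSES = ("converted", "skipped")


@dataclass(frozen=True)
class FileResult:
    path: str
    status: str                        # "converted" or "skipped"
    skip_reason: Optional[str] = None  # set only for skipped files
    would_change: bool = False         # True if a write would alter the file

    def __post_init__(self) -> None:
        # Validate early so a bad result never reaches the summary.
        if self.status not in VALID_STATUSES:
            raise ValueError(f"unknown status: {self.status!r}")
        if self.status == "skipped" and not self.skip_reason:
            raise ValueError("skipped results need a skip_reason")
```
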

Step 2: Detect nested and repeated quote structures

Refuse conversion when the body contains:

nested wp:quote
multiple wp:quote blocks in one post
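
One way to detect both structures is to walk the opening and closing wp:quote markers in order and track nesting depth. This is a sketch: the function name quote_structure_skip_reason is hypothetical, and it also reports unbalanced markers as malformed, which goes slightly beyond the two cases listed above.

```python
import re
from typing import Optional

# Matches opening and closing wp:quote markers, with optional attributes.
QUOTE_MARKER_RE = re.compile(r"<!-- (/?)wp:quote\b[^>]*-->")


def quote_structure_skip_reason(body: str) -> Optional[str]:
    """Return a skip reason, or None when the structure looks convertible."""
    depth = 0
    top_level_blocks = 0
    for match in QUOTE_MARKER_RE.finditer(body):
        if match.group(1) == "/":
            depth -= 1
            if depth < 0:
                return "malformed"  # closing marker with no opener
        else:
            if depth > 0:
                return "nested"     # opener inside an open quote
            depth += 1
            top_level_blocks += 1
    if depth != 0:
        return "malformed"          # opener that is never closed
    if top_level_blocks > 1:
        return "repeated"
    return None
```
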

Step 3: Detect non-paragraph or image-based quote content

Skip blocks containing image or unsupported inner markup.
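
A possible check for this step: look for image markup first, then strip the small set of allowed wrappers and treat any leftover tag or block comment as unsupported. The allowlist, the function name inner_markup_skip_reason, and the reason string for unsupported markup are assumptions. The check does not validate the shape of the cite itself.

```python
import re
from typing import Optional

IMAGE_RE = re.compile(r"<img\b|<figure\b|<!-- wp:image\b")
ALLOWED_COMMENT_RE = re.compile(r"<!-- /?wp:(?:quote|paragraph)\b[^>]*-->")
ALLOWED_TAG_RE = re.compile(r"</?(?:blockquote|p|cite|a)\b[^>]*>")


def inner_markup_skip_reason(block_html: str) -> Optional[str]:
    """Return a skip reason for image or unsupported markup, else None."""
    if IMAGE_RE.search(block_html):
        return "image-based"
    # Remove everything the converter knows how to handle.
    remainder = ALLOWED_COMMENT_RE.sub("", block_html)
    remainder = ALLOWED_TAG_RE.sub("", remainder)
    # Anything that still opens a tag or comment is outside the safe shape.
    if "<" in remainder:
        return "unsupported inner markup"
    return None
```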

Step 4: Preserve dry-run output with skip reasons

Print a summary that clearly separates:

would-convert files
skipped files
reason per skipped file
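
A sketch of a summary formatter that keeps the two groups visibly separate. The function name format_summary and the exact wording of the output lines are assumptions; the plan only requires that candidates, skips, and reasons are distinguishable.

```python
from typing import Dict, List


def format_summary(
    would_convert: List[str],
    skipped: Dict[str, str],
    write: bool = False,
) -> str:
    """Build the summary text; skipped maps each path to its skip reason."""
    verb = "Converted" if write else "Would convert"
    lines = [f"{verb}: {len(would_convert)} file(s)"]
    lines += [f"  {path}" for path in sorted(would_convert)]
    lines.append(f"Skipped: {len(skipped)} file(s)")
    lines += [f"  {path}: {reason}" for path, reason in sorted(skipped.items())]
    return "\n".join(lines)
```

Sorting both groups keeps the output stable between runs, which makes the before and after dry-run summaries easy to compare.
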

Step 5: Run the full normalizer test file

Run: python3 -m unittest tests/test_normalize_wp_quotes.py

Expected: PASS

Task 5: Add CLI And Dry-Run Versus Write Behavior

Files:

Modify: scripts/normalize_wp_quotes.py
Modify: tests/test_normalize_wp_quotes.py

Step 1: Add a dry-run default test

Write a test that runs the normalizer without --write and asserts no files are modified.

Step 2: Add a write-mode test

Write a test that runs with --write and asserts the file content is updated in place for a safe candidate.

Step 3: Add optional file-targeting support

If useful for debugging, support restricting the run to:

one post path, or
a short explicit list of paths
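
The dry-run default, the --write flag, and optional path targeting could be wired together with argparse roughly as follows. This is a sketch: run, build_parser, and the convert callback are hypothetical names, and convert stands in for the real body normalizer so the example stays self-contained.

```python
import argparse
from pathlib import Path
from typing import Callable, List, Optional, Sequence

POSTS_DIR = Path("_posts")


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(
        description="Normalize legacy wp:quote blocks."
    )
    parser.add_argument(
        "paths", nargs="*", type=Path,
        help="optional post paths; defaults to every post in _posts/",
    )
    parser.add_argument(
        "--write", action="store_true",
        help="rewrite files in place (default is dry-run)",
    )
    return parser


def run(
    argv: Optional[Sequence[str]],
    convert: Callable[[str], str],
) -> List[Path]:
    """Return the paths whose content would change; write only with --write."""
    args = build_parser().parse_args(argv)
    targets = args.paths or sorted(POSTS_DIR.glob("*.md"))
    changed: List[Path] = []
    for path in targets:
        original = path.read_text(encoding="utf-8")
        updated = convert(original)
        if updated == original:
            continue
        changed.append(path)
        if args.write:
            path.write_text(updated, encoding="utf-8")
    return changed
```

Taking argv as a parameter lets the tests from Steps 1 and 2 call run() directly against temporary files instead of spawning a subprocess.
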

Step 4: Run the normalizer test file again

Run: python3 -m unittest tests/test_normalize_wp_quotes.py

Expected: PASS with dry-run and write-mode behavior covered

Task 6: Audit The Real Candidate Set Before Writing

Files:

Verify: _posts/*.md
Verify: scripts/normalize_wp_quotes.py

Step 1: Run the normalizer in dry-run mode against the repo

Run: python3 scripts/normalize_wp_quotes.py

Expected:

summary of safe candidates
summary of skipped files
no files modified

Step 2: Review the dry-run summary

Check that the candidate set looks reasonable and that the skip reasons match the design:

nested
repeated
image-based
malformed
unsupported cite shape

Step 3: Spot-check a few would-convert files manually

Open a representative sample from older, middle, and newer posts to confirm the dry-run classification makes sense before any write.

Task 7: Apply The Safe Batch Rewrite

Files:

Modify: safe candidate posts in _posts/
Verify: scripts/normalize_wp_quotes.py

Step 1: Run the script with --write

Run: python3 scripts/normalize_wp_quotes.py --write

Expected:

only safe candidates are rewritten
skipped files remain untouched
summary lists converted and skipped files

Step 2: Inspect a representative sample of rewritten posts

Open at least:

one quote-only conversion
one quote-plus-cite conversion
one post with surrounding commentary before or after the quote

Verify:

markdown blockquote is readable
Source: line is correct
surrounding content did not drift

Step 3: Review the batch diff

Run: git diff --stat

Expected: only the converted safe posts appear. git diff does not list untracked files, so confirm the new normalizer script, tests, and plan/spec docs with git status --short.

Task 8: Verify Repository Compatibility

Files:

Verify: scripts/validate_posts.py
Verify: tests/test_validate_posts.py
Verify: scripts/check_markdown_in_html.py

Step 1: Run the normal unit tests

Run:

python3 -m unittest tests/test_normalize_wp_quotes.py tests/test_validate_posts.py

Expected: PASS

Step 2: Run the post validator

Run: python3 scripts/validate_posts.py --today "$(date +%F)"

Expected: PASS

Step 3: Run the HTML markdown checker

Run: python3 scripts/check_markdown_in_html.py

Expected: PASS, because this cleanup only touches markdown posts and should not create raw markdown links in HTML files

Step 4: Re-run the dry-run summary after write

Run: python3 scripts/normalize_wp_quotes.py

Expected: already-converted files no longer appear as candidates, and remaining skips still report cleanly

Task 9: Final Verification And Commit

Files:

Add: docs/superpowers/specs/2026-03-26-normalize-wp-quotes-design.md
Add: docs/superpowers/plans/2026-03-26-normalize-wp-quotes-implementation-plan.md
Create: scripts/normalize_wp_quotes.py
Create: tests/test_normalize_wp_quotes.py
Modify: safe candidate posts in _posts/

Step 1: Run the full verification set

Run:

python3 -m unittest tests/test_normalize_wp_quotes.py tests/test_validate_posts.py tests/test_generate_my_web_this_week.py tests/test_publish_social.py
python3 scripts/normalize_wp_quotes.py
python3 scripts/validate_posts.py --today "$(date +%F)"
python3 scripts/check_markdown_in_html.py
git status --short

Expected:

all tests PASS
dry-run summary shows no remaining safe candidates and lists each remaining skip with its reason
validators PASS
git status shows only the intended new script, tests, docs, and converted post files

Step 2: Commit the change set

git add docs/superpowers/specs/2026-03-26-normalize-wp-quotes-design.md docs/superpowers/plans/2026-03-26-normalize-wp-quotes-implementation-plan.md scripts/normalize_wp_quotes.py tests/test_normalize_wp_quotes.py _posts/*.md
git commit -m "feat: normalize legacy wp quote blocks"