···11+---
22+name: critique
33+description: Evaluate design from a UX perspective, assessing visual hierarchy, information architecture, emotional resonance, cognitive load, and overall quality with quantitative scoring, persona-based testing, automated anti-pattern detection, and actionable feedback. Use when the user asks to review, critique, evaluate, or give feedback on a design or component.
44+version: 2.1.1
55+user-invocable: true
66+argument-hint: "[area (feature, page, component...)]"
77+---
88+99+## STEPS
1010+1111+### Step 1: Preparation
1212+1313+Invoke /impeccable, which contains design principles, anti-patterns, and the **Context Gathering Protocol**. Follow the protocol before proceeding. If no design context exists yet, you MUST run /impeccable teach first. Additionally gather: what the interface is trying to accomplish.
1414+1515+### Step 2: Gather Assessments
1616+1717+Launch two independent assessments. **Neither must see the other's output** to avoid bias.
1818+1919+You SHOULD delegate each assessment to a separate sub-agent for independence. Use your environment's agent spawning mechanism (e.g., Claude Code's `Agent` tool, or Codex's subagent spawning). Sub-agents should return their findings as structured text. Do NOT output findings to the user yet.
2020+2121+If sub-agents are not available in the current environment, complete each assessment sequentially, writing findings to internal notes before proceeding.
2222+2323+**Tab isolation**: When browser automation is available, each assessment MUST create its own new tab. Never reuse an existing tab, even if one is already open at the correct URL. This prevents the two assessments from interfering with each other's page state.
2424+2525+#### Assessment A: LLM Design Review
2626+2727+Read the relevant source files (HTML, CSS, JS/TS) and, if browser automation is available, visually inspect the live page. **Create a new tab** for this; do not reuse existing tabs. After navigation, label the tab by setting the document title:
2828+2929+```javascript
3030+document.title = "[LLM] " + document.title;
3131+```
3232+3333+Think like a design director. Evaluate:
3434+3535+**AI Slop Detection (CRITICAL)**: Does this look like every other AI-generated interface? Review against ALL **DON'T** guidelines in the impeccable skill. Check for AI color palette, gradient text, dark glows, glassmorphism, hero metric layouts, identical card grids, generic fonts, and all other tells. **The test**: If someone said "AI made this," would you believe them immediately?
3636+3737+**Holistic Design Review**: visual hierarchy (eye flow, primary action clarity), information architecture (structure, grouping, cognitive load), emotional resonance (does it match brand and audience?), discoverability (are interactive elements obvious?), composition (balance, whitespace, rhythm), typography (hierarchy, readability, font choices), color (purposeful use, cohesion, accessibility), states & edge cases (empty, loading, error, success), microcopy (clarity, tone, helpfulness).
3838+3939+**Cognitive Load** (consult [cognitive-load](reference/cognitive-load.md)):
4040+4141+- Run the 8-item cognitive load checklist. Report failure count: 0-1 = low (good), 2-3 = moderate, 4+ = critical.
4242+- Count visible options at each decision point. If >4, flag it.
4343+- Check for progressive disclosure: is complexity revealed only when needed?
4444+4545+**Emotional Journey**:
4646+4747+- What emotion does this interface evoke? Is that intentional?
4848+- **Peak-end rule**: Is the most intense moment positive? Does the experience end well?
4949+- **Emotional valleys**: Check for anxiety spikes at high-stakes moments (payment, delete, commit). Are there design interventions (progress indicators, reassurance copy, undo options)?
5050+5151+**Nielsen's Heuristics** (consult [heuristics-scoring](reference/heuristics-scoring.md)):
5252+Score each of the 10 heuristics 0-4. This scoring will be presented in the report.
5353+5454+Return structured findings covering: AI slop verdict, heuristic scores, cognitive load assessment, what's working (2-3 items), priority issues (3-5 with what/why/fix), minor observations, and provocative questions.
5555+5656+#### Assessment B: Automated Detection
5757+5858+Run the bundled deterministic detector, which flags 25 specific patterns (AI slop tells + general design quality).
5959+6060+**CLI scan**:
6161+6262+```bash
6363+npx impeccable --json [--fast] [target]
6464+```
6565+6666+- Pass HTML/JSX/TSX/Vue/Svelte files or directories as `[target]` (anything with markup). Do not pass CSS-only files.
6767+- For URLs, skip the CLI scan (it requires Puppeteer). Use browser visualization instead.
6868+- For large directories (200+ scannable files), use `--fast` (regex-only, skips jsdom)
6969+- For 500+ files, narrow scope or ask the user
7070+- Exit code 0 = clean, 2 = findings
7171+7272+**Browser visualization** (when browser automation tools are available AND the target is a viewable page):
7373+7474+The overlay is a **visual aid for the user**. It highlights issues directly in their browser. Do NOT scroll through the page to screenshot overlays. Instead, read the console output to get the results programmatically.
7575+7676+1. **Start the live detection server**:
7777+ ```bash
7878+ npx impeccable live &
7979+ ```
8080+ Note the port printed to stdout (auto-assigned). Use `--port=PORT` to fix it.
8181+2. **Create a new tab** and navigate to the page (use dev server URL for local files, or direct URL). Do not reuse existing tabs.
8282+3. **Label the tab** via `javascript_tool` so the user can distinguish it:
8383+ ```javascript
8484+ document.title = "[Human] " + document.title;
8585+ ```
8686+4. **Scroll to top** to ensure the page is scrolled to the very top before injection
8787+5. **Inject** via `javascript_tool` (replace PORT with the port from step 1):
8888+ ```javascript
8989+ const s = document.createElement("script");
9090+ s.src = "http://localhost:PORT/detect.js";
9191+ document.head.appendChild(s);
9292+ ```
9393+6. Wait 2-3 seconds for the detector to render overlays
9494+7. **Read results from console** using `read_console_messages` with pattern `impeccable`. The detector logs all findings with the `[impeccable]` prefix. Do NOT scroll through the page to take screenshots of the overlays.
9595+8. **Cleanup**: Stop the live server when done:
9696+ ```bash
9797+ npx impeccable live stop
9898+ ```
9999+100100+For multi-view targets, inject on 3-5 representative pages. If injection fails, continue with CLI results only.
101101+102102+Return: CLI findings (JSON), browser console findings (if applicable), and any false positives noted.
103103+104104+### Step 3: Generate Combined Critique Report
105105+106106+Synthesize both assessments into a single report. Do NOT simply concatenate. Weave the findings together, noting where the LLM review and detector agree, where the detector caught issues the LLM missed, and where detector findings are false positives.
107107+108108+Structure your feedback as a design director would:
109109+110110+#### Design Health Score
111111+112112+> _Consult [heuristics-scoring](reference/heuristics-scoring.md)_
113113+114114+Present the Nielsen's 10 heuristics scores as a table:
115115+116116+| # | Heuristic | Score | Key Issue |
117117+| --------- | ------------------------------- | --------- | ------------------------------------ |
118118+| 1 | Visibility of System Status | ? | [specific finding or "n/a" if solid] |
119119+| 2 | Match System / Real World | ? | |
120120+| 3 | User Control and Freedom | ? | |
121121+| 4 | Consistency and Standards | ? | |
122122+| 5 | Error Prevention | ? | |
123123+| 6 | Recognition Rather Than Recall | ? | |
124124+| 7 | Flexibility and Efficiency | ? | |
125125+| 8 | Aesthetic and Minimalist Design | ? | |
126126+| 9 | Error Recovery | ? | |
127127+| 10 | Help and Documentation | ? | |
128128+| **Total** | | **??/40** | **[Rating band]** |
129129+130130+Be honest with scores. A 4 means genuinely excellent. Most real interfaces score 20-32.
131131+132132+#### Anti-Patterns Verdict
133133+134134+**Start here.** Does this look AI-generated?
135135+136136+**LLM assessment**: Your own evaluation of AI slop tells. Cover overall aesthetic feel, layout sameness, generic composition, missed opportunities for personality.
137137+138138+**Deterministic scan**: Summarize what the automated detector found, with counts and file locations. Note any additional issues the detector caught that you missed, and flag any false positives.
139139+140140+**Visual overlays** (if browser was used): Tell the user that overlays are now visible in the **[Human]** tab in their browser, highlighting the detected issues. Summarize what the console output reported.
141141+142142+#### Overall Impression
143143+144144+A brief gut reaction: what works, what doesn't, and the single biggest opportunity.
145145+146146+#### What's Working
147147+148148+Highlight 2-3 things done well. Be specific about why they work.
149149+150150+#### Priority Issues
151151+152152+The 3-5 most impactful design problems, ordered by importance.
153153+154154+For each issue, tag with **P0-P3 severity** (consult [heuristics-scoring](reference/heuristics-scoring.md) for severity definitions):
155155+156156+- **[P?] What**: Name the problem clearly
157157+- **Why it matters**: How this hurts users or undermines goals
158158+- **Fix**: What to do about it (be concrete)
159159+- **Suggested command**: Which command could address this (from: /animate, /quieter, /shape, /optimize, /adapt, /clarify, /layout, /distill, /delight, /audit, /harden, /polish, /bolder, /typeset, /critique, /colorize, /overdrive)
160160+161161+#### Persona Red Flags
162162+163163+> _Consult [personas](reference/personas.md)_
164164+165165+Auto-select 2-3 personas most relevant to this interface type (use the selection table in the reference). If `.github/copilot-instructions.md` contains a `## Design Context` section from `impeccable teach`, also generate 1-2 project-specific personas from the audience/brand info.
166166+167167+For each selected persona, walk through the primary user action and list specific red flags found:
168168+169169+**Alex (Power User)**: No keyboard shortcuts detected. Form requires 8 clicks for primary action. Forced modal onboarding. High abandonment risk.
170170+171171+**Jordan (First-Timer)**: Icon-only nav in sidebar. Technical jargon in error messages ("404 Not Found"). No visible help. Will abandon at step 2.
172172+173173+Be specific. Name the exact elements and interactions that fail each persona. Don't write generic persona descriptions; write what broke for them.
174174+175175+#### Minor Observations
176176+177177+Quick notes on smaller issues worth addressing.
178178+179179+#### Questions to Consider
180180+181181+Provocative questions that might unlock better solutions:
182182+183183+- "What if the primary action were more prominent?"
184184+- "Does this need to feel this complex?"
185185+- "What would a confident version of this look like?"
186186+187187+**Remember**:
188188+189189+- Be direct. Vague feedback wastes everyone's time.
190190+- Be specific. "The submit button," not "some elements."
191191+- Say what's wrong AND why it matters to users.
192192+- Give concrete suggestions, not just "consider exploring..."
193193+- Prioritize ruthlessly. If everything is important, nothing is.
194194+- Don't soften criticism. Developers need honest feedback to ship great design.
195195+196196+### Step 4: Ask the User
197197+198198+**After presenting findings**, use targeted questions based on what was actually found. ask the user directly to clarify what you cannot infer. These answers will shape the action plan.
199199+200200+Ask questions along these lines (adapt to the specific findings; do NOT ask generic questions):
201201+202202+1. **Priority direction**: Based on the issues found, ask which category matters most to the user right now. For example: "I found problems with visual hierarchy, color usage, and information overload. Which area should we tackle first?" Offer the top 2-3 issue categories as options.
203203+204204+2. **Design intent**: If the critique found a tonal mismatch, ask whether it was intentional. For example: "The interface feels clinical and corporate. Is that the intended tone, or should it feel warmer/bolder/more playful?" Offer 2-3 tonal directions as options based on what would fix the issues found.
205205+206206+3. **Scope**: Ask how much the user wants to take on. For example: "I found N issues. Want to address everything, or focus on the top 3?" Offer scope options like "Top 3 only", "All issues", "Critical issues only".
207207+208208+4. **Constraints** (optional; only ask if relevant): If the findings touch many areas, ask if anything is off-limits. For example: "Should any sections stay as-is?" This prevents the plan from touching things the user considers done.
209209+210210+**Rules for questions**:
211211+212212+- Every question must reference specific findings from the report. Never ask generic "who is your audience?" questions.
213213+- Keep it to 2-4 questions maximum. Respect the user's time.
214214+- Offer concrete options, not open-ended prompts.
215215+- If findings are straightforward (e.g., only 1-2 clear issues), skip questions and go directly to Step 5.
216216+217217+### Step 5: Recommended Actions
218218+219219+**After receiving the user's answers**, present a prioritized action summary reflecting the user's priorities and scope from Step 4.
220220+221221+#### Action Summary
222222+223223+List recommended commands in priority order, based on the user's answers:
224224+225225+1. **`/command-name`**: Brief description of what to fix (specific context from critique findings)
226226+2. **`/command-name`**: Brief description (specific context)
227227+ ...
228228+229229+**Rules for recommendations**:
230230+231231+- Only recommend commands from: /animate, /quieter, /shape, /optimize, /adapt, /clarify, /layout, /distill, /delight, /audit, /harden, /polish, /bolder, /typeset, /critique, /colorize, /overdrive
232232+- Order by the user's stated priorities first, then by impact
233233+- Each item's description should carry enough context that the command knows what to focus on
234234+- Map each Priority Issue to the appropriate command
235235+- Skip commands that would address zero issues
236236+- If the user chose a limited scope, only include items within that scope
237237+- If the user marked areas as off-limits, exclude commands that would touch those areas
238238+- End with `/polish` as the final step if any fixes were recommended
239239+240240+After presenting the summary, tell the user:
241241+242242+> You can ask me to run these one at a time, all at once, or in any order you prefer.
243243+>
244244+> Re-run `/critique` after fixes to see your score improve.
···11+# Cognitive Load Assessment
22+33+Cognitive load is the total mental effort required to use an interface. Overloaded users make mistakes, get frustrated, and leave. This reference helps identify and fix cognitive overload.
44+55+---
66+77+## Three Types of Cognitive Load
88+99+### Intrinsic Load — The Task Itself
1010+1111+Complexity inherent to what the user is trying to do. You can't eliminate this, but you can structure it.
1212+1313+**Manage it by**:
1414+1515+- Breaking complex tasks into discrete steps
1616+- Providing scaffolding (templates, defaults, examples)
1717+- Progressive disclosure — show what's needed now, hide the rest
1818+- Grouping related decisions together
1919+2020+### Extraneous Load — Bad Design
2121+2222+Mental effort caused by poor design choices. **Eliminate this ruthlessly** — it's pure waste.
2323+2424+**Common sources**:
2525+2626+- Confusing navigation that requires mental mapping
2727+- Unclear labels that force users to guess meaning
2828+- Visual clutter competing for attention
2929+- Inconsistent patterns that prevent learning
3030+- Unnecessary steps between user intent and result
3131+3232+### Germane Load — Learning Effort
3333+3434+Mental effort spent building understanding. This is _good_ cognitive load — it leads to mastery.
3535+3636+**Support it by**:
3737+3838+- Progressive disclosure that reveals complexity gradually
3939+- Consistent patterns that reward learning
4040+- Feedback that confirms correct understanding
4141+- Onboarding that teaches through action, not walls of text
4242+4343+---
4444+4545+## Cognitive Load Checklist
4646+4747+Evaluate the interface against these 8 items:
4848+4949+- [ ] **Single focus**: Can the user complete their primary task without distraction from competing elements?
5050+- [ ] **Chunking**: Is information presented in digestible groups (≤4 items per group)?
5151+- [ ] **Grouping**: Are related items visually grouped together (proximity, borders, shared background)?
5252+- [ ] **Visual hierarchy**: Is it immediately clear what's most important on the screen?
5353+- [ ] **One thing at a time**: Can the user focus on a single decision before moving to the next?
5454+- [ ] **Minimal choices**: Are decisions simplified (≤4 visible options at any decision point)?
5555+- [ ] **Working memory**: Does the user need to remember information from a previous screen to act on the current one?
5656+- [ ] **Progressive disclosure**: Is complexity revealed only when the user needs it?
5757+5858+**Scoring**: Count the failed items. 0–1 failures = low cognitive load (good). 2–3 = moderate (address soon). 4+ = high cognitive load (critical fix needed).
5959+6060+---
6161+6262+## The Working Memory Rule
6363+6464+**Humans can hold ≤4 items in working memory at once** (Miller's Law revised by Cowan, 2001).
6565+6666+At any decision point, count the number of distinct options, actions, or pieces of information a user must simultaneously consider:
6767+6868+- **≤4 items**: Within working memory limits — manageable
6969+- **5–7 items**: Pushing the boundary — consider grouping or progressive disclosure
7070+- **8+ items**: Overloaded — users will skip, misclick, or abandon
7171+7272+**Practical applications**:
7373+7474+- Navigation menus: ≤5 top-level items (group the rest under clear categories)
7575+- Form sections: ≤4 fields visible per group before a visual break
7676+- Action buttons: 1 primary, 1–2 secondary, group the rest in a menu
7777+- Dashboard widgets: ≤4 key metrics visible without scrolling
7878+- Pricing tiers: ≤3 options (more causes analysis paralysis)
7979+8080+---
8181+8282+## Common Cognitive Load Violations
8383+8484+### 1. The Wall of Options
8585+8686+**Problem**: Presenting 10+ choices at once with no hierarchy.
8787+**Fix**: Group into categories, highlight recommended, use progressive disclosure.
8888+8989+### 2. The Memory Bridge
9090+9191+**Problem**: User must remember info from step 1 to complete step 3.
9292+**Fix**: Keep relevant context visible, or repeat it where it's needed.
9393+9494+### 3. The Hidden Navigation
9595+9696+**Problem**: User must build a mental map of where things are.
9797+**Fix**: Always show current location (breadcrumbs, active states, progress indicators).
9898+9999+### 4. The Jargon Barrier
100100+101101+**Problem**: Technical or domain language forces translation effort.
102102+**Fix**: Use plain language. If domain terms are unavoidable, define them inline.
103103+104104+### 5. The Visual Noise Floor
105105+106106+**Problem**: Every element has the same visual weight — nothing stands out.
107107+**Fix**: Establish clear hierarchy: one primary element, 2–3 secondary, everything else muted.
108108+109109+### 6. The Inconsistent Pattern
110110+111111+**Problem**: Similar actions work differently in different places.
112112+**Fix**: Standardize interaction patterns. Same type of action = same type of UI.
113113+114114+### 7. The Multi-Task Demand
115115+116116+**Problem**: Interface requires processing multiple simultaneous inputs (reading + deciding + navigating).
117117+**Fix**: Sequence the steps. Let the user do one thing at a time.
118118+119119+### 8. The Context Switch
120120+121121+**Problem**: User must jump between screens/tabs/modals to gather info for a single decision.
122122+**Fix**: Co-locate the information needed for each decision. Reduce back-and-forth.
···11+# Heuristics Scoring Guide
22+33+Score each of Nielsen's 10 Usability Heuristics on a 0–4 scale. Be honest — a 4 means genuinely excellent, not "good enough."
44+55+## Nielsen's 10 Heuristics
66+77+### 1. Visibility of System Status
88+99+Keep users informed about what's happening through timely, appropriate feedback.
1010+1111+**Check for**:
1212+1313+- Loading indicators during async operations
1414+- Confirmation of user actions (save, submit, delete)
1515+- Progress indicators for multi-step processes
1616+- Current location in navigation (breadcrumbs, active states)
1717+- Form validation feedback (inline, not just on submit)
1818+1919+**Scoring**:
2020+| Score | Criteria |
2121+|-------|----------|
2222+| 0 | No feedback — user is guessing what happened |
2323+| 1 | Rare feedback — most actions produce no visible response |
2424+| 2 | Partial — some states communicated, major gaps remain |
2525+| 3 | Good — most operations give clear feedback, minor gaps |
2626+| 4 | Excellent — every action confirms, progress is always visible |
2727+2828+### 2. Match Between System and Real World
2929+3030+Speak the user's language. Follow real-world conventions. Information appears in natural, logical order.
3131+3232+**Check for**:
3333+3434+- Familiar terminology (no unexplained jargon)
3535+- Logical information order matching user expectations
3636+- Recognizable icons and metaphors
3737+- Domain-appropriate language for the target audience
3838+- Natural reading flow (left-to-right, top-to-bottom priority)
3939+4040+**Scoring**:
4141+| Score | Criteria |
4242+|-------|----------|
4343+| 0 | Pure tech jargon, alien to users |
4444+| 1 | Mostly confusing — requires domain expertise to navigate |
4545+| 2 | Mixed — some plain language, some jargon leaks through |
4646+| 3 | Mostly natural — occasional term needs context |
4747+| 4 | Speaks the user's language fluently throughout |
4848+4949+### 3. User Control and Freedom
5050+5151+Users need a clear "emergency exit" from unwanted states without extended dialogue.
5252+5353+**Check for**:
5454+5555+- Undo/redo functionality
5656+- Cancel buttons on forms and modals
5757+- Clear navigation back to safety (home, previous)
5858+- Easy way to clear filters, search, selections
5959+- Escape from long or multi-step processes
6060+6161+**Scoring**:
6262+| Score | Criteria |
6363+|-------|----------|
6464+| 0 | Users get trapped — no way out without refreshing |
6565+| 1 | Difficult exits — must find obscure paths to escape |
6666+| 2 | Some exits — main flows have escape, edge cases don't |
6767+| 3 | Good control — users can exit and undo most actions |
6868+| 4 | Full control — undo, cancel, back, and escape everywhere |
6969+7070+### 4. Consistency and Standards
7171+7272+Users shouldn't wonder whether different words, situations, or actions mean the same thing.
7373+7474+**Check for**:
7575+7676+- Consistent terminology throughout the interface
7777+- Same actions produce same results everywhere
7878+- Platform conventions followed (standard UI patterns)
7979+- Visual consistency (colors, typography, spacing, components)
8080+- Consistent interaction patterns (same gesture = same behavior)
8181+8282+**Scoring**:
8383+| Score | Criteria |
8484+|-------|----------|
8585+| 0 | Inconsistent everywhere — feels like different products stitched together |
8686+| 1 | Many inconsistencies — similar things look/behave differently |
8787+| 2 | Partially consistent — main flows match, details diverge |
8888+| 3 | Mostly consistent — occasional deviation, nothing confusing |
8989+| 4 | Fully consistent — cohesive system, predictable behavior |
9090+9191+### 5. Error Prevention
9292+9393+Better than good error messages is a design that prevents problems in the first place.
9494+9595+**Check for**:
9696+9797+- Confirmation before destructive actions (delete, overwrite)
9898+- Constraints preventing invalid input (date pickers, dropdowns)
9999+- Smart defaults that reduce errors
100100+- Clear labels that prevent misunderstanding
101101+- Autosave and draft recovery
102102+103103+**Scoring**:
104104+| Score | Criteria |
105105+|-------|----------|
106106+| 0 | Errors easy to make — no guardrails anywhere |
107107+| 1 | Few safeguards — some inputs validated, most aren't |
108108+| 2 | Partial prevention — common errors caught, edge cases slip |
109109+| 3 | Good prevention — most error paths blocked proactively |
110110+| 4 | Excellent — errors nearly impossible through smart constraints |
111111+112112+### 6. Recognition Rather Than Recall
113113+114114+Minimize memory load. Make objects, actions, and options visible or easily retrievable.
115115+116116+**Check for**:
117117+118118+- Visible options (not buried in hidden menus)
119119+- Contextual help when needed (tooltips, inline hints)
120120+- Recent items and history
121121+- Autocomplete and suggestions
122122+- Labels on icons (not icon-only navigation)
123123+124124+**Scoring**:
125125+| Score | Criteria |
126126+|-------|----------|
127127+| 0 | Heavy memorization — users must remember paths and commands |
128128+| 1 | Mostly recall — many hidden features, few visible cues |
129129+| 2 | Some aids — main actions visible, secondary features hidden |
130130+| 3 | Good recognition — most things discoverable, few memory demands |
131131+| 4 | Everything discoverable — users never need to memorize |
132132+133133+### 7. Flexibility and Efficiency of Use
134134+135135+Accelerators — invisible to novices — speed up expert interaction.
136136+137137+**Check for**:
138138+139139+- Keyboard shortcuts for common actions
140140+- Customizable interface elements
141141+- Recent items and favorites
142142+- Bulk/batch actions
143143+- Power user features that don't complicate the basics
144144+145145+**Scoring**:
146146+| Score | Criteria |
147147+|-------|----------|
148148+| 0 | One rigid path — no shortcuts or alternatives |
149149+| 1 | Limited flexibility — few alternatives to the main path |
150150+| 2 | Some shortcuts — basic keyboard support, limited bulk actions |
151151+| 3 | Good accelerators — keyboard nav, some customization |
152152+| 4 | Highly flexible — multiple paths, power features, customizable |
153153+154154+### 8. Aesthetic and Minimalist Design
155155+156156+Interfaces should not contain irrelevant or rarely needed information. Every element should serve a purpose.
157157+158158+**Check for**:
159159+160160+- Only necessary information visible at each step
161161+- Clear visual hierarchy directing attention
162162+- Purposeful use of color and emphasis
163163+- No decorative clutter competing for attention
164164+- Focused, uncluttered layouts
165165+166166+**Scoring**:
167167+| Score | Criteria |
168168+|-------|----------|
169169+| 0 | Overwhelming — everything competes for attention equally |
170170+| 1 | Cluttered — too much noise, hard to find what matters |
171171+| 2 | Some clutter — main content clear, periphery noisy |
172172+| 3 | Mostly clean — focused design, minor visual noise |
173173+| 4 | Perfectly minimal — every element earns its pixel |
174174+175175+### 9. Help Users Recognize, Diagnose, and Recover from Errors
176176+177177+Error messages should use plain language, precisely indicate the problem, and constructively suggest a solution.
178178+179179+**Check for**:
180180+181181+- Plain language error messages (no error codes for users)
182182+- Specific problem identification ("Email is missing @" not "Invalid input")
183183+- Actionable recovery suggestions
184184+- Errors displayed near the source of the problem
185185+- Non-blocking error handling (don't wipe the form)
186186+187187+**Scoring**:
188188+| Score | Criteria |
189189+|-------|----------|
190190+| 0 | Cryptic errors — codes, jargon, or no message at all |
191191+| 1 | Vague errors — "Something went wrong" with no guidance |
192192+| 2 | Clear but unhelpful — names the problem but not the fix |
193193+| 3 | Clear with suggestions — identifies problem and offers next steps |
194194+| 4 | Perfect recovery — pinpoints issue, suggests fix, preserves user work |
195195+196196+### 10. Help and Documentation
197197+198198+Even if the system is usable without docs, help should be easy to find, task-focused, and concise.
199199+200200+**Check for**:
201201+202202+- Searchable help or documentation
203203+- Contextual help (tooltips, inline hints, guided tours)
204204+- Task-focused organization (not feature-organized)
205205+- Concise, scannable content
206206+- Easy access without leaving current context
207207+208208+**Scoring**:
209209+| Score | Criteria |
210210+|-------|----------|
211211+| 0 | No help available anywhere |
212212+| 1 | Help exists but hard to find or irrelevant |
213213+| 2 | Basic help — FAQ or docs exist, not contextual |
214214+| 3 | Good documentation — searchable, mostly task-focused |
215215+| 4 | Excellent contextual help — right info at the right moment |
216216+217217+---
218218+219219+## Score Summary
220220+221221+**Total possible**: 40 points (10 heuristics × 4 max)
222222+223223+| Score Range | Rating | What It Means |
224224+| ----------- | ---------- | ------------------------------------------------------ |
225225+| 36–40 | Excellent | Minor polish only — ship it |
226226+| 28–35 | Good | Address weak areas, solid foundation |
227227+| 20–27 | Acceptable | Significant improvements needed before users are happy |
228228+| 12–19 | Poor | Major UX overhaul required — core experience broken |
229229+| 0–11 | Critical | Redesign needed — unusable in current state |
230230+231231+---
232232+233233+## Issue Severity (P0–P3)
234234+235235+Tag each individual issue found during scoring with a priority level:
236236+237237+| Priority | Name | Description | Action |
238238+| -------- | -------- | ------------------------------------------ | --------------------------------------- |
239239+| **P0** | Blocking | Prevents task completion entirely | Fix immediately — this is a showstopper |
240240+| **P1** | Major | Causes significant difficulty or confusion | Fix before release |
241241+| **P2** | Minor | Annoyance, but workaround exists | Fix in next pass |
242242+| **P3** | Polish | Nice-to-fix, no real user impact | Fix if time permits |
243243+244244+**Tip**: If you're unsure between two levels, ask: "Would a user contact support about this?" If yes, it's at least P1.
+193
.agents/skills/critique/reference/personas.md
···11+# Persona-Based Design Testing
22+33+Test the interface through the eyes of 5 distinct user archetypes. Each persona exposes different failure modes that a single "design director" perspective would miss.
44+55+**How to use**: Select 2–3 personas most relevant to the interface being critiqued. Walk through the primary user action as each persona. Report specific red flags — not generic concerns.
66+77+---
88+99+## 1. Impatient Power User — "Alex"
1010+1111+**Profile**: Expert with similar products. Expects efficiency, hates hand-holding. Will find shortcuts or leave.
1212+1313+**Behaviors**:
1414+1515+- Skips all onboarding and instructions
1616+- Looks for keyboard shortcuts immediately
1717+- Tries to bulk-select, batch-edit, and automate
1818+- Gets frustrated by required steps that feel unnecessary
1919+- Abandons if anything feels slow or patronizing
2020+2121+**Test Questions**:
2222+2323+- Can Alex complete the core task in under 60 seconds?
2424+- Are there keyboard shortcuts for common actions?
2525+- Can onboarding be skipped entirely?
2626+- Do modals have keyboard dismiss (Esc)?
2727+- Is there a "power user" path (shortcuts, bulk actions)?
2828+2929+**Red Flags** (report these specifically):
3030+3131+- Forced tutorials or unskippable onboarding
3232+- No keyboard navigation for primary actions
3333+- Slow animations that can't be skipped
3434+- One-item-at-a-time workflows where batch would be natural
3535+- Redundant confirmation steps for low-risk actions
3636+3737+---
3838+3939+## 2. Confused First-Timer — "Jordan"
4040+4141+**Profile**: Never used this type of product. Needs guidance at every step. Will abandon rather than figure it out.
4242+4343+**Behaviors**:
4444+4545+- Reads all instructions carefully
4646+- Hesitates before clicking anything unfamiliar
4747+- Looks for help or support constantly
4848+- Misunderstands jargon and abbreviations
4949+- Takes the most literal interpretation of any label
5050+5151+**Test Questions**:
5252+5353+- Is the first action obviously clear within 5 seconds?
5454+- Are all icons labeled with text?
5555+- Is there contextual help at decision points?
5656+- Does terminology assume prior knowledge?
5757+- Is there a clear "back" or "undo" at every step?
5858+5959+**Red Flags** (report these specifically):
6060+6161+- Icon-only navigation with no labels
6262+- Technical jargon without explanation
6363+- No visible help option or guidance
6464+- Ambiguous next steps after completing an action
6565+- No confirmation that an action succeeded
6666+6767+---
6868+6969+## 3. Accessibility-Dependent User — "Sam"
7070+7171+**Profile**: Uses screen reader (VoiceOver/NVDA), keyboard-only navigation. May have low vision, motor impairment, or cognitive differences.
7272+7373+**Behaviors**:
7474+7575+- Tabs through the interface linearly
7676+- Relies on ARIA labels and heading structure
7777+- Cannot see hover states or visual-only indicators
7878+- Needs adequate color contrast (4.5:1 minimum)
7979+- May use browser zoom up to 200%
8080+8181+**Test Questions**:
8282+8383+- Can the entire primary flow be completed keyboard-only?
8484+- Are all interactive elements focusable with visible focus indicators?
8585+- Do images have meaningful alt text?
8686+- Is color contrast WCAG AA compliant (4.5:1 for text)?
8787+- Does the screen reader announce state changes (loading, success, errors)?
8888+8989+**Red Flags** (report these specifically):
9090+9191+- Click-only interactions with no keyboard alternative
9292+- Missing or invisible focus indicators
9393+- Meaning conveyed by color alone (red = error, green = success)
9494+- Unlabeled form fields or buttons
9595+- Time-limited actions without extension option
9696+- Custom components that break screen reader flow
9797+9898+---
9999+100100+## 4. Deliberate Stress Tester — "Riley"
101101+102102+**Profile**: Methodical user who pushes interfaces beyond the happy path. Tests edge cases, tries unexpected inputs, and probes for gaps in the experience.
103103+104104+**Behaviors**:
105105+106106+- Tests edge cases intentionally (empty states, long strings, special characters)
107107+- Submits forms with unexpected data (emoji, RTL text, very long values)
108108+- Tries to break workflows by navigating backwards, refreshing mid-flow, or opening in multiple tabs
109109+- Looks for inconsistencies between what the UI promises and what actually happens
110110+- Documents problems methodically
111111+112112+**Test Questions**:
113113+114114+- What happens at the edges (0 items, 1000 items, very long text)?
115115+- Do error states recover gracefully or leave the UI in a broken state?
116116+- What happens on refresh mid-workflow? Is state preserved?
117117+- Are there features that appear to work but produce broken results?
118118+- How does the UI handle unexpected input (emoji, special chars, paste from Excel)?
119119+120120+**Red Flags** (report these specifically):
121121+122122+- Features that appear to work but silently fail or produce wrong results
123123+- Error handling that exposes technical details or leaves UI in a broken state
124124+- Empty states that show nothing useful ("No results" with no guidance)
125125+- Workflows that lose user data on refresh or navigation
126126+- Inconsistent behavior between similar interactions in different parts of the UI
127127+128128+---
129129+130130+## 5. Distracted Mobile User — "Casey"
131131+132132+**Profile**: Using phone one-handed on the go. Frequently interrupted. Possibly on a slow connection.
133133+134134+**Behaviors**:
135135+136136+- Uses thumb only — prefers bottom-of-screen actions
137137+- Gets interrupted mid-flow and returns later
138138+- Switches between apps frequently
139139+- Has limited attention span and low patience
140140+- Types as little as possible, prefers taps and selections
141141+142142+**Test Questions**:
143143+144144+- Are primary actions in the thumb zone (bottom half of screen)?
145145+- Is state preserved if the user leaves and returns?
146146+- Does it work on slow connections (3G)?
147147+- Can forms leverage autocomplete and smart defaults?
148148+- Are touch targets at least 44×44pt?
149149+150150+**Red Flags** (report these specifically):
151151+152152+- Important actions positioned at the top of the screen (unreachable by thumb)
153153+- No state persistence — progress lost on tab switch or interruption
154154+- Large text inputs required where selection would work
155155+- Heavy assets loading on every page (no lazy loading)
156156+- Tiny tap targets or targets too close together
157157+158158+---
159159+160160+## Selecting Personas
161161+162162+Choose personas based on the interface type:
163163+164164+| Interface Type | Primary Personas | Why |
165165+| ------------------------ | -------------------- | -------------------------------- |
166166+| Landing page / marketing | Jordan, Riley, Casey | First impressions, trust, mobile |
167167+| Dashboard / admin | Alex, Sam | Power users, accessibility |
168168+| E-commerce / checkout | Casey, Riley, Jordan | Mobile, edge cases, clarity |
169169+| Onboarding flow | Jordan, Casey | Confusion, interruption |
170170+| Data-heavy / analytics | Alex, Sam | Efficiency, keyboard nav |
171171+| Form-heavy / wizard | Jordan, Sam, Casey | Clarity, accessibility, mobile |
172172+173173+---
174174+175175+## Project-Specific Personas
176176+177177+If `.github/copilot-instructions.md` contains a `## Design Context` section (generated by `impeccable teach`), derive 1–2 additional personas from the audience and brand information:
178178+179179+1. Read the target audience description
180180+2. Identify the primary user archetype not covered by the 5 predefined personas
181181+3. Create a persona following this template:
182182+183183+```
184184+### [Role] — "[Name]"
185185+186186+**Profile**: [2-3 key characteristics derived from Design Context]
187187+188188+**Behaviors**: [3-4 specific behaviors based on the described audience]
189189+190190+**Red Flags**: [3-4 things that would alienate this specific user type]
191191+```
192192+193193+Only generate project-specific personas when real Design Context data is available. Don't invent audience details — use the 5 predefined personas when no context exists.