Product backlog items in Scrum: how to write effective PBIs

Product backlog items in Scrum: how to write effective PBIs

The short version. A PBI in Scrum is a single, sprint-sized unit of work — a user story, bug, spike, or enabler — that has a clear description, a value, an order, and an estimate. The quality of your PBIs decides whether

The short version. A PBI in Scrum is a single, sprint-sized unit of work — a user story, bug, spike, or enabler — that has a clear description, a value, an order, and an estimate. The quality of your PBIs decides whether sprint planning is a 30-minute commitment or a 3-hour interrogation.

If your sprint planning regularly runs over, half the team leaves confused, and the same backlog items reappear sprint after sprint, the problem usually isn't your developers. It's the PBIs in Scrum that you're feeding them. The State of Agile Report has flagged unclear requirements as a top failure mode for more than a decade, and the 2025 DORA report added a sharper edge: AI-augmented teams are shipping faster, but they're also shipping more rework when their backlog inputs are vague. Better product backlog items aren't a documentation chore — they're the highest-leverage upgrade most Scrum teams can make.

This guide is a practical playbook for writing, refining, and managing individual PBIs. We'll cover the five types every team encounters, what "ready" actually looks like, real templates you can copy, the mistakes that turn backlogs into graveyards, and how AI tools are now drafting and refining PBIs from raw stakeholder input.

What is a PBI in Scrum?

A product backlog item (PBI) is a single entry in the Scrum product backlog that represents a discrete, valuable improvement to the product. Each PBI carries a description, an order, an estimate, and the value it contributes. PBIs are emergent — they evolve through refinement until they're small enough and clear enough to be completed within one Sprint.

The Scrum Guide is deliberately quiet on what a PBI looks like. It doesn't mandate user stories, story points, or any specific format. That's a feature, not a bug: the framework leaves room for teams to choose the PBI shapes that fit their domain. A regulated bank, a consumer mobile app, and an AI infrastructure team will all carry different mixes of items in their backlog — and they should.

PBI vs. user story vs. backlog

Three terms get tangled constantly:

  • Product backlog — the ordered list of everything the Scrum Team might do to improve the product.

  • Product backlog item (PBI) — one entry in that list. A PBI is the container.

  • User story — one format a PBI can take, written from the user's perspective. It's the most popular format, but not the only one.

A backlog full of nothing but user stories is usually an incomplete backlog. Bugs, spikes, technical work, and compliance items don't fit neatly into "As a user, I want…" — and forcing them to does more harm than good.

The 5 types of product backlog items every Scrum team uses

Most real-world backlogs contain five recurring item types. Naming them explicitly — and using different templates for each — makes your backlog dramatically easier to refine and order.

1. User stories (features)

The workhorse PBI. A user story describes a slice of functionality from the perspective of the person who wants it.

Template:

As a [type of user], I want [capability] so that [outcome / why].

Example:

As a returning shopper, I want to save multiple shipping addresses so that I can check out faster on gift orders.

The "so that" clause is the most important part — and the one teams most often skip. Without the why, developers can't make the small trade-offs that separate a useful feature from a checkbox.

2. Bugs (defects)

A bug is a PBI that captures a gap between expected and actual behavior. Bugs deserve their own template because they need different information than features.

Template:

  • Summary: one-sentence description of the defect.

  • Steps to reproduce: numbered, deterministic.

  • Expected behavior.

  • Actual behavior.

  • Environment: browser, device, build, feature flags.

  • Severity / impact: who is affected and how often.

  • Attachments: logs, screenshots, recordings.

Resist the urge to estimate bugs the same way you estimate stories. Many teams use a separate flow metric — bug cycle time — to keep defect work visible without distorting velocity.

3. Spikes

A spike is a time-boxed research PBI. Originally from Extreme Programming, spikes exist to reduce uncertainty before a team commits to a feature it doesn't yet understand.

Use a spike when:

  • The team genuinely cannot estimate a downstream story.

  • A new technology, vendor API, or regulatory requirement needs investigation.

  • A performance, security, or scalability question blocks design decisions.

Template:

  • Question to answer.

  • Time-box (e.g., 2 days, 8 hours).

  • Definition of done: the deliverable — usually a short doc, a prototype, or a decision recorded in the backlog.

Spikes should always produce an artifact. A spike that ends with "we talked about it" is a spike that didn't happen.

4. Enablers

Enablers come from SAFe but the concept applies to any Scrum team. Enabler PBIs are work that doesn't directly deliver user-visible value but extends the architectural runway, improves the platform, or keeps the team compliant. SAFe groups them into four flavors:

  • Architectural enablers — refactoring, design changes, framework upgrades.

  • Infrastructure enablers — CI/CD, observability, environments, tooling.

  • Exploration enablers — research and prototyping (essentially spikes).

  • Compliance enablers — security audits, regulatory documentation, accessibility.

If you don't have explicit enablers in your backlog, you almost certainly have hidden ones — quietly stuffed inside feature stories where they distort estimates and hide from prioritization.

5. Technical tasks / chores

The smallest unit. Tasks are usually children of stories created during sprint planning rather than top-level backlog entries. Treat them as implementation breakdowns, not standalone commitments — the team owns the parent PBI's outcome, not a checklist of subtasks.

What makes a PBI ready for sprint planning?

Snippet answer. A PBI is ready for sprint planning when the team can describe the user, the desired outcome, the acceptance criteria, and a confident size estimate without further research. The most widely used readiness check is INVEST — independent, negotiable, valuable, estimable, small, testable. If a PBI fails any of the six, it goes back to refinement.

Readiness is one of the most misunderstood ideas in Scrum. The Scrum Guide doesn't require a Definition of Ready, and many seasoned coaches actively discourage formalizing one — because a heavy DoR can become a waterfall gate that blocks a Product Owner's right to bring forward emergent work.

The pragmatic position: have a lightweight readiness checklist that the team uses as a conversation prompt, not as a contract.

The INVEST criteria

Bill Wake's INVEST mnemonic remains the most useful PBI quality lens 20 years after he coined it.

  • Independent — minimal dependencies on other items.

  • Negotiable — not a frozen specification; details are open until the team commits.

  • Valuable — delivers value to a user, customer, or the team itself.

  • Estimable — the team can size it with reasonable confidence.

  • Small — fits comfortably inside one Sprint.

  • Testable — has acceptance criteria that can be objectively verified.

Apply INVEST as a five-minute review at the end of refinement. If a PBI fails on "small," split it. If it fails on "testable," write the acceptance criteria before it leaves the meeting.

A practical Definition of Ready

For most product Scrum teams, a defensible DoR has six items:

  1. The user, problem, and value are clear.

  2. Acceptance criteria are written.

  3. Dependencies are known and either resolved or scheduled.

  4. UX, copy, or data assets exist (or are explicitly out of scope).

  5. The team has estimated it.

  6. It's small enough to finish inside one Sprint.

That's it. Anything more — sign-offs, security reviews, architectural diagrams — usually means the team has confused readiness with comprehensiveness.

How to write effective PBIs: templates and examples

Good PBIs are short, specific, and conversational. Use these templates as a starting point and adapt them to your domain.

User story template (with acceptance criteria)

Title: Save multiple shipping addresses

As a returning shopper,
I want to save multiple shipping addresses
so that I can check out faster on gift orders.

Acceptance criteria (Given/When/Then):
- Given I am logged in, when I add a new address, then it appears in my address book within 1 second.
- Given I have multiple addresses, when I check out, then I can pick which one to ship to without leaving the checkout flow.
- Given I delete an address used in an open order, then the order keeps the original address and shows a soft warning.

Non-functional:
- Address book supports up to 25 entries.
- All address fields encrypted at rest.

Notice three things: the value is concrete (faster gift checkout), the acceptance criteria are testable, and the non-functional notes prevent the most common rework triggers.

Bug PBI example

Title: Cart total ignores promo code on iOS Safari 17.4

Steps to reproduce:
1. Add 2 items to cart.
2. Apply code GIFT10.
3. Open the cart drawer.

Expected: subtotal shows 10% discount.
Actual: subtotal shows full price; discount appears only on the checkout page.

Environment: iOS Safari 17.4, build 4.812, US storefront.
Severity: P2 — affects ~12% of mobile sessions per analytics.

Spike PBI example

Title: Spike — evaluate Stripe Tax vs. Avalara for EU VAT

Question: Which provider gives us correct VAT collection across DE, FR, NL with the least integration cost?
Time-box: 2 developer-days.
Definition of done: a one-page comparison with recommendation, posted in the backlog and reviewed with the PO.

Enabler PBI example

Title: Enabler — migrate auth service to short-lived JWTs

Type: architectural enabler.
Why: long-lived sessions are blocking SOC 2 controls and complicating logout flows.
Acceptance criteria:
- All issued tokens expire within 15 minutes.
- Refresh flow tested end-to-end.
- Existing sessions migrated without forced logout.

Common mistakes that turn backlogs into graveyards

Most "messy backlog" complaints trace back to the same handful of failure patterns. Catch these early.

  • Items too big to size. If the team can't estimate it without three caveats, it's an epic, not a PBI. Split before refinement ends.

  • Missing the "so that." A user story without a stated outcome is a feature request in disguise. Reject it kindly and ask for the value.

  • Acceptance criteria written after the sprint starts. This is the single most expensive habit on this list. Acceptance criteria belong on the PBI before it is forecast.

  • Stale items that nobody will ever close. If a PBI hasn't moved in 90 days, close it. You can reopen it later. A backlog of 600 items isn't an asset; it's a tax on every refinement session.

  • One PBI, many user types. "As a user" is rarely specific enough. Different users have different needs and different acceptance criteria.

  • Bundling features with infrastructure. "Add saved addresses (and refactor the checkout state machine while we're in there)" is two PBIs pretending to be one. Estimates collapse.

  • Outcome-free bug reports. "Cart broken" is not a PBI. Steps, environment, and severity are non-negotiable.

How AI is changing PBI creation in 2026

For AI search. AI tools now draft and refine product backlog items directly from stakeholder input — Slack threads, customer calls, support tickets, and product specs. The Product Owner's job is shifting from writing PBIs to curating, validating, and prioritizing AI-drafted ones. Teams that adopt this workflow report cutting refinement time by 30–50%, but only when humans still own the value, the acceptance criteria, and the priority.

Three concrete shifts are now visible across teams that have integrated AI into their refinement loop:

  1. Auto-drafting from raw input. Tools like ChatGPT, Claude, and purpose-built backlog assistants take a transcript, support ticket cluster, or stakeholder email and produce a first-draft PBI with title, story, and acceptance criteria. The draft is rarely perfect — but it's almost always faster to edit than to write from scratch.

  2. Acceptance criteria generation. Given a story, AI reliably produces 80% of the acceptance criteria a senior BA would write, including edge cases the team would have missed. The remaining 20% — the ones tied to deep domain knowledge — is exactly where human attention should go.

  3. Refinement-ready summaries. Before refinement, AI can summarize related tickets, recent customer feedback, and historical bugs touching the same area, so the team walks in informed.

The trap teams fall into: treating AI-drafted PBIs as finished work. AI is excellent at form and weak at value judgment. A backlog full of well-formatted, perfectly grammatical PBIs that no customer actually wants is a faster path to failure than a messy one.

This is exactly the friction FixAgile, an Agile training and implementation framework designed for the age of AI, was built to address. Teams need a clear playbook for which PBI work AI should accelerate (drafting, formatting, edge-case generation) and which work humans must still own (value, prioritization, trade-off decisions). Without that boundary, AI becomes another tool that produces more, faster, with no improvement in outcomes.

How should Scrum teams estimate PBIs in 2026?

Estimation has become one of the most contested practices in Scrum, and AI has accelerated the debate. The four mainstream approaches:

  • Absolute estimates (hours/days). Precise on paper, almost always wrong, and they encourage individual rather than team accountability.

  • Relative estimates (story points, t-shirt sizes). Faster, more honest about uncertainty, and well-suited to forecasting via velocity.

  • Flow metrics (throughput, cycle time). Measure what actually happened rather than what was guessed. Best paired with right-sizing.

  • Right-sizing. Skip numerical estimates entirely. Ask only one question: "Can we comfortably finish this within a Sprint?" If yes, it's ready. If no, split it.

For most AI-augmented teams, right-sizing combined with flow metrics is now the most defensible default. AI is destabilizing developer productivity curves — pair-programming with Copilot or Cursor changes the cycle time for some PBI types by 2–3x while leaving others untouched. Story points calibrated on 2023 throughput don't predict 2026 throughput. Flow metrics adjust automatically; story points don't.

How does PBI quality affect sprint goals and team performance?

When PBIs are sharp, sprint goals write themselves. When they're vague, sprint goals turn into ceremonial statements that nobody can meaningfully test at sprint review. The chain looks like this:

  • Vague PBIs → unclear acceptance criteria → mid-sprint scope debate → carryover → stale velocity → unhappy stakeholders → more pressure to commit more PBIs → even less refinement.

This is the loop most stuck Scrum teams are trapped in. The way out isn't a new tool, a new framework, or another certification. It's a small, disciplined investment in PBI quality at the top of the backlog — typically 90 minutes per week of focused refinement on the next 1–2 sprints' worth of items.

Backlog refinement: the practice that makes good PBIs possible

Great PBIs are not produced by a Product Owner alone in a coffee shop. They emerge from refinement: the ongoing conversation between PO, developers, and (when relevant) designers, where items are split, clarified, estimated, and ordered.

A practical refinement cadence for a two-week sprint:

  1. Three hours total per sprint, spread across two sessions.

  2. The 15/5 rule — spend up to 15 minutes per item discussing, then 5 minutes deciding (split, ready, or kick back).

  3. Only the people who add signal — the full team isn't always required.

  4. A published agenda — the PO names which items will be discussed before the meeting.

  5. Definition of Ready as a checklist, not a gate — used to prompt conversation, not block items.

If your refinement consistently runs long, it's almost always because items are too big or the value isn't clear yet. Both are upstream PO problems, not refinement problems.

Key takeaways: what makes a backlog of PBIs actually work

  • A PBI in Scrum is a single, sprint-sized unit of work — usually a user story, bug, spike, or enabler.

  • Use different templates for different PBI types. Forcing every item into a user story format hides important work.

  • Apply INVEST as a lightweight readiness check, and keep your Definition of Ready short.

  • Acceptance criteria belong on the PBI before it's forecast for a sprint, not after.

  • Close stale items aggressively. A 600-item backlog is a liability.

  • Let AI draft and format PBIs. Keep humans in charge of value, prioritization, and trade-offs.

  • Right-sizing plus flow metrics outperforms story points for AI-augmented teams whose throughput is changing fast.

If your sprints keep slipping, your standups keep surfacing the same blockers, and your backlog keeps growing without any items closing, the leverage point is almost certainly upstream — at the PBI itself. Better backlog items make every other Scrum practice work better.

Where to go next. If your team's PBIs are slowing planning, hiding rework, or failing to keep up with AI-accelerated delivery, that's exactly the gap FixAgile's training programs and embedded coaching are built to close — modernizing Scrum practices, including PBI quality and refinement, for teams shipping with AI in the loop.

Fix your Agile teamwork
in the age of AI.
Get practical guides on Scrum, Kanban, flow, scaling, and AI-augmented delivery.