StatisticalIntermediate

Pareto Chart

A combined bar-and-line chart that ranks categories in descending order of frequency while overlaying a cumulative percentage line — revealing the vital few causes that drive the majority of an effect.

// 01 — The chart

What it looks like

Example — Customer complaint categoriesQ1 2025
0204060801000%20%40%60%80%100%ShippingDefectsBillingReturnsOther40%67%84%94%100%CountCumulative %

A Pareto chart of customer complaints. The two tallest bars (Shipping and Defects) account for 67% of all complaints, illustrating the 80/20 principle.

// 02 — Definition

What is a Pareto chart?

A Pareto chart is a dual-axis visualization that combines a bar chart (sorted in descending order) with a cumulative percentage line. The bars represent the frequency or magnitude of each category, while the line tracks the running total as a percentage. Together, they make it immediately obvious which categories contribute the most to the overall total.

The chart is named after the Pareto principle (also called the 80/20 rule), which states that roughly 80% of effects come from 20% of causes. In practice, this means a small number of defect types, complaint categories, or failure modes tend to drive the vast majority of problems.

Pareto charts are one of the seven basic tools of quality control introduced by Kaoru Ishikawa. They are used extensively in manufacturing, Six Sigma, lean management, and any domain where prioritizing the most impactful issues is essential for resource allocation.

Origin: The Pareto principle was formulated by economist Vilfredo Pareto in 1896 when he observed that 80% of Italy’s land was owned by 20% of the population. Quality pioneer Joseph Juran later applied this concept to defect analysis and coined the term “vital few and trivial many,” creating the modern Pareto chart in the 1940s.

// 03 — Anatomy

Parts of a Pareto chart

ABCDE
A — Left Y-axis (frequency): Shows the raw count or value for each category as bars
B — X-axis (categories): Categories ranked from highest to lowest frequency, left to right
C — Descending bars: Bars sorted tallest to shortest, emphasizing which categories dominate
D — Cumulative line: A line connecting running-total percentages, always rising to 100%
E — Right Y-axis (cumulative %): Percentage scale from 0% to 100% for reading the cumulative line

// 04 — Usage

When to use it — and when not to

✓Use a Pareto chart when…
  • Identifying the most significant categories from a set of factors
  • Performing root-cause analysis in quality control or Six Sigma projects
  • You need to show the cumulative contribution of ranked categories
  • Prioritizing which issues to address first for maximum impact
  • Communicating the 80/20 rule to stakeholders or management
  • Comparing before-and-after improvements by re-ranking categories
×Avoid a Pareto chart when…
  • All categories contribute roughly equal amounts — the chart provides no insight
  • You need to show trends over time — use a line chart instead
  • Categories don’t represent parts of a meaningful whole
  • You have fewer than three categories — a simple bar chart suffices
  • The data is continuous rather than categorical
  • Your audience needs precise values rather than relative prioritization

// 05 — Reading guide

How to read a Pareto chart

Follow these steps whenever you encounter a Pareto chart in the wild.

1

Read from left to right

The bars are sorted from tallest (most frequent) to shortest (least frequent). The leftmost bars are the most impactful categories. This ranking is the core message of the chart.

2

Check the left Y-axis

This axis shows the raw count or value for each category. Use it to understand the absolute magnitude of each bar — not just the relative ranking.

3

Follow the cumulative line

The line rises from left to right, always approaching 100%. It tells you what fraction of the total is accounted for by the first N categories. Where the line crosses 80% is especially significant.

4

Find the 80% threshold

Draw a mental horizontal line at 80% on the right Y-axis. The categories whose bars fall to the left of where the cumulative line crosses this threshold are your 'vital few' — the small number of causes driving most of the effect.

5

Identify action items

The vital few categories are where to focus improvement efforts. Addressing these first will yield the greatest return on investment. The remaining 'trivial many' can be deprioritized.

// 06 — Pitfalls

Common mistakes

Not sorting bars in descending order

Fix: The descending sort is what makes a Pareto chart a Pareto chart. Without it, the cumulative line loses its meaning and the 'vital few' cannot be identified at a glance.

Omitting the cumulative percentage line

Fix: Without the line, you just have a sorted bar chart. The cumulative line is essential because it lets readers quickly see where the 80% threshold falls.

Using an 'Other' category that is too large

Fix: If 'Other' is one of the tallest bars, break it into sub-categories. A catch-all bucket that dominates the chart defeats the purpose of prioritization.

Misaligning the two Y-axes

Fix: The left axis (raw count) and right axis (cumulative percentage) must be scaled so that the maximum bar total aligns with 100%. Misalignment makes the cumulative line impossible to read correctly.

Including too many categories

Fix: More than 8–10 bars creates clutter. Combine the smallest categories into 'Other' and keep the focus on the vital few that matter most.

// 07 — In the wild

Real-world examples

Manufacturing defect analysis

Toyota and other lean manufacturers routinely use Pareto charts on the factory floor to identify which defect types cause the most rework. A single chart might reveal that scratches and misalignment account for 75% of all rejects, directing inspection resources to those two failure modes.

Software bug triage

Development teams use Pareto charts to rank bug categories by frequency or severity. When a sprint retrospective reveals that 80% of production incidents trace back to two modules, those modules become the priority for refactoring and additional test coverage.

Healthcare incident reporting

Hospitals apply Pareto analysis to patient safety incidents. A chart showing that medication errors and falls account for 70% of all reported events helps administrators allocate training budgets and redesign workflows where they will have the greatest impact on patient outcomes.

// 08 — Quick reference

Key facts

Also known asPareto diagram, Pareto analysis chart
Best forPrioritizing categories by frequency or impact
Data typesCategorical frequencies with cumulative percentages
Key elementsDescending bars, cumulative percentage line, dual Y-axes
Principle80/20 rule — vital few vs. trivial many
Classic useQuality control, Six Sigma, lean manufacturing
Common toolsMinitab, Excel, R (ggplot2), Python (matplotlib), JMP
Common mistakesUnsorted bars, missing cumulative line, oversized 'Other'

// 09 — Variations

Types of Pareto charts

Pareto charts adapt to different analytical contexts while keeping the same core structure of ranked bars plus a cumulative line.

Standard Pareto chart

The classic format with descending bars and a cumulative percentage line. Used in most quality control applications.

Comparative (before/after) Pareto

Two Pareto charts side by side or color-coded to show improvement after an intervention. Demonstrates which causes were addressed.

$40k$25k

Weighted Pareto chart

Bars represent cost or impact rather than raw frequency. A defect that occurs rarely but costs $50k per incident may outweigh a frequent but cheap one.

Horizontal Pareto chart

Bars run horizontally, useful when category labels are long. The cumulative line runs along the horizontal axis instead.

// 10 — FAQs

Frequently asked questions

What is a pareto chart?+

A Pareto chart is a dual-axis visualization that combines a bar chart (sorted in descending order) with a cumulative percentage line. The bars represent the frequency or magnitude of each category, while the line tracks the running total as a percentage. Together, they make it immediately obvious which categories contribute the most to the overall total.

When should you use a pareto chart?+

Use a pareto chart when identifying the most significant categories from a set of factors. It also works well when performing root-cause analysis in quality control or Six Sigma projects, and when you need to show the cumulative contribution of ranked categories.

When should you avoid a pareto chart?+

Avoid a pareto chart when all categories contribute roughly equal amounts — the chart provides no insight. It is also a poor fit when you need to show trends over time — use a line chart instead, or when categories don’t represent parts of a meaningful whole.

Is a pareto chart suitable for dashboards?+

Yes — a pareto chart can work well in dashboards as long as the panel is large enough for readers to perceive the encoded values, has a clear title, and includes the legend or axis labels needed to interpret it.

What category of chart is a pareto chart?+

Pareto Chart belongs to the Statistical family of charts. Charts in that family are designed to answer the same kind of question, so they often work as alternatives when one doesn't quite fit your data.

How do you read a pareto chart?+

Start with the axis labels and legend, then look at the overall shape before zooming into individual marks. Compare prominent features against the rest of the data, and verify any conclusion against the underlying numbers when precision matters.