Skip to main content

Global Sequencing Market Observatory


This section uses large-scale public repository metadata (with a particular emphasis on ENA) to approximate global academic sequencing activity and expenditure trends over 2016–2025.

Scope & Caveats: the goal is trend analysis (rates of change, relative shifts, concentration), not perfect absolute spend. Cost estimates necessarily rely on approximations where key commercial parameters are not public (e.g. flow-cell utilisation, pricing packages, discounting, etc.).

What this section aims to answer:
  • How has sequencing expenditure evolved over time, and how concentrated is spend by vendor/instrument?
  • Which platforms and instrument families are gaining share?
  • How do instrument models compare in economic positioning (cost-efficiency vs. throughput scale)?

Using ENA and related public sources, I compiled sequencing metadata estimated to cover ~95% of all experiments deposited in major repositories between 2016 and 2025, corresponding to approximately ~30 million experiments, ~30 million runs, and ~3 × 10¹⁴ reads.

Loading chart…

For each accession level (sample, run, experiment, study), the dataset captures:

  • Sequencing platform and instrument model
  • Total read and base counts
  • Library metadata (strategy, source, selection)
  • Submitting center / institute name

Cost estimations are derived using platform and model-specific formulas, combining vendor reagent lists and (where possible) adjustments for run-throughput differences and historical pricing changes.
Unless otherwise specified, these are best interpreted as run-level reagent cost proxies rather than fully-loaded cost-of-ownership.

Loading chart…
Loading chart…

Top: Annual growth in sequencing expenditures, with recent peaks. Illumina remains the leading platform, while MGI / BGI, PacBio, and Oxford Nanopore show a steady rise, consistent with a gradual diversification of the platform mix.

Bottom: Cumulative expenditure by instrument model (2016–2024), highlighting strong concentration in a small number of short-read instrument families and reinforcing their market dominance.

  • Concentration signal: a small number of instrument families dominate spend, suggesting high switching costs and inertia in exploring alternative platforms.
  • Disruption signal: despite increasing cost competitiveness of several challenger platforms, adoption remains constrained by non-price factors such as installed base and workflow inertia.

2023 – 2024 % expenditure changes per platform and model

2023–2024 % expenditure changes per platform and model

Cost-efficiency, Scale, and Read-length Positioning


Economic positioning across sequencing instrument models

Tip: pan to adjust view

Loading chart…

Bubble plot summarizing economic positioning across sequencing instrument models.

  • Bottom-right: high-throughput short-read systems with strong cost-efficiency at scale.
  • Upper regions: legacy or low-throughput systems with higher per-Gb costs.
  • Distinct clusters: long-read platforms occupy separate cost–throughput regimes.

Relationship between average read length per experiment and experiment-level sequencing cost


Experiment Cost vs Average Read Length

Short-read platforms dominate the low read-length region (~100–300 bp) and span a wide cost distribution, reflecting diverse use cases and budget constraints. Long-read platforms cluster at longer read lengths with higher cost variability, consistent with their use in specialised applications rather than broad sequencing.