Wrong Level¶

Slug	Severity	Detection Scope	Protects
`wrong-level`	Medium	cross-suite	Fast, Maintainable

Summary¶

The test is well-written but lives at the wrong pyramid tier: a "unit" test running a subprocess, an "integration" test that mocks every dependency, a build smoke colocated with millisecond-scoped assertions.

Aliases¶

"wrong level"
"wrong tier"
"wrong pyramid level"
"unit test doing integration"
"integration test that's really unit"

Description¶

Runner conventions tie test location to test tier (*.unit.test.ts vs *.integration.test.ts, tests/ vs tests/integration/, @slow tags, -tags=integration build constraints). When a test drifts, it incurs the worst cost of both tiers: slow enough to hurt dev-loop latency, yet not trustworthy enough to count as real integration coverage. The semantic judgment is classifying each test as unit / component / integration and comparing the answer to where it currently lives.

Do not try to enforce a global layer policy across repos — read the repo's existing conventions first.

Signals¶

A single test file imports both high-level rendering harnesses (@inquirer/testing-style) and pure computed exports.
A "unit" test wraps execSync('npm run build'), subprocess.run([...]), or instantiates a DB client.
A "unit" test mocks every dependency and asserts on toHaveBeenCalledWith(...) — it's actually a contract test, mis-named.
A spec sends to private methods (described_class.send(:foo, x)) — should either become a public-API test or extract the private helper as a pure function with its own unit test file. See implementation-coupled for the related reach-through smell.

False-positive guards¶

Tier signals are repo-conditional and over-trigger when applied as if they were universal:

The repo's existing layer convention is authoritative. Before flagging a test for living at the wrong tier, read the repo's tier conventions: directory layout (tests/integration/), filename suffixes (*.unit.test.ts, *_integration_test.go), runner markers (pytest.mark.integration, @Tag("slow")), and build constraints (//go:build integration). If the test sits in a tier the repo treats as appropriate for the test's content, do not flag — even if a different repo would file it differently. The signal is mismatch between a test's content and its repo's stated tier, not mismatch with an idealized pyramid.
Co-location-by-convention single-file ecosystems. Some ecosystems place all tier coverage for a unit in one file by design — Go's package_test.go adjacent to package.go, Rust's #[cfg(test)] mod tests — and the build system shards via tags or build constraints. Flag only when the colocation incurs the cost the smell warns about (slow tests blocking the dev-loop tier), not when the colocation is the ecosystem's idiomatic shape and the cost is already mitigated by the runner.

Prescribed Fix¶

Classify each test as unit / component / integration via describe-before-edit plus the signals above.
Split files by level, respecting repo conventions: foo.test.ts → foo.unit.test.ts + foo.integration.test.ts.
Apply the runner's markers (@slow, @integration, pytest.mark.integration, //go:build integration) so CI can shard correctly.
For private-method tests: convert to public-surface coverage, or extract the helper as a pure function. This is the only "refactor for testability" move the taxonomy permits, and only because it also clarifies architecture — see the no-extract-for-testability governor rule for the exception.
Gate: preservation of regression-detection power plus no change in test count plus CI still green at each tier.

Example¶

Before¶

// src/__tests__/page-sizing.test.ts
describe('checkboxSearch page sizing', () => {
  it('calculates dynamic page size', () => {
    expect(calculateDynamicPageSize(80, 24)).toBe(10);
  });

  it('renders the full dropdown', async () => {
    const ui = render(checkboxSearch, { choices: MANY });
    await ui.waitFor(/loaded/);
    expect(ui.lastFrame()).toContain('cursor');
  });
});

After¶

// src/__tests__/page-sizing.unit.test.ts
it('calculates dynamic page size', () => {
  expect(calculateDynamicPageSize(80, 24)).toBe(10);
});

// src/__tests__/page-sizing.integration.test.ts
it('renders the full dropdown', async () => {
  const ui = render(checkboxSearch, { choices: MANY });
  await ui.waitFor(/loaded/);
  expect(ui.lastFrame()).toContain('cursor');
});

The pure calculation belongs in the fast unit tier. The render test needs @inquirer/testing and takes ~200ms. Split by level; CI shards them independently.

monolithic-test-file — level-mixing is one common reason files get monolithic.
implementation-coupled — private-method tests are often wrong-level and implementation-coupled; fix together.

Polyglot notes¶

The layer vocabulary is universal; markers differ per runner:

Python: pytest.mark.integration, path convention tests/integration/.
JS/TS: filename suffix, Vitest test.concurrent, Playwright for e2e.
Go: //go:build integration build tag, _integration_test.go convention.
Ruby: spec/integration/, RSpec tags.
JVM: Gradle source sets, Surefire/Failsafe, @Tag.