Skip to content

Reading the Dashboard

When you navigate to a project’s test page, flakiness.io shows you the default test report. This is an aggregated status view across all environments and all tests of the project’s default branch, for a single day – the day of the last commit, in the project’s timezone.

The default test report is also what powers other features:

Once on this page, you can navigate to a different day, environment, or branch.

Hover or click a numbered region on the screenshot to jump to its explanation, or use the “On this page” outline on the right.

Defaults to the repository’s main branch and lets you switch to any other branch. The branch defines the history window used throughout the dashboard; all testing data is scoped to the selected branch.

Scoping results to a single branch is essential to keep statistics and health signals unaffected by on-going development and testing on other branches. For example, a test that passes on main but fails on a feature branch is a regression on that branch, but not on main.

Branch Selector
  1. Branch filter — substring-matching filter over branch names.
  2. Recent branches — the 5 most recently modified branches that match the filter.
  3. Last Commit — the branch’s most recent commit, including committer and timestamp.

Shows every testing environment by default. You can filter by environment category, or pick a single environment (operating system, browser, or custom environment) to narrow results.

Environment Selector
  1. Categories — environments are grouped by category, usually by test runner. A project with Vitest front-end tests and pytest back-end tests will have two categories here: vitest and pytest.
  2. Category Environments — every environment inside the selected category.

A heat-map of last 3 months. Each cell is one day, coloured by its aggregated status by the end of the day: green (passed), yellow (flaked), red (failed & regressed). Click a cell to jump to that day’s report.

Test Health calendar
  1. Calendar view — opens a date picker to jump to any day.
  2. Range selector — sets the report range. The dashboard shows every test that ran at least once within the range, and each row reports the test’s last result inside the range.
  3. Last tested — jumps to the most recent day with test activity.

Filters and slices the tests table using Flakiness Query Language (FQL).

Common filters:

  • click — tests whose name, suite, or file contains click
  • s:flaked — tests by status
  • f:login.spec.ts — tests by file name
  • @skip — tests by annotation
  • #smoke — tests by tag
  • duration>1s — tests by last-run duration
  • flip>0% — tests by flip rate
  • $timeout — tests whose error text contains timeout

One card per status: Passed, Regressed, Failed, Flaked, Skipped. Each card shows the count and percentage of tests currently in that state, respecting the active FQL query.

The card block’s background color reflects the day’s aggregated status: green when everything is healthy, yellow when the number of flaked tests exceeds the acceptable flakiness rate, red when anything failed or regressed.

Status Cards
  1. Passed — tests whose last run of the day is passing.
  2. Regressed — tests with failing last run of the day that had perfect history before the selected day.
  3. Failed — tests with failing last run of the day, but had history with flakes or failures before selected day.
  4. Flaked — tests that flaked during their last run of the day.
  5. Skipped — tests that were skipped during their last run of the day.

An overview of the filtered set, with one row per test × environment. The same test can appear on multiple rows if it ran in multiple environments — for example, on different operating systems, or, for browser end-to-end tests, in different browsers. Click any column header to sort.

Tests Table
  1. History — the last 10 commit-executions of this test, one bar per run. The height of the bar is the run’s duration relative to the other runs in the window; the color of the bar is the test outcome in the given commit.
  2. Status — the test’s last status on the selected day.
  3. Test name — the full test title and its source location.
  4. Environment — the environment that produced this row.
  5. Duration — the last run’s wall-clock time.
  6. Duration Trend — difference between the first entry in the 10-run history and the current duration.
  7. Flip Rate — how many times the status flipped across the 10-run history window. See Flip Rate.

Duration Trend and Flip Rate are history-based columns: they summarize the last 10 runs rather than just the selected day.