What is the scoring benchmark?

The scoring benchmark shows how your candidates scored relative to other candidates that have taken the same tests.

The most basic way we represent the score of a test is by the percentage of questions answered correctly. However, you may wish to analyze these scores on a deeper level by comparing how difficult the test is for all candidates. If you don't have many candidates, you may have trouble determining this.

This is where our scoring benchmark comes in.

The scoring benchmark shows your scores as percentile ranks. For example, if the percentile rank of one of your candidates is 80% on a specific test, it means that they did better than 80% of all candidates who have taken that test. In this calculation, we use the results of all candidates from all of our customers. There are two major benefits to expressing the score as a percentile rank:

  1. A percentile rank score normalizes for differences in the difficulty level of a test. So the scores of different tests in an assessment become comparable. 
  2. Percentile rank scores give you great insight into the performance of a test, even if you have only one candidate. While it's helpful to have the scores from many candidates (it increases the odds that you have at least a few very good ones), you can interpret the performance of an individual candidate using a percentile rank score.

We also make it possible to choose the norm group that is most relevant to your assessment. For example, if you assess backend developer candidates on their Critical Thinking skills, you may want to see how they did relative to other software engineers specifically. This is why we ask candidates to optionally provide some demographics at the end of every assessment. 

The following norm groups can be selected:

  • All candidates (this is the default group)
  • Based on education level:
    • Some high school
    • High school diploma / GED
    • Some college / associate degree
    • Bachelor's degree
    • Master's degree or higher
  • Based on business function:
    • Administrative
    • Copywriting/editing
    • Customer/IT support
    • Design
    • Engineering (other than software)
    • Finance/Accounting
    • Human Resources
    • Legal
    • Management
    • Marketing
    • Operations
    • Quality Assurance
    • Research & Development
    • Sales/Account Management
    • Software development
  • Based on seniority:
    • Junior (up to 3 years of experience)
    • Senior (4 or more years of experience)
  • % correct - this is technically not a norm group but simply shows the percentage of questions answered correctly

It is not possible to create a cross-cut. For example, software developers with a Bachelor's degree.

For all tests, we have a sample size of at least 1,000 test-takers across all relevant norm groups. For most tests, the sample sizes are orders of magnitude higher across norm groups. For irrelevant combinations (such as the React test combined with "Customer service" as the norm group), we use the "All candidates" default to express the results.

The scoring benchmark is a setting at the level of an assessment. It is not possible to set it at the level of an individual candidate, as differences among candidates would distort the ranking.