Most modern intelligence tests use a deviation IQ scale. Under this system, the average IQ score is set at 100, and the standard deviation is typically 15 points. This allows scores to be interpreted consistently across different versions of the test.
Using this framework:
- 100 IQ represents average performance.
- 115 IQ is one standard deviation above average.
- 85 IQ is one standard deviation below average.
- 130 IQ is approximately two standard deviations above average.
The statistical distribution of IQ scores makes interpretation easier. Approximately:
- 68% of people score between 85 and 115.
- 95% of people score between 70 and 130.
- Only a small percentage score above 130 or below 70.
Another important way of interpreting IQ is through percentiles. A percentile rank indicates the percentage of people in the normative sample who scored at or below a given score.
For example:
- IQ 100 is approximately the 50th percentile.
- IQ 115 is roughly the 84th percentile.
- IQ 130 is approximately the 98th percentile.
Percentiles often provide a clearer understanding of rarity than the IQ number itself because they directly show how an individual's performance compares to others.
Why Age Adjustment and Measurement Error Matter
One of the most important aspects of IQ calculation is the use of age-adjusted norms. Cognitive abilities develop throughout childhood and continue to change during adulthood. Because of this, IQ tests compare individuals to others within their own age group rather than to the population as a whole.
A raw score that is typical for a ten-year-old child might be far below average for an adult. Age norming ensures that IQ scores reflect relative cognitive performance rather than simple developmental differences.
Psychologists also recognize that no intelligence test is perfectly precise. Every IQ score includes a degree of measurement error, which means the reported result should be viewed as an estimate rather than an exact value.
For example, a reported IQ score of 110 may actually reflect an underlying ability level within a range of several points above or below that value. This is why professionals often discuss confidence intervals when interpreting intelligence test results.
Understanding measurement error is especially important when evaluating very high or very low scores, where small differences may not be statistically meaningful.
In summary, IQ is calculated by converting raw test performance into standardized scores based on representative population norms. These scores are combined across multiple cognitive domains and positioned on a scale with an average of 100. The process allows psychologists to compare individuals fairly across age groups and populations while accounting for statistical variation and measurement uncertainty. Although IQ scores provide valuable insight into cognitive ability, they are best understood as estimates of performance rather than absolute measures of intelligence.IQ is not a raw score that comes directly from a test sheet. It is a standardized score based on how a person performs relative to a normative sample. Most modern IQ tests use a deviation IQ method, which sets the average score to 100 and places scores on a scale defined by standard deviations. This approach helps make results comparable across age groups and test editions.
The process begins with raw scores from individual subtests. A typical IQ battery includes sections for verbal comprehension, perceptual reasoning, working memory, and processing speed. Each subtest produces a raw score based on the number of correct responses or the time taken to complete tasks. These raw scores are then converted to scaled scores using tables derived from the test’s normative sample.
Norming is the foundation of IQ calculation. Test developers administer the instrument to a representative sample of people, matched to the target population by age, education, and demographics. The raw scores from that sample are used to determine how common or rare each level of performance is. A scaled score of 10 on a subtest, for example, typically represents average performance for the normative group, while scores above or below 10 show relative strength or weakness.
Once subtest scores are scaled, they are combined into composite scores. For many tests, the average of the scaled subtests yields an overall IQ score centered on 100. The standard deviation is usually 15, so one standard deviation above the mean is 115 and one below is 85. This makes it easy to interpret where a person stands compared to the normative sample: 68% of people score between 85 and 115, and 95% score between 70 and 130.
Age norms are also important. IQ tests take into account the fact that cognitive abilities develop and change over time. A raw score that is typical for ten-year-olds may be low for adults, so scores are compared to same-age peers. This is why IQ scores are most meaningful when they are age-adjusted. The same numeric IQ can mean different things depending on whether it is based on child, adolescent, or adult norms.
Percentiles are another way of expressing IQ results. A percentile indicates the percentage of people in the normative sample who scored at or below a given value. An IQ of 100 usually corresponds to the 50th percentile, while an IQ of 130 corresponds to roughly the 98th percentile. Percentiles provide a clearer sense of rarity than the raw IQ number alone, especially for people who are curious about how their result compares to others.
Measurement error and test reliability are also part of the story. No test is perfectly precise, so a reported IQ score is an estimate with a confidence interval. A score of 110 might really reflect an underlying ability somewhere between 106 and 114, for example. That is why professionals interpret IQ as a band rather than as a single fixed number, particularly for scores at the extremes.
In short, IQ is calculated by converting raw task performance into standardized scores based on representative norms, combining those scores across subtests, and positioning the result on a scale with a mean of 100. The process aims to make individual performance comparable across people and time, but it is also shaped by the choice of test, the quality of norms, and the way results are reported.