Foundations of Psychological Testing
Understand the basics of psychological testing, the importance of validity and reliability, and how test scores reflect a representative sample of behavior.
Summary
Definition and Foundations of Psychological Testing
What is Psychological Testing?
Psychological testing is the process of administering psychological tests to individuals or groups in order to measure psychological constructs. At its core, psychological testing involves three key elements: (1) trained evaluators who administer and score tests according to standardized procedures, (2) carefully prescribed guidelines that ensure responses are evaluated consistently, and (3) test scores that are interpreted as reflecting differences in the specific psychological construct that the test is designed to measure.
The broader scientific field that studies psychological testing is called psychometrics—this is the discipline that develops, validates, and refines psychological tests through rigorous empirical research.
Understanding Psychological Tests
The Concept of a Behavioral Sample
Psychological tests work by observing a sample of behavior, that is, a person's performance on a carefully chosen subset of tasks selected in advance. The key principle is that this sample must be representative of the larger domain of behavior we want to assess.
Think of it this way: we cannot test everything a person knows or observe how they behave in every situation, so we strategically sample their behavior and use that sample to draw conclusions about their underlying abilities or characteristics. For example, an intelligence test doesn't measure every aspect of intelligence, but rather samples performance on specific cognitive tasks that researchers have determined reflect broader intellectual abilities.
Observable Behavior vs. Latent Constructs
An important distinction in psychological testing is between what we can directly observe and what we are actually trying to measure. Tests record behavior directly: how someone answers a question, whether they solve a problem correctly, or what they report about their experiences. What tests are usually designed to measure, however, are unobserved constructs (also called latent variables): abstract psychological qualities that we cannot directly observe but can infer from behavior.
For example, a test might observe whether someone answers math problems correctly (observable behavior), but what we're actually measuring is their mathematical ability (a latent construct). Similarly, an anxiety questionnaire observes someone's responses to statements about worry and nervousness, but what we're measuring is their level of anxiety—something we cannot directly see but can infer from their reported experiences.
Test Formats
Psychological tests can take many formats. They may consist of:
- Tasks and problem-solving items (e.g., solving math problems, arranging blocks into patterns)
- Statements about behaviors or symptoms that respondents affirm or deny (e.g., "I have trouble falling asleep")
- Questionnaires and interviews that ask people about their experiences, thoughts, or typical patterns of behavior
An important distinction exists between different types of tests based on what they measure:
Questionnaire and interview scales assess typical behavior—how people normally think, feel, or act in everyday life. These tests ask about characteristic patterns and are often used in personality or symptom assessment.
Psychoeducational tests assess maximum performance—how well someone can perform when they're trying their hardest. These include intelligence tests, achievement tests, and aptitude tests.
Essential Qualities of Psychological Tests
For a psychological test to be useful, it must possess two fundamental properties:
Validity means the test actually measures what it claims to measure. A valid IQ test measures intelligence, not reading ability or motivation. Validity is about accuracy: does the test assess the construct it purports to measure?
Reliability means the test provides consistent results. A reliable test should produce similar scores across different items that measure the same construct (consistency across items), should yield consistent results regardless of who administers or scores it (consistency across raters), and should produce similar results when the same person takes the test at different times (consistency across time). Reliability is about consistency and stability.
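Two of these forms of consistency can be put into numbers. The following is a minimal sketch, not a prescribed method: it assumes a small set of made-up questionnaire responses and uses Cronbach's alpha (a widely used internal-consistency statistic that the text above does not name) for consistency across items, and a Pearson correlation between two administrations for consistency across time. All data and function names are hypothetical, and consistency across raters is not covered here.

```python
from statistics import mean, variance


def cronbach_alpha(responses):
    """Internal consistency for a list of respondents, each a list of item scores."""
    k = len(responses[0])                      # number of items
    item_columns = list(zip(*responses))       # one tuple of scores per item
    item_variances = [variance(col) for col in item_columns]
    total_variance = variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)


def pearson_r(x, y):
    """Pearson correlation between two equally long lists of scores."""
    mx, my = mean(x), mean(y)
    cross = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cross / (sx * sy)


# Hypothetical data: 5 respondents answering 4 items on a 1-5 scale.
responses = [
    [4, 5, 4, 5],
    [2, 3, 2, 2],
    [3, 3, 4, 3],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
]

# Hypothetical total scores for the same 5 people at two testing sessions.
time1_totals = [sum(row) for row in responses]
time2_totals = [17, 10, 12, 19, 7]

alpha = cronbach_alpha(responses)
retest_r = pearson_r(time1_totals, time2_totals)

print(f"Consistency across items (Cronbach's alpha): {alpha:.2f}")
print(f"Consistency across time (test-retest r):     {retest_r:.2f}")
```

Values closer to 1 indicate more consistent scores. Neither number says anything about whether the items measure the intended construct, which is a validity question.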
Both validity and reliability are essential. A test can be reliable (consistent) without being valid: a broken watch that always runs the same number of minutes fast gives perfectly consistent readings that are consistently wrong. A test cannot be truly useful without both properties.
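The watch analogy can be made concrete with a few lines of Python. This is only an illustrative sketch with made-up numbers: it assumes a hypothetical watch that always runs 12 minutes fast, and treats the spread of its errors as a stand-in for (un)reliability and the average error as a stand-in for inaccuracy.

```python
from statistics import mean, pstdev

# Hypothetical data: actual times and watch readings, in minutes after midnight.
true_times = [480, 600, 720, 840, 960]
watch_readings = [492, 612, 732, 852, 972]   # assumed: always 12 minutes fast

# Error of each reading relative to the actual time.
errors = [shown - actual for shown, actual in zip(watch_readings, true_times)]

error_spread = pstdev(errors)   # 0 means the watch errs by the same amount every time
average_error = mean(errors)    # nonzero means the watch is systematically wrong

print(f"Spread of errors: {error_spread} minutes")
print(f"Average error:    {average_error} minutes")
```

A spread of zero alongside a nonzero average error is the pattern the analogy describes: the readings agree with one another but not with the thing they are supposed to measure.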
Interpreting Test Scores
Test scores are interpreted as reflecting individual differences in underlying constructs. Common constructs that tests measure include:
- Achievement (what someone has learned)
- Cognitive ability (general intelligence or specific abilities like reasoning or processing speed)
- Personality dimensions (traits like extraversion, conscientiousness, or neuroticism)
The fundamental assumption is that differences in test scores indicate meaningful differences in the underlying construct. If one person scores higher than another on an anxiety test, we interpret this as indicating that the first person experiences more anxiety. This interpretation depends entirely on the test being both valid (actually measuring anxiety) and reliable (doing so consistently).
Flashcards
What are psychological test scores intended to reflect?
Individual or group differences in the construct being measured
What is the definition of psychometrics?
The scientific study of psychological testing
What do psychological tests observe in an individual?
A carefully chosen sample of behavior
What is the term for the unobserved constructs that tests often measure?
Latent variables
What do questionnaire and interview scales specifically assess?
Typical behavior
What do psychoeducational tests specifically assess?
Maximum performance
What does it mean for a psychological test to be valid?
It measures what it purports to measure
What does it mean for a psychological test to be reliable?
It provides consistent results across items, raters, and time
What is the definition of a sample of behavior in testing?
Performance on a subset of tasks selected in advance
What must a sample of behavior be in relation to the larger domain of interest?
Representative
What are three examples of constructs that test scores are interpreted as reflecting?
Achievement
Cognitive ability
Personality dimensions
Quiz
Question 1: In the context of psychological measurement, a “sample of behavior” refers to what?
- The performance on a subset of tasks selected in advance (correct)
- The complete range of all possible behaviors in a domain
- Random observations of behavior in naturalistic settings
- Self‑report statements about typical behavior
Question 2: What is the term for the scientific study of psychological testing?
- Psychometrics (correct)
- Neuropsychology
- Behavioral genetics
- Psychophysiology
Question 3: Which pair of characteristics must a useful psychological test possess?
- Validity and reliability (correct)
- Objectivity and anonymity
- Accuracy and speed
- Flexibility and creativity
Key Concepts
Psychological Testing Concepts
Psychological testing
Psychological test
Sample of behavior
Construct (psychology)
Psychometrics Fundamentals
Psychometrics
Validity (psychometrics)
Reliability (psychometrics)
Latent variable
Definitions
Psychological testing
The systematic administration and scoring of tests to assess individual or group differences in psychological constructs.
Psychometrics
The scientific discipline concerned with the theory and technique of psychological measurement, including test development and validation.
Psychological test
An instrument that observes a selected sample of behavior to infer unobserved (latent) psychological constructs.
Latent variable
A theoretical construct that is not directly observable but is inferred from patterns of responses on test items.
Validity (psychometrics)
The degree to which a test accurately measures the construct it purports to assess.
Reliability (psychometrics)
The consistency of test scores across items, raters, and time.
Sample of behavior
A representative subset of tasks or responses selected for testing, intended to reflect the broader domain of interest.
Construct (psychology)
An abstract concept (e.g., achievement, cognitive ability, personality) that psychological tests aim to measure.