Key takeaways
Six things to know before reading further:
- MBTI per-dimension test-retest reliability is approximately 0.5-0.6 (Pittenger 2005, DOI 10.1037/1065-9293.57.3.210; Capraro & Capraro 2002 meta-analysis, DOI 10.1177/00131640221102234). Big Five per-dimension reliability is typically 0.7-0.9.
- Approximately 50% of MBTI test-takers receive a different four-letter type code on retest within five weeks. This is the most-quoted MBTI psychometric statistic and a direct consequence of categorical scoring at modest reliability.
- Most type "changes" are single-dimension flips at the cutoff — typically the J/P dimension if your score there is near 50%. Multi-dimension flips are much rarer and often indicate test-condition differences (mood, fatigue, framing) rather than personality change.
- Reliability of 0.5-0.6 per dimension is not zero — it carries real signal. The signal is just modest enough that categorical type assignment near the cutoff is unstable. Strong dimension scores (far from 50%) reproduce reliably; weak dimension scores (near 50%) flip.
- Druckman & Bjork's 1991 National Research Council review ("In the Mind's Eye") was the most prominent independent assessment of MBTI's psychometric properties at the time. The review found reliability and validity evidence below the level required for selection contexts (consistent with Pittenger's later 2005 review).
- Practical move: when reading your MBTI result, look at the dimension-level percentages. The dimensions far from 50% are the ones reliably capturing your preferences. The dimensions near 50% are weak preferences that flip on retest — that is itself useful self-knowledge, not a measurement error.