David Pittenger's 2005 paper "Cautionary Comments Regarding the Myers-Briggs Type Indicator" (published in Consulting Psychology Journal: Practice and Research, 57(3), pp. 210–221, DOI 10.1037/1065-9293.57.3.210) is the single most cited critical review of MBTI. When someone says "MBTI has been debunked," they are usually citing — directly or indirectly — this paper. It is worth knowing what Pittenger actually argued, because the paper is more careful than the summary versions.
Pittenger's paper is not a new empirical study. It is a meta-review: a synthesis of existing research on MBTI's reliability, validity, and utility. The paper has three main sections and a conclusion. Each deserves its own reading.
On reliability: Pittenger reviews test-retest studies and finds that across multiple studies, roughly 50% of MBTI test-takers receive a different four-letter type when retested within five weeks. This is the famous 50% figure. Pittenger is careful to note that most of the changes are on a single dimension, and most are on items where the original score was near the midpoint. He does not claim that half of all test-takers have unstable personalities. He claims that MBTI's dichotomous scoring turns modest score fluctuations into categorical flips, which inflates the appearance of instability.
The nuance matters. If you score 51% Sensing today and 49% Sensing next week, the MBTI framework records that as a change from S to N — a "different type." A continuous-score framework (like the Big Five) would record a tiny shift on the Sensing-Intuition axis, which is more honest. Pittenger's critique is less that people's personalities are unstable and more that MBTI's categorical framing exaggerates whatever instability exists.
On validity: Pittenger evaluates whether MBTI scores predict what they should predict — occupational success, team performance, educational outcomes, interpersonal dynamics. The evidence is mixed. Some correlations are meaningful (introversion predicts preference for solitary work, intuition predicts preference for abstract roles). Many claimed correlations are weak or absent when tested rigorously. Pittenger concludes that MBTI's predictive validity is modest at best, and substantially weaker than the framework's marketing suggests.
Importantly, Pittenger does not say MBTI has zero validity. He says MBTI's validity is weaker than claimed by The Myers-Briggs Company, weaker than its broad organizational adoption would suggest, and weaker than alternative frameworks (notably the Big Five) demonstrate in the same domains. There is a real difference between "no validity" and "modest validity that has been oversold" — and Pittenger sits firmly in the latter camp.
On factor structure: this is Pittenger's most technically damaging critique. Factor analysis is a statistical method for finding the underlying dimensions that explain patterns in test responses. When researchers apply factor analysis to MBTI item responses, they do not recover four clean orthogonal dimensions (I/E, S/N, T/F, J/P) as the framework claims. Instead, the items cluster into patterns that often map onto Big Five dimensions (especially Extraversion and Openness) with additional noise. This finding suggests MBTI is not measuring what it claims to measure — or at least, the categorical boundaries it draws do not correspond to natural joints in personality.
This is the strongest psychometric critique of MBTI and one that has held up across multiple replications. It does not mean MBTI is useless — you can still find that I/E correlates with real introvert/extrovert patterns — but the four-dimensional framework as MBTI presents it is not what the data actually shows.
Pittenger's conclusion: the paper ends with recommendations, not condemnations. Pittenger argues MBTI should not be used for employment decisions, academic placement, or clinical evaluation — domains where weak reliability and validity cause real harm. He is more neutral on MBTI's use in self-reflection, counseling conversation, and team dialogue, where the framework's coherent vocabulary provides value even if its statistical properties are limited.
This is a critical but nuanced position. Pittenger is not saying "MBTI is fake astrology." He is saying "MBTI has measurable weaknesses; use it only in contexts where those weaknesses do not bite." The popular summary of Pittenger as "MBTI is debunked" loses that nuance and transforms a careful academic critique into a dismissive slogan.
What this means for you: if you are using MBTI for self-understanding, relationship conversation, or workplace communication vocabulary, Pittenger's critique does not invalidate that use. You are using MBTI the way it can actually be used well. If you are using MBTI for hiring decisions, clinical diagnosis, or personnel selection, Pittenger's critique is decisive — you should not be using it there, and neither should your organization. The Myers-Briggs Company's own guidelines explicitly agree on the hiring question.