TOEFL iBT Band Scores and CEFR Levels 2026: A Student's Field Guide
Most articles about the new TOEFL scoring system hand you a conversion chart and call it a day. That's useful for admissions officers. It's useless for a test-taker sitting in a chair trying to figure out whether a band 4.5 or a band 5.0 is realistic in eight weeks. This guide does the opposite job: for every band from 3.0 to 6.0 we show you a real passage, a real speaking transcript, and the question types you start missing below that line. By the end you should know, within half a band, where you stand on the CEFR scale and what the shortest path to your target looks like.
1. From 0-120 to 1-6: What Actually Changed
On January 21, 2026, ETS retired the 0-120 total score that TOEFL had used since 2005 and replaced it with a 1.0-6.0 band scale in 0.5-point increments. Each of the four sections (Reading, Listening, Speaking, Writing) now receives its own band, and the overall score is the average of the four section bands, rounded to the nearest half point. The new report also includes a CEFR label on every section, which is the part most students care about because it travels easily across borders and across other English tests.
Two things are worth internalising before we go further. First, the new scale has only 11 possible scores per section, not 121. Second, ETS has committed to a dual-reporting period through January 2028 during which your official score report lists both the new 1-6 band and a legacy 0-120 equivalent, so universities that haven't updated their published minimums can still process your application. After 2028, only the band score appears. For a full breakdown of format changes, see our TOEFL iBT 2026 format changes guide.
2. CEFR in 90 Seconds: A1 Through C2
CEFR stands for the Common European Framework of Reference for Languages. It was published by the Council of Europe in 2001 and has quietly become the international reference scale for language proficiency. It is not a test. It is a set of descriptors. Six levels from A1 (beginner) to C2 (mastery) describe what a learner can actually do with the language, written as "can do" statements like "can understand the main ideas of complex text on both concrete and abstract topics."
| CEFR | Name | Core Descriptor |
|---|---|---|
| C2 | Mastery | Understands virtually everything heard or read; expresses precise meaning even in complex situations. |
| C1 | Effective Operational | Understands demanding, longer texts and uses language flexibly for academic and professional purposes. |
| B2 | Upper Intermediate | Understands main ideas of complex text; interacts with fluency enough to make regular interaction with native speakers possible. |
| B1 | Intermediate | Deals with most situations likely to arise in travel; produces simple connected text on familiar topics. |
| A2 | Elementary | Communicates in simple routine tasks requiring direct exchange of information on familiar matters. |
| A1 | Beginner | Understands and uses everyday expressions; introduces themselves and asks basic personal questions. |
For an academic English test like TOEFL, the meaningful range is B1 to C2. Below B1 is rarely tested, and C2 is so high that most native speakers without an academic background would not automatically qualify. Your target, in practical terms, is somewhere between B2 and C1.
3. The TOEFL Band-to-CEFR Map
Here is the official alignment ETS published alongside the 2026 scale refresh, combined with the legacy 0-120 concordance for the dual-reporting period. This is the single most cited table in 2026 admissions conversations, so keep it handy.
| Band (1-6) | CEFR | Legacy 0-120 | In Plain English |
|---|---|---|---|
| 6.0 | C2 | 114-120 | Near-native academic English. Ceiling of the test. |
| 5.5 | C1+ | 110-113 | Competitive for Ivy League and Oxbridge. |
| 5.0 | C1 | 100-109 | Comfortable at any top-20 university. |
| 4.5 | B2+ | 87-99 | Meets most top-50 university minimums. |
| 4.0 | B2 | 72-86 | Meets the floor for many top-100 programs. |
| 3.5 | B1+ | 57-71 | Some state and regional universities. |
| 3.0 | B1 | 42-56 | Community colleges and pathway programs. |
| 2.0-2.5 | A2 | 18-41 | Pre-academic ESL. Not admissions-ready. |
| 1.0-1.5 | A1 | 0-17 | Foundational English study needed. |
Two notes on reading this table. The legacy 0-120 ranges are approximate because ETS uses statistical equating, not a fixed formula, so a 5.0 might be anywhere from 100 to 109 depending on the test form. And the CEFR column is by section: your overall CEFR is the lowest of the four. A student with a 5.0 Reading and a 3.5 Speaking is not a C1 student; they are a B1+ student with strong reading.
4. What Each Band Reads Like (Sample Passages)
The clearest way to feel the difference between bands is to try passages written at each level. Read each of the four sample paragraphs below. The one you can understand on first read, without looking up a single word, is roughly your current Reading band.
Band 3.5 / B1+ — Can you read this easily?
Bees are important for farming. They move pollen from one flower to another flower, which helps plants make fruit and seeds. Without bees, many crops would not grow well. In recent years, scientists have seen that the number of bees is falling. Some reasons for this are new kinds of farming chemicals, diseases, and changes in the weather. Farmers and researchers are working together to protect bees because our food depends on them.
Band 4.0-4.5 / B2 — Still comfortable?
Although bees are small, their economic role is enormous. Pollination by managed honeybee colonies contributes an estimated fifteen billion dollars a year to U.S. agriculture, particularly in the production of almonds, apples, and blueberries. Since the early 2000s, however, beekeepers have reported annual colony losses as high as forty percent, a phenomenon that has been linked to a combination of neonicotinoid pesticide exposure, the parasitic Varroa mite, and the stress of long-distance commercial transport.
Band 5.0 / C1 — Can you still follow every clause?
The prevailing narrative, that neonicotinoids alone account for widespread colony collapse, has recently come under more granular scrutiny. Longitudinal studies published over the last five years suggest that pesticide exposure, while demonstrably harmful in isolation, interacts synergistically with Varroa-borne viruses and pathogen-induced immunosuppression, such that attributing causation to any single factor risks mischaracterising what is more plausibly an emergent, multi-stressor phenomenon whose mitigation will demand coordinated rather than siloed intervention.
Band 5.5-6.0 / C1+ to C2 — Nuance and tone?
That the decline in pollinator populations is attributable to neonicotinoids has, in certain quarters, hardened into orthodoxy, and not without some justification; yet the accumulating evidence, once read against the grain, compels a more chastened hypothesis, one in which anthropogenic pesticide load operates less as a prime mover than as an accelerant upon pre-existing and mutually reinforcing biological stresses, a reading that would, if taken seriously, displace the regulatory lever from a narrow chemical prohibition toward the harder work of ecological restoration.
The passages are progressively harder along three axes: vocabulary frequency (neonicotinoid, synergistic, chastened), grammatical density (subordinate clauses, passive chains, hedging), and cohesion (each sentence depending on what came before). If paragraph three felt effortless and paragraph four made you slow down twice, you are likely a solid C1.
5. What Each Band Sounds Like (Speaking Transcripts)
The 2026 Speaking section now tests you with a Listen & Repeat task, a short interview task, and a think-aloud academic task. Here are three transcripts from the same prompt ("Describe a habit you want to change and why") at three different bands. Read them aloud, time yourself, and notice what changes.
Band 3.5 / B1+ — 45-second response
"OK, so, one habit I want to change is, um, I sleep late every night. I usually go to bed at two in the morning because I watch videos on my phone. Then I feel tired next day. I want to stop because, um, because it is not good for my health and also my grades go down. So I try to put my phone in other room, but sometimes I forget."
Raters see: intelligible but hesitant, heavy reliance on basic linkers (so, because), repeated simple structures, one self-correction, limited vocabulary range.
Band 4.5 / B2+ — 45-second response
"A habit I'd really like to change is scrolling on my phone late at night. I usually tell myself I'll stop at eleven, but I end up watching videos until almost two, and the next morning I'm wrecked for classes. The reason I want to fix it is partly health, because my sleep is clearly suffering, and partly productivity, since my focus in afternoon lectures has dropped. What I've started doing is leaving the phone in the kitchen, which works about half the time."
Raters see: fluent, natural rhythm, appropriate linking (partly…and partly, since, which), idiomatic touches ("wrecked"), one slightly off collocation but no errors that interrupt meaning.
Band 5.5 / C1+ — 45-second response
"Honestly, the habit I'm most motivated to change is late-night phone use, and I think the reason it matters to me goes beyond the obvious sleep argument. Every morning I've convinced myself I'll cut it off at eleven, and every night I cave, which is as much a confidence problem as a behavioral one. The cost isn't just a groggy morning; it's the slow erosion of trusting my own commitments. So the change I'm attempting is less about willpower and more about removing the device from the bedroom entirely, which, when I actually do it, works."
Raters see: complex argumentation, self-awareness, precise vocabulary (erosion, groggy, cave), varied sentence structure, confident pacing. The candidate would almost certainly score the task at 5.5 or 6.0.
6. What Each Band Writes Like (Essay Samples)
The 2026 Writing section includes Build-a-Sentence (new), Write-an-Email (revised), and Academic Discussion. Here are opening paragraphs from the Academic Discussion task ("Should universities require students to complete community-service hours before graduation?") at three bands.
Band 4.0 / B2
"I think universities should require community service because it helps students to learn about their community and also to become more responsible adults. Many students only focus on their study and they don't know what happen outside the campus. If they do some volunteer work, they can meet different people and understand their problems. This will make them better citizens in the future. Also, service hours can be useful for their CV when they apply for jobs."
Band 5.0 / C1
"While I agree with the broad idea that universities should encourage civic engagement, I'm less convinced that mandating a specific number of service hours is the right mechanism. Required service risks turning what should be a voluntary, self-directed contribution into a bureaucratic checkbox, and there is reasonable evidence, both anecdotal and in the education literature, that compulsion dampens the very intrinsic motivation the policy aims to cultivate. A better approach might be to embed service learning within existing coursework, where reflection is structured and the work is tied to academic content."
Notice what the 5.0 response adds: hedging ("I'm less convinced"), concession structure (the "while…but" move), a named mechanism of failure (bureaucratic checkbox, intrinsic motivation), and a concrete alternative. These are the moves a raters' rubric calls "developed, well-supported response" versus "response addresses the task but lacks development."
7. Listening Comprehension by Band
Listening bands are defined less by vocabulary and more by which question types you can reliably answer. Here is a rough mapping, drawn from the question-type families on the 2026 test.
| Band | Question types you handle reliably | Where you start to slip |
|---|---|---|
| 3.0-3.5 | Main idea, explicit detail | Speaker attitude, function, connecting information |
| 4.0-4.5 | Main idea, detail, basic function ("why does the professor say X?") | Inference, sarcasm, multi-speaker connection |
| 5.0 | All standard types including inference and attitude | Fast back-and-forth in conversations with overlapping ideas |
| 5.5-6.0 | Everything, including pragmatic understanding and tone | Almost nothing predictable |
If you consistently nail main-idea and detail questions but lose 2-3 questions per passage on attitude/function/inference, you're sitting between 4.0 and 4.5. That's the most common profile we see among students who start preparing seriously. Use our free 2026-format Listening practice tests to check which question types are dragging your band down.
8. Which Band Do Universities Actually Want?
Universities have spent 2026 updating their published minimums. The pattern that has emerged is remarkably consistent across the English-speaking world:
- Elite research universities (top 20): Band 5.0 minimum, 5.5 competitive. That is solid C1. MIT, Stanford, Cambridge, ETH Zurich, and U of Toronto all cluster here.
- Strong research and state flagships (top 21-100): Band 4.5 minimum, 5.0 competitive. Upper B2 to low C1. Most U.S. state flagships, Russell Group in the UK, Group of Eight in Australia.
- Regional universities: Band 4.0 minimum. Solid B2.
- Community colleges and pathway programs: Band 3.0-3.5. B1 to B1+.
Graduate programs in English, law, and medicine typically add half a band on top of their institution's undergraduate minimum, and teaching assistantships (which require fluent classroom speech) usually require 5.0-5.5 on the Speaking section specifically. For a detailed breakdown, see our TOEFL score requirements 2026 guide and the companion scores for universities breakdown.
9. Why the New Scale Is Less Forgiving
Here is a piece of arithmetic no one else is writing about. On the old 0-120 scale, a single missed Speaking rating moved you about 3-4 points, from 26 to 22, say. Nobody cared. On the new 1-6 scale with 0.5-point increments, a single missed rating can move you from 4.5 to 3.5 — a full band — because the same raw-score range is now divided into eleven buckets instead of a hundred and twenty-one.
Concrete example. A student scores 24, 24, 23, 23 on the old Speaking rubric across four tasks. That averages to 23.5, which rounds to a scaled score of 24, old-scale B2+. On the 2026 rubric, those same four ratings map to 4.5, 4.5, 4.0, 4.0 on section bands, averaged to 4.25 and rounded to 4.0, or low B2. A single extra point on one task (24 to 25) would move the average to 4.5. On the old scale that same improvement barely registered.
The practical lesson: tight pacing and accuracy on the last question of each section matter more than they used to. Our Speaking tips guide and Writing tips guide both include pacing drills designed around the new scoring curve.
10. Self-Diagnose Your Band in 20 Minutes
You can get a surprisingly accurate CEFR placement by working through one Reading passage and one Listening lecture under timed conditions. Here is the shortest version of the drill:
- Reading (10 minutes). Take one passage from our free 2026-format Reading Test 1. Answer all questions without stopping. Score yourself.
- Listening (10 minutes). Play one lecture from Listening Test 1. Answer without pausing or replaying. Score yourself.
- Map to a band. Use the question-type table above and your raw score. If you got 8 of 10 right on both, you're roughly 4.5. If you got 6 of 10, you're roughly 4.0. If you got 9-10, push up to 5.0.
The Speaking and Writing sections are harder to self-score without a rubric-trained evaluator, which is why we built AI-assisted evaluation into the full-length 2026 practice test. Feedback arrives in seconds and is calibrated against the published 2026 rubrics.
11. Frequently Asked Questions
What CEFR level is a TOEFL iBT band 5.0?
Is a TOEFL band 4.0 a B2 or B1?
Why does the new TOEFL scale only have 11 possible scores?
How do I know my current CEFR level without taking the real TOEFL?
Which CEFR level do universities require?
Can I reach C1 from B2 in three months?
Find Your Current CEFR Band
Take a free 20-minute diagnostic in the 2026 format. See your band and CEFR level instantly, plus targeted practice for the section holding you back.
Take the DiagnosticExpert TOEFL preparation content, updated for the 2026 exam format. Our team includes certified English language instructors and test preparation specialists.