Scoring & CEFR 2026 Update

TOEFL iBT Band Scores and CEFR Levels 2026: A Student's Field Guide

12 min read
TOEFLMock Team

Most articles about the new TOEFL scoring system hand you a conversion chart and call it a day. That's useful for admissions officers. It's useless for a test-taker sitting in a chair trying to figure out whether a band 4.5 or a band 5.0 is realistic in eight weeks. This guide does the opposite job: for every band from 3.0 to 6.0 we show you a real passage, a real speaking transcript, and the question types you start missing below that line. By the end you should know, within half a band, where you stand on the CEFR scale and what the shortest path to your target looks like.

1. From 0-120 to 1-6: What Actually Changed

On January 21, 2026, ETS retired the 0-120 total score that TOEFL had used since 2005 and replaced it with a 1.0-6.0 band scale in 0.5-point increments. Each of the four sections (Reading, Listening, Speaking, Writing) now receives its own band, and the overall score is the average of the four section bands, rounded to the nearest half point. The new report also includes a CEFR label on every section, which is the part most students care about because it travels easily across borders and across other English tests.

Two things are worth internalising before we go further. First, the new scale has only 11 possible scores per section, not 121. Second, ETS has committed to a dual-reporting period through January 2028 during which your official score report lists both the new 1-6 band and a legacy 0-120 equivalent, so universities that haven't updated their published minimums can still process your application. After 2028, only the band score appears. For a full breakdown of format changes, see our TOEFL iBT 2026 format changes guide.

2. CEFR in 90 Seconds: A1 Through C2

CEFR stands for the Common European Framework of Reference for Languages. It was published by the Council of Europe in 2001 and has quietly become the international reference scale for language proficiency. It is not a test. It is a set of descriptors. Six levels from A1 (beginner) to C2 (mastery) describe what a learner can actually do with the language, written as "can do" statements like "can understand the main ideas of complex text on both concrete and abstract topics."

CEFR Name Core Descriptor
C2MasteryUnderstands virtually everything heard or read; expresses precise meaning even in complex situations.
C1Effective OperationalUnderstands demanding, longer texts and uses language flexibly for academic and professional purposes.
B2Upper IntermediateUnderstands main ideas of complex text; interacts with fluency enough to make regular interaction with native speakers possible.
B1IntermediateDeals with most situations likely to arise in travel; produces simple connected text on familiar topics.
A2ElementaryCommunicates in simple routine tasks requiring direct exchange of information on familiar matters.
A1BeginnerUnderstands and uses everyday expressions; introduces themselves and asks basic personal questions.

For an academic English test like TOEFL, the meaningful range is B1 to C2. Below B1 is rarely tested, and C2 is so high that most native speakers without an academic background would not automatically qualify. Your target, in practical terms, is somewhere between B2 and C1.

3. The TOEFL Band-to-CEFR Map

Here is the official alignment ETS published alongside the 2026 scale refresh, combined with the legacy 0-120 concordance for the dual-reporting period. This is the single most cited table in 2026 admissions conversations, so keep it handy.

Band (1-6) CEFR Legacy 0-120 In Plain English
6.0C2114-120Near-native academic English. Ceiling of the test.
5.5C1+110-113Competitive for Ivy League and Oxbridge.
5.0C1100-109Comfortable at any top-20 university.
4.5B2+87-99Meets most top-50 university minimums.
4.0B272-86Meets the floor for many top-100 programs.
3.5B1+57-71Some state and regional universities.
3.0B142-56Community colleges and pathway programs.
2.0-2.5A218-41Pre-academic ESL. Not admissions-ready.
1.0-1.5A10-17Foundational English study needed.

Two notes on reading this table. The legacy 0-120 ranges are approximate because ETS uses statistical equating, not a fixed formula, so a 5.0 might be anywhere from 100 to 109 depending on the test form. And the CEFR column is by section: your overall CEFR is the lowest of the four. A student with a 5.0 Reading and a 3.5 Speaking is not a C1 student; they are a B1+ student with strong reading.

4. What Each Band Reads Like (Sample Passages)

The clearest way to feel the difference between bands is to try passages written at each level. Read each of the four sample paragraphs below. The one you can understand on first read, without looking up a single word, is roughly your current Reading band.

Band 3.5 / B1+ — Can you read this easily?

Bees are important for farming. They move pollen from one flower to another flower, which helps plants make fruit and seeds. Without bees, many crops would not grow well. In recent years, scientists have seen that the number of bees is falling. Some reasons for this are new kinds of farming chemicals, diseases, and changes in the weather. Farmers and researchers are working together to protect bees because our food depends on them.

Band 4.0-4.5 / B2 — Still comfortable?

Although bees are small, their economic role is enormous. Pollination by managed honeybee colonies contributes an estimated fifteen billion dollars a year to U.S. agriculture, particularly in the production of almonds, apples, and blueberries. Since the early 2000s, however, beekeepers have reported annual colony losses as high as forty percent, a phenomenon that has been linked to a combination of neonicotinoid pesticide exposure, the parasitic Varroa mite, and the stress of long-distance commercial transport.

Band 5.0 / C1 — Can you still follow every clause?

The prevailing narrative, that neonicotinoids alone account for widespread colony collapse, has recently come under more granular scrutiny. Longitudinal studies published over the last five years suggest that pesticide exposure, while demonstrably harmful in isolation, interacts synergistically with Varroa-borne viruses and pathogen-induced immunosuppression, such that attributing causation to any single factor risks mischaracterising what is more plausibly an emergent, multi-stressor phenomenon whose mitigation will demand coordinated rather than siloed intervention.

Band 5.5-6.0 / C1+ to C2 — Nuance and tone?

That the decline in pollinator populations is attributable to neonicotinoids has, in certain quarters, hardened into orthodoxy, and not without some justification; yet the accumulating evidence, once read against the grain, compels a more chastened hypothesis, one in which anthropogenic pesticide load operates less as a prime mover than as an accelerant upon pre-existing and mutually reinforcing biological stresses, a reading that would, if taken seriously, displace the regulatory lever from a narrow chemical prohibition toward the harder work of ecological restoration.

The passages are progressively harder along three axes: vocabulary frequency (neonicotinoid, synergistic, chastened), grammatical density (subordinate clauses, passive chains, hedging), and cohesion (each sentence depending on what came before). If paragraph three felt effortless and paragraph four made you slow down twice, you are likely a solid C1.

5. What Each Band Sounds Like (Speaking Transcripts)

The 2026 Speaking section now tests you with a Listen & Repeat task, a short interview task, and a think-aloud academic task. Here are three transcripts from the same prompt ("Describe a habit you want to change and why") at three different bands. Read them aloud, time yourself, and notice what changes.

Band 3.5 / B1+ — 45-second response

"OK, so, one habit I want to change is, um, I sleep late every night. I usually go to bed at two in the morning because I watch videos on my phone. Then I feel tired next day. I want to stop because, um, because it is not good for my health and also my grades go down. So I try to put my phone in other room, but sometimes I forget."

Raters see: intelligible but hesitant, heavy reliance on basic linkers (so, because), repeated simple structures, one self-correction, limited vocabulary range.

Band 4.5 / B2+ — 45-second response

"A habit I'd really like to change is scrolling on my phone late at night. I usually tell myself I'll stop at eleven, but I end up watching videos until almost two, and the next morning I'm wrecked for classes. The reason I want to fix it is partly health, because my sleep is clearly suffering, and partly productivity, since my focus in afternoon lectures has dropped. What I've started doing is leaving the phone in the kitchen, which works about half the time."

Raters see: fluent, natural rhythm, appropriate linking (partly…and partly, since, which), idiomatic touches ("wrecked"), one slightly off collocation but no errors that interrupt meaning.

Band 5.5 / C1+ — 45-second response

"Honestly, the habit I'm most motivated to change is late-night phone use, and I think the reason it matters to me goes beyond the obvious sleep argument. Every morning I've convinced myself I'll cut it off at eleven, and every night I cave, which is as much a confidence problem as a behavioral one. The cost isn't just a groggy morning; it's the slow erosion of trusting my own commitments. So the change I'm attempting is less about willpower and more about removing the device from the bedroom entirely, which, when I actually do it, works."

Raters see: complex argumentation, self-awareness, precise vocabulary (erosion, groggy, cave), varied sentence structure, confident pacing. The candidate would almost certainly score the task at 5.5 or 6.0.

6. What Each Band Writes Like (Essay Samples)

The 2026 Writing section includes Build-a-Sentence (new), Write-an-Email (revised), and Academic Discussion. Here are opening paragraphs from the Academic Discussion task ("Should universities require students to complete community-service hours before graduation?") at three bands.

Band 4.0 / B2

"I think universities should require community service because it helps students to learn about their community and also to become more responsible adults. Many students only focus on their study and they don't know what happen outside the campus. If they do some volunteer work, they can meet different people and understand their problems. This will make them better citizens in the future. Also, service hours can be useful for their CV when they apply for jobs."

Band 5.0 / C1

"While I agree with the broad idea that universities should encourage civic engagement, I'm less convinced that mandating a specific number of service hours is the right mechanism. Required service risks turning what should be a voluntary, self-directed contribution into a bureaucratic checkbox, and there is reasonable evidence, both anecdotal and in the education literature, that compulsion dampens the very intrinsic motivation the policy aims to cultivate. A better approach might be to embed service learning within existing coursework, where reflection is structured and the work is tied to academic content."

Notice what the 5.0 response adds: hedging ("I'm less convinced"), concession structure (the "while…but" move), a named mechanism of failure (bureaucratic checkbox, intrinsic motivation), and a concrete alternative. These are the moves a raters' rubric calls "developed, well-supported response" versus "response addresses the task but lacks development."

7. Listening Comprehension by Band

Listening bands are defined less by vocabulary and more by which question types you can reliably answer. Here is a rough mapping, drawn from the question-type families on the 2026 test.

BandQuestion types you handle reliablyWhere you start to slip
3.0-3.5Main idea, explicit detailSpeaker attitude, function, connecting information
4.0-4.5Main idea, detail, basic function ("why does the professor say X?")Inference, sarcasm, multi-speaker connection
5.0All standard types including inference and attitudeFast back-and-forth in conversations with overlapping ideas
5.5-6.0Everything, including pragmatic understanding and toneAlmost nothing predictable

If you consistently nail main-idea and detail questions but lose 2-3 questions per passage on attitude/function/inference, you're sitting between 4.0 and 4.5. That's the most common profile we see among students who start preparing seriously. Use our free 2026-format Listening practice tests to check which question types are dragging your band down.

8. Which Band Do Universities Actually Want?

Universities have spent 2026 updating their published minimums. The pattern that has emerged is remarkably consistent across the English-speaking world:

  • Elite research universities (top 20): Band 5.0 minimum, 5.5 competitive. That is solid C1. MIT, Stanford, Cambridge, ETH Zurich, and U of Toronto all cluster here.
  • Strong research and state flagships (top 21-100): Band 4.5 minimum, 5.0 competitive. Upper B2 to low C1. Most U.S. state flagships, Russell Group in the UK, Group of Eight in Australia.
  • Regional universities: Band 4.0 minimum. Solid B2.
  • Community colleges and pathway programs: Band 3.0-3.5. B1 to B1+.

Graduate programs in English, law, and medicine typically add half a band on top of their institution's undergraduate minimum, and teaching assistantships (which require fluent classroom speech) usually require 5.0-5.5 on the Speaking section specifically. For a detailed breakdown, see our TOEFL score requirements 2026 guide and the companion scores for universities breakdown.

9. Why the New Scale Is Less Forgiving

Here is a piece of arithmetic no one else is writing about. On the old 0-120 scale, a single missed Speaking rating moved you about 3-4 points, from 26 to 22, say. Nobody cared. On the new 1-6 scale with 0.5-point increments, a single missed rating can move you from 4.5 to 3.5 — a full band — because the same raw-score range is now divided into eleven buckets instead of a hundred and twenty-one.

Concrete example. A student scores 24, 24, 23, 23 on the old Speaking rubric across four tasks. That averages to 23.5, which rounds to a scaled score of 24, old-scale B2+. On the 2026 rubric, those same four ratings map to 4.5, 4.5, 4.0, 4.0 on section bands, averaged to 4.25 and rounded to 4.0, or low B2. A single extra point on one task (24 to 25) would move the average to 4.5. On the old scale that same improvement barely registered.

The practical lesson: tight pacing and accuracy on the last question of each section matter more than they used to. Our Speaking tips guide and Writing tips guide both include pacing drills designed around the new scoring curve.

10. Self-Diagnose Your Band in 20 Minutes

You can get a surprisingly accurate CEFR placement by working through one Reading passage and one Listening lecture under timed conditions. Here is the shortest version of the drill:

  1. Reading (10 minutes). Take one passage from our free 2026-format Reading Test 1. Answer all questions without stopping. Score yourself.
  2. Listening (10 minutes). Play one lecture from Listening Test 1. Answer without pausing or replaying. Score yourself.
  3. Map to a band. Use the question-type table above and your raw score. If you got 8 of 10 right on both, you're roughly 4.5. If you got 6 of 10, you're roughly 4.0. If you got 9-10, push up to 5.0.

The Speaking and Writing sections are harder to self-score without a rubric-trained evaluator, which is why we built AI-assisted evaluation into the full-length 2026 practice test. Feedback arrives in seconds and is calibrated against the published 2026 rubrics.

11. Frequently Asked Questions

What CEFR level is a TOEFL iBT band 5.0?
A TOEFL iBT band 5.0 corresponds to CEFR level C1, described as effective operational proficiency. At this level a test-taker can follow extended academic lectures, read dense university texts, and express complex ideas fluently with only occasional minor errors. On the old 0-120 scale this maps to approximately 100-109.
Is a TOEFL band 4.0 a B2 or B1?
A band 4.0 is the lower end of CEFR B2 (upper intermediate). Test-takers at this level can understand the main ideas of complex text on concrete and abstract topics but often miss nuance, tone, and detail. Band 3.5 sits at B1+ and band 4.5 sits at solid B2+.
Why does the new TOEFL scale only have 11 possible scores?
The 2026 TOEFL iBT uses 0.5-point increments from 1.0 to 6.0, giving only 11 possible band scores per section. The old 0-120 scale had 121 possible scores. This means a single missed question can shift your band by a full half-point, whereas the same miss on the old scale was rarely visible. Practice with tight pacing is now more important.
How do I know my current CEFR level without taking the real TOEFL?
The most reliable way is to take a full-length TOEFL 2026 practice test and score each section using the published band descriptors. A 20-minute diagnostic on one Reading passage and one Listening lecture is enough to place you within one band. You can take a free 2026-format practice test at toeflmocktests.com/practice-tests.
Which CEFR level do universities require?
Most universities require B2 (TOEFL band 4.0-4.5) as a minimum for undergraduate admission. Competitive research universities and Ivy League programs typically require C1 (band 5.0-5.5). Graduate programs in law, medicine, and the humanities may require C1+ (band 5.5) or C2 (band 6.0) for teaching assistantships.
Can I reach C1 from B2 in three months?
Moving from solid B2 (band 4.5) to C1 (band 5.0) in three months is realistic for students who study 1-2 hours daily with authentic academic materials, take weekly full-length practice tests, and review mistakes systematically. Moving from low B2 (band 4.0) to C1 usually takes 5-6 months because the gap is larger.

Find Your Current CEFR Band

Take a free 20-minute diagnostic in the 2026 format. See your band and CEFR level instantly, plus targeted practice for the section holding you back.

Take the Diagnostic
TOEFLMock Team

Expert TOEFL preparation content, updated for the 2026 exam format. Our team includes certified English language instructors and test preparation specialists.