Skip to content
National Albanian Registry United States of America
9 min read

Gheg vs Tosk Albanian: The Two Main Dialect Groups Explained

One river in central Albania splits the language in two. North of it people speak Gheg; south of it, Tosk. Most US Albanian families grew up on the Gheg side without ever naming it.

Enri Zhulati

By Enri Zhulati

National Albanian Registry · 501(c)(3) editorial desk

Gheg vs Tosk Albanian: The Two Main Dialect Groups Explained
In this article Show
  1. 01 Where Gheg is spoken
  2. 02 Where Tosk is spoken
  3. 03 How the two dialects differ
  4. 04 Why standard Albanian is Tosk-based
  5. 05 Which dialect US Albanians grew up with
  6. 06 One language, drawn across borders
Audio Listen to this article
0:00 / —:—

One river decides which Albanian you speak. The Shkumbin winds through central Albania for about 181 kilometers, and linguists treat it as the rough boundary between the two main branches of the language. North of it: Gheg. South of it: Tosk. According to Wikipedia, the Shkumbin “is approximately the boundary of the primary dialect division for Albanian, Tosk and Gheg.”

That single split shapes how Albanian sounds, how it is written, and which version most US families carry. If you grew up hearing Albanian at home in Detroit, the Bronx, or Worcester, there is a good chance it was Gheg, even if no one in the house ever used the word. This guide lays out where each dialect lives, how they differ, why the official standard leans Tosk, and what all of that means for the diaspora.

Where Gheg is spoken

Gheg (gegërisht) is the language of the north. Wikipedia places it in “northern and central Albania, Kosovo, northwestern North Macedonia, southeastern Montenegro and southern Serbia.” Put plainly, Gheg covers the whole northern arc of the Albanian-speaking world, and almost all of it sits outside the borders of the modern Albanian state.

That last point matters. Kosovo is Gheg. So are the Albanian areas of North Macedonia, the Albanian communities of Montenegro, and the Malësia highlands. Gheg has roughly 4.1 million native speakers by recent counts, and it breaks into several subdialects, including Northwestern, Northeastern, Central, and Southern Gheg.

Gheg also has a deep written history. The first long text in Albanian, the 1555 Meshari of Gjon Buzuku, is in a Gheg variety. Catholic northern writers kept a Gheg literary line going for centuries, and major twentieth-century poets such as Gjergj Fishta wrote in it. So Gheg is not a spoken-only folk register that lacks a tradition of its own. It simply does not hold official status as a written standard in any country today, a point Wikipedia states directly.

For a sense of scale, the Gheg-speaking world is larger by population than the Tosk one. Recent figures put Gheg at roughly 4.1 million native speakers against about 1.8 million for Tosk, according to the counts cited on Wikipedia. That gap is part of why the standardization debate stayed alive: the dialect chosen as the base for written Albanian is not the one with the most speakers.

Where Tosk is spoken

Tosk (toskërisht) is the language of the south. Wikipedia places it “south of the Shkumbin River,” with communities also in parts of Kosovo, North Macedonia, Italy, Greece, and Turkey. Inside Albania, that means cities such as Korça, Gjirokastra, Vlora, and Berat.

Tosk’s reach extends well beyond the Balkans, because two of the oldest Albanian diaspora communities are Tosk-rooted. The Arbëresh of southern Italy, whose ancestors left the Balkans starting in the 1400s, speak a Tosk-derived variety. So did the Arvanites of Greece, whose language, Arvanitika, Wikipedia also classifies as Tosk-derived. The southern Tosk subgroup further includes the Lab and Çam varieties, the latter tied to Çamëria, the historically Albanian region of what is now northwestern Greece.

These older offshoots drifted for hundreds of years and now sound archaic to a speaker of modern Albanian. But their family line is clear: they sit on the southern, Tosk branch. Tosk itself counts roughly 1.8 million native speakers, fewer than Gheg, yet it carries outsized weight because it became the base of the written standard.

The Tosk side also splits internally. Wikipedia distinguishes Northern Tosk from Southern Tosk, the latter holding the Lab and Çam groups. The schwa, the ë sound, even varies within Tosk: in the Lab varieties it tends to be pronounced further back in the mouth. None of these internal splits break Tosk apart; they are the normal texture of any spoken dialect spread across a region.

How the two dialects differ

The differences are real but not extreme. They cluster in a handful of sounds and a couple of grammar habits.

Nasal vowels. Gheg keeps nasal vowels, the kind made by letting air through the nose. Tosk lost them. Wikipedia notes “a lack of nasal vowels in Tosk.” A word a Gheg speaker pronounces with a nasal color comes out flat in Tosk.

Rhotacism, the n-to-r switch. This is the textbook Tosk marker. In a set of words, an older sound that surfaces as n in Gheg became r in Tosk. Wikipedia gives the example of rëra for “sand,” from an earlier form with n. Linguists call this change rhotacism, and it is one of the cleanest ways to tell the two branches apart.

Consonant clusters. Tosk holds onto certain consonant sequences that Gheg often simplifies. Wikipedia notes that “Tosk dialects preserve the consonant sequences mb, ngj and nd,” where Gheg tends to reduce them. To a listener, this is part of why southern speech can sound a touch fuller on those clusters while northern speech runs them together.

Vowel inventory. Gheg’s vowel system is larger overall, in part because it keeps the nasal set and distinguishes long from short vowels in ways Tosk does not. Wikipedia counts a sizeable Gheg vowel inventory once the long, short, oral, and nasal variants are all included. The practical effect is that a single written word can carry more spoken shades in Gheg than the flattened standard spelling suggests.

The ë. The letter ë, Albanian’s schwa, behaves differently across the two. In much of Tosk it is pronounced clearly; in many Gheg varieties it is weakened or dropped, often with the preceding vowel lengthened to make up for it. Its exact value shifts by region even within each branch.

The infinitive. This is the grammar split people cite most. Gheg builds the infinitive, the “to do” form, with the particle me plus a participle, as in me punue (“to work”). Tosk and the written standard instead use a për të construction, as in për të punuar. The two systems express the same idea in different shapes, and it is one of the first things a learner notices when moving between dialects. (This contrast is standard in Albanian linguistics, though the specific Wikipedia pages reviewed for this article did not spell it out, so treat the exact framing as the commonly taught version rather than a direct quotation.)

For all these differences, the two sides understand each other. Wikipedia states the dialects “are mutually intelligible in their standard varieties.” The gap is closer to American versus Scottish English than to two separate languages.

Why standard Albanian is Tosk-based

Albanian shares one alphabet, settled at the 1908 Congress of Manastir. But spelling rules and a single literary standard came later, and that standard leans south.

The unified standard was finalized at the 1972 Congress of Orthography in Tirana, which produced the rulebook Drejtshkrimi i gjuhës shqipe. The result rests mainly on the Tosk dialect. Wikipedia describes standard Albanian as “a standardised form of spoken Albanian based mainly on Tosk.”

The choice has been debated ever since, and it is worth presenting fairly. The 1972 standardization happened under Albania’s communist government, whose leadership came largely from the Tosk-speaking south. Critics argue that picking a Tosk base sidelined Gheg, the dialect of the north and of the millions of Albanians in Kosovo and beyond who were not inside Albania’s borders at the time. Defenders point out that a single shared standard let Albanians everywhere read the same books, sit the same exams, and write to one another without friction, and that delegates from Kosovo and the diaspora took part in the process. Both readings hold weight, and this article does not pick a side. Kosovo and other Albanian-speaking regions did go on to adopt the same written standard, even as their everyday speech stayed Gheg, and a long-running conversation about giving Gheg more written room continues among writers and linguists today.

Which dialect US Albanians grew up with

Here is where it gets personal for the diaspora. Standard Albanian is Tosk-based, so the textbooks and formal writing most people learned in school lean south. But the spoken Albanian in US homes often does not.

A large share of the US Albanian community traces its roots to Kosovo, North Macedonia, Montenegro, and northern Albania. Every one of those is Gheg territory. So the Albanian a US-born kid hears at home, the words for family, food, and home traded across the kitchen table or at a parish event, is frequently Gheg, even when the parents’ formal schooling ran on the Tosk-based standard.

That can produce a small, familiar gap: the Albanian you speak at home and the Albanian on the page do not line up exactly. A child might say a verb the way a grandparent from Gjakova or Tetovo says it, then meet a different form in a textbook or a song from Tirana. The infinitive is a common flashpoint, since the Gheg me construction and the standard për të form sit side by side in many diaspora households. So are the nasal vowels that color northern speech and flatten out in the written standard.

None of that is a flaw, and it does not make your Albanian less real. It is the expected result of a Gheg-speaking family using a Tosk-based written standard. Knowing the map makes the gap make sense, and it can take the sting out of being told your home Albanian is somehow wrong. It is not wrong. It is Gheg, and Gheg is Albanian.

One language, drawn across borders

Gheg and Tosk track a deeper truth: Albanians have always lived across borders, in Albania, Kosovo, North Macedonia, Montenegro, southern Italy, Greece, and the United States. The dialect line follows that spread. It does not break the language into pieces. A Gheg speaker from Pristina and a Tosk speaker from Vlora are speaking the same Albanian, with different music.

That is the spirit behind the National Albanian Registry. It counts Albanian-Americans as one community, whether your family’s Albanian is Gheg or Tosk, whether home was Kosovo or Korça, whether you speak it fluently or only catch a few words at the dinner table. The count does not ask which dialect you grew up with. It just asks you to be counted.

Add your name to the National Albanian Registry. It takes about 2 minutes.

National Albanian Registry

National Albanian Registry Published by National Albanian Registry · 501(c)(3) editorial desk · Editorial standards

Published by the National Albanian Registry — the volunteer count of Albanian Americans. Learn more

FAQ

Common questions

Is Gheg or Tosk the 'real' Albanian?

Neither. Gheg and Tosk are the two main branches of the same language, and both are spoken natively by millions of people. Standard Albanian is built mainly on a Tosk base, which gives Tosk an official footing in writing and schools. But that is a codification choice, not a ranking. Gheg has its own long written tradition and is the everyday speech of northern Albania, Kosovo, and much of the diaspora.

What is standard Albanian based on, Gheg or Tosk?

Tosk. The unified literary standard, finalized at the 1972 Congress of Orthography in Tirana, rests mainly on the Tosk dialect of the south. Wikipedia describes standard Albanian as 'a standardised form of spoken Albanian based mainly on Tosk.' Kosovo and other Albanian-speaking areas later adopted the same standard in writing, even though their everyday speech is Gheg.

Can a Gheg speaker and a Tosk speaker understand each other?

Yes, in practice. Wikipedia notes that the two dialects 'are mutually intelligible in their standard varieties.' A Gheg speaker from Kosovo and a Tosk speaker from Gjirokastra can hold a normal conversation. They will notice differences in vowels, a few sounds, and some everyday words, the same way an American and a Scot both speak English while sounding clearly distinct.

Which Albanian dialect do US Albanians speak?

Most often Gheg. A large share of the US Albanian community traces back to Kosovo, North Macedonia, Montenegro, and northern Albania, all Gheg-speaking areas. So the Albanian a US-born child hears at home, at the kitchen table or a parish event, is usually Gheg, even though the textbook standard their parents learned in school is Tosk-based.

What is the main grammar difference between Gheg and Tosk?

The most cited one is how each forms the infinitive, the 'to do' form of a verb. Gheg uses a construction with the particle 'me' plus a participle, while Tosk and the written standard use 'për të' plus a participle. There are sound differences too: Gheg keeps nasal vowels that Tosk lost, and Tosk turned an older 'n' into 'r' in some words, a change called rhotacism.

Are Arbëresh and Arvanitika Gheg or Tosk?

Both descend from Tosk. Arbëresh is the Albanian of communities in southern Italy, and Arvanitika is the Albanian historically spoken by the Arvanites of Greece. Wikipedia classifies both as Tosk-derived varieties. They split off centuries ago and developed on their own, so they sound archaic to modern ears, but their roots are firmly on the southern, Tosk side of the language.

Was this useful?

One tap. No email. We read every reply.

Discussion

Comments

Loading discussion…

    Leave a comment

    Comments are reviewed before they go live.

    Never published. Used only to verify your address.