Tiny Wiki : Fast loading, text only version of Wikipedia.

Balkan Sprachbund

The Balkan sprachbund or linguistic area is the ensemble of areal features—similarity in grammar, syntax, vocabulary and phonology—among languages of the Balkans, which belong to various branches of Indo-European, such as Slavic, Greek, Romance and Albanian. While they share little vocabulary, their grammars also have similarities; for example they have similar case systems and have all become more analytic, although to differing degrees.

History



The earliest scholar to notice the similarities between Balkan languages belonging to different families was the Slovenian scholar Jernej Kopitar in 1829. August Schleicher (1850) more explicitly developed the concept of areal relationships as opposed to genetic ones, and Franc Miklošič (1861)studied the relationships of Balkan Slavic and Romance more extensively.

Nikolai Trubetzkoy (1923), Kristian Sandfeld-Jensen (1930), and Gustav Weigand developed the theory in the 1920s and 1930s.

In the 1930s, the Romanian linguist Alexandru Graur criticized the notion of “Balkan linguistics,” saying that one can talk about “relationships of borrowings, of influences, but not about Balkan linguistics”.

The term "Balkan linguistic union" was coined by the Romanian linguist Alexandru Rosetti in 1958, when he claimed that the shared features conferred the Balkan languages a special similarity. Theodor Capidan went further, claiming that the structure of Balkan languages could be reduced to a standard language. Many of the earliest reports on this theory were in German, hence the term "Balkansprachbund" is often used as well.

Languages



The languages that share these similarities belong to five distinct branches of the Indo-European languages:
* Albanian
* Greek
* Indo-Aryan (Arli Romany/Gypsy)
* Romance languages (Romanian, Aromanian, Megleno-Romanian and Istro-Romanian)
* Slavic languages (Bulgarian, Macedonian, Serbian — especially the Torlakian dialect which is transitional between Macedonian, Bulgarian and Serbian)

However, not all of these languages have the same number of features shared. That is why they are divided into three groups:

# Albanian, Romanian, Macedonian, Aromanian and Bulgarian have the most properties in common
# Serbian language (especially transitional Torlak dialect) and Greek share with the others a lower number of properties
# Turkish - shares mainly vocabulary and replacement of infinitive with subjunctive.

The Finnish linguist Jouko Lindstedt computed in 2000 a "Balkanization factor" which gives each Balkan language a score proportional with the number of features shared in the Balkan linguistic union. The results were:



Another language that may have been influenced by the Balkan language union is the Judeo-Spanish variant that used to be spoken by Sephardi Jews living in the Balkans. The grammatical features shared (especially regarding the tense system) were most likely borrowed from Greek.

Origins



The source of these features as well as the directions have long been debated, and various theories were suggested.

Thracian, Dacian or Illyrian


Since most of these features cannot be found in languages related to those that belong to the linguistic union (such as other Slavic or Romance languages), early researchers, including Kopitar, believed they must be inherited from the Paleo-Balkan languages (Thracian, Dacian and Illyrian) which formed the substrate for the modern Balkan languages. But since very little is known about any of these languages, it cannot be determined whether the features were present.

Greek


Another theory, advanced by Kristian Sandfeld in 1930, was that these features were an entirely Greek influence, under the presumption that since Greece "always had a superior civilization compared to its neighbours", Greek could not have borrowed its linguistic features from them. However, no ancient dialects of Greek possessed Balkanisms, so that the features shared with other regional languages appear to be post-classical innovations. Also, Greek appears to be only peripheral to the Balkan linguistic union, lacking some important features, such as the postposed article.

Latin and Romance


The Roman Empire ruled all the Balkans, and local variation of Latin may have left its mark on all languages there, which were later the substrate to Slavic newcomers. This was proposed by Georg Solta. The weak point of this theory is that other Romance languages have few of the features, and there is no proof that the Balkan Romans were isolated for enough time to develop them.

An argument for this would be the structural borrowings or "linguistic calques" into Macedonian from Aromanian, which could be explained by Aromanian being a substrate of Macedonian, but this still does not explain the origin of these innovations in Aromanian.

Multiple sources


The most commonly accepted theory, advanced by Polish scholar Zbigniew Gołąb, is that the innovations came from different sources and the languages influenced each other: some features can be traced from Latin, Slavic or Greek languages, while others, particularly features that are shared only by Romanian, Albanian, Macedonian and Bulgarian, could be explained by the substratum kept after Romanization (in the case of Romanian) or Slavicization (in the case of Bulgarian). Albanian was influenced by both Latin and Slavic, but it kept many of its original characteristics.

Several arguments favour this theory. First, throughout the turbulent history of the Balkans, many groups of people moved to another place, inhabited by people of another ethnicity. These small groups were usually assimilated quickly and sometimes left marks in the new language they acquired. Second, the use of more than one language was common in the Balkans before the modern age, and a drift in one language would quickly spread to other languages. Third, the dialects that have the most "balkanisms" are those in regions where people had contact with people of many other languages.



Timeline of contacts



''(under development)''

Most likely the earliest contact was between the Proto-Romanians and Proto-Albanians, (1st century - 5th century AD) this theory being supported by the Albanian vocabulary borrowed from Balkan Latin, as well as the Romanian substrate, which has words cognate to Albanian words.

The exact area where contact occurred is under debate, ranging from Northern Albania to Transylvania. For more, see Origin of Romanians and Origin of Albanians. All Romanian varieties (from the Republic of Moldova to the Vlachs of Serbia) are part of the sprachbund, which shows that the contact happened before they diverged.

The invasion of the Slavs led to a period of migrations throughout the Balkans which created multi-ethnic communities and this led to the sprachbund beginning around the 8th century; most features were present by the 12th century, but in some parts it continued until the 17th century.

Serbian was perhaps the last language to join, as shown by the low number of features, most of which were only in the Torlak dialect, a dialect intermediary to Bulgarian which emerged rather late, after most features were settled in the sprachbund.

Features



Grammatical features


Case system


The number of cases is reduced, several cases being replaced with prepositions, the only exception being Serbian. In Bulgarian, on the other hand, this development has actually led to the loss of all cases except the vocative.

A common case system of a Balkan language is:

* Nominative
* Accusative
* Dative / Genitive (merged)
* Vocative

=

Syncretism of genitive and dative

=
In the Balkan languages, the genitive and dative cases (or corresponding prepositional constructions) undergo syncretism.

''Example:''

(* ''Marija in the Nominative case)''

=

Syncretism of locative and directional expressions

=

(* ''Grčka in the Nominative case'')

Verb tenses


=

Future tense

=
The future tense is formed in an analytic way using an auxiliary verb or particle with the meaning "will, want", referred to as de-volitive, similar to the way the future is formed in English and originating with Greek innovation around the 1st century AD. This feature is present to varying degrees in each language. Decategoralization is less advanced in Romanian ''voi'' and in Serbian ''ću, ćeš, će'', where the future marker is still an inflected auxiliary. In Modern Greek, Bulgarian, Macedonian, and Albanian, decategoralization and erosion have given rise to an uninflected tense form, where the frozen 3rd person singular of the verb has turned into an invariable particle followed by the main verb inflected for person. ''съм'' (Bul.); ''сам'' (Ser.) - "I am";
''обещал съм''; ''обећао сам'' (Ser.) - "I have promised" (lit. "I am one who has promised"), perfect tense. Constructions using the verb ''to have'' are characteristic of Macedonian language (Имам ветено./Imam veteno. = I have promised.).

=

Avoidance of infinitive

=
The use of the infinitive (common in other languages related to some of the Balkan languages, such as Romance and Slavic) is generally replaced with subjunctive constructions, following early Greek innovation.

* in Bulgarian, Macedonian and Tosk Albanian, the loss of the infinitive is complete
* in demotic Greek, the loss of the infinitive was complete, whereas in literary Greek it was not; the natural fusion of the demotic (vernacular) form with the literary (archaic) one resulted in the creation of the contemporary common Greek (Koine Neohellenic), where the infinitive is principally used as noun (e.g. λέγειν "speaking, fluency, eloquence", γράφειν "writing", είναι "being", etc.)
* in Aromanian and Southern Serbian dialects, it is almost complete
* in Gheg Albanian and Megleno-Romanian, it is used only in a limited number of expressions
* in standard Romanian and Serbian, the infinitive shares many of its functions with the subjunctive. In these two languages, the infinitive will always be found in dictionaries and language textbooks. In Romanian, the long infinitives, which are identical to the Italian ones (-are, -ere, and -ire) can also be used in both formal and informal conversation.
* Turkish as spoken in Sliven and Šumen has also almost completely lost the infinitive, clearly due to the influence of the Balkan Sprachbund.

For example, "I want to write" in several Balkan languages:

But here is an example of a relict form, preserved in Bulgarian:


Bare subjunctive constructions


Sentences which include only a subjunctive construction can be used to express a wish, a mild command, an intention or a suggestion.

This example translates in the Balkan languages the phrase "You should go!", using the subjunctive constructions.


Morphology


=

Postposed article

=
With the exception of Greek and Romani, all languages in the union have their definite article attached to the end of the noun, instead of before it. None of the related languages (like other Romance languages or Slavic languages) shares this feature and it is thought to be either an innovation or Albanian borrowing spread in the Balkans.

However, each language created its own internal articles, so the Romanian articles are related to the articles (and demonstrative pronouns) in Italian, French, etc., while the Bulgarian articles are related to demonstrative pronouns in other Slavic languages.


=

Number formation

=
The Slavic way of composing the numbers between 10 and 20, e.g. "one + on + ten" for eleven, called superessive, is widespread. Modern Greek does not follow this.



=

Clitic pronouns

=
Direct and indirect objects are cross-referenced, or doubled, in the verb phrase by a clitic (weak) pronoun, agreeing with the object in gender, number, and case or case function. This can be found in Romanian, Greek, Bulgarian, Macedonian, and Albanian. In Albanian and Macedonian, this feature shows fully grammaticalized structures and is obligatory with indirect objects and to some extent with definite direct objects; in Bulgarian, however, it is optional and therefore based on discourse. In Greek, the construction contrasts with the clitic-less construction and marks the cross-referenced object as a topic. Southwest Macedonia appears to be the location of innovation.

For example, "I see George" in Balkan languages:


Note: The neutral case in normal (SVO) word order is without a clitic: "Виждам Георги." However, the form with an additional clitic pronoun is also possible in colloquial speech: "Виждам го Георги." And the clitic is obligatory in the case of a topicalized object (with OVS-word order), which serves also as the common colloquial equivalent of a passive construction. "Георги го виждам."

=

Adjectives

=
The replacement of synthetic adjectival comparative forms with analytic ones by means of preposed markers is common. These markers are:
*Bulgarian: ''по-''
*Albanian: ''më''
*Romanian: ''mai''
*Modern Greek: ''pió''; though Greek has retained a few of the earlier synthetic forms.
*Aromanian: ''(ca)ma''

=

Suffixes

=
Also, some common suffixes can be found in the linguistic area, such as the diminutive suffix of Slavic origin "-ica" that can be found in Albanian, Greek and Romanian.

Vocabulary


Loan words


Several hundred words are common to the Balkan union languages; the origin of most of them is either Greek or Turkish, as the Byzantine Empire and later the Ottoman Empire directly controlled the territory throughout most of its history, strongly influencing its culture and economics.

Albanian, Aromanian, Bulgarian, Greek, Romanian, Serbian and Macedonian also share a large number of words of various origins:



Calques


Apart from the direct loans, there are also many calques that were passed from one Balkan languages to another, most of them between Albanian, Macedonian, Bulgarian, Greek, Aromanian and Romanian.

For example, the word "ripen" (as in fruit) is constructed in Albanian, Romanian and (rarely) in Greek (''piqem'', ''a (se) coace'', ψήνομαι) by a derivation from the word "to bake" (''pjek'', ''a coace'', ψήνω).

Another example is the wish "(∅/to/for) many years":



Idiomatic expressions for "whether one or not" are formed as "-not-".



Phonetics


The main phonological features consist of:
*the presence of an unrounded central vowel, either a mid-central schwa or a high central vowel phoneme
**''ë'' in Albanian; ''ъ'' in Bulgarian; ''ă'' in Romanian; ''`'' in Macedonian; ''ã'' in Aromanian
**In Romanian and Albanian, the schwa is obtained via centralizing unstressed
***Example: Latin ''camisia'' "shirt" > Romanian ''cămaşă'' , Albanian ''këmishë'' )
**The schwa phoneme occurs in most dialects of the Macedonian language, even in some cases in the western-central dialects, on which the standard is based (сл`нце, к`лбас, к`смет etc.)
*some kind of vowel harmony in stressed syllables with differing patterns depending on the language.
**Romanian: a mid-back vowel ends in a low glide before a nonhigh vowel in the following syllable
**Albanian and Bulgarian: back vowels are fronted before ''i'' in the following syllable.

This feature also occurs in Greek, but it is lacking in some of the other Balkan languages; the central vowel is found in Romanian, Bulgarian, some dialects of Albanian, Macedonian and Serbian but not in Greek or Standard Macedonian.

Less widespread features are confined largely to either Romanian or Albanian, or both:
*frequent loss of ''l'' before ''i'' in Albanian, Romanian, and some Romani dialects
*the alternation between ''n'' and ''r'' in Albanian and Romanian
*change from ''l'' to ''r'' in Romanian, Greek, Albanian, and very rarely in Bulgarian
*the raising of ''o'' to ''u'' in unstressed syllables in Bulgarian, Romanian, Albanian and Northern Greek dialects.
*change from ''ea'' to ''e'' before ''i'' in Bulgarian and Romanian.

Balkan_sprachbund
Source: Wikipedia