Saturday, November 21, 2009

Songhay and Nilo-Saharan

Following up on the preceding post, I've been looking at Greenberg's (1966) Nilo-Saharan comparisons - specifically, the 29 ones involving Songhay that have reflexes in Kwarandzyey, the Songhay language least likely to be involved in recent contact with Nilo-Saharan. Of these, 20 have comparanda in Saharan (Kanuri/Kanembu + Teda/Daza + Berti + Beria/Zaghawa), 17 in Eastern Sudanic (Nubian, Nilotic, Surmic, etc.), vs. a maximum of 13 for any other branch. (At least 7 also have plausible Mande comparisons.) Now, Saharan only consists of about 4 languages (9 by Ethnologue standards.) For Eastern Sudanic, excluding Kuliak, the Ethnologue counts 103 languages, and a huge amount of internal diversity. If Songhay were equally distant from the whole of Nilo-Saharan, you would expect far more cognates with Eastern Sudanic than with Saharan; the figures suggest that the link (whatever its nature) is primarily with Saharan, and only secondarily, if at all, with the rest of the languages he classified as Nilo-Saharan.

The grammatical comparisons that Greenberg offers are interesting but not compelling; there are only 10 of them (only 4 with Kwarandzyey reflexes), and they often incorporate misrepresentations (as Lacroix noted, for example, -ma forms verbal nouns, not relatives/adjectives, and 1sg ay < *agay, reducing the similarity to forms like Zaghawa ai.) Some of the lexical ones, however, are rather good; similarities such as Koyraboro Senni kokoši “scale (of fish)” = Manga Kanuri kàskàsí “scale (of fish)” cry out for explanation, and, though quite rare, look sufficiently numerous that chance seems unlikely. But whether they should be explained by contact or borrowing remains unclear. Either scenario would be historically interesting, since at present rather a large expanse of Tuareg and Hausa-speaking land separates Songhay from even Kanuri, and Saharan originated closer to modern-day Darfur than to Lake Chad.

Sunday, October 18, 2009

Arabic loanwords in "proto-Nilo-Saharan"

Ehret 2001 (or see Nostratic.ru) looks at first sight like an astonishingly detailed reconstruction of Nilo-Saharan, with nice binary splits and loads of technology-related words for archeologists and anthropologists to sink their teeth into. Why shouldn't specialists take advantage of this amazing opportunity to correlate historical developments to linguistic ones?

I just found a handy answer to that question. Bender (1997:175ff) gives the 15 cognate sets in Ehret 2001 that are represented in the most sub-families of Nilo-Saharan. 3 of the 15 look distinctly like Arabic loans.

1387 *wàs “to grow large”: Fur wassiye “wide” and Songhay wásà “to be wide” are both from Arabic wāsi`- واسع. The other items cited – Ik “stand”, Kanuri “yawn”, Kunama “increase, augment”, and Uduk “to tassel, of corn” – are scarcely obvious candidates for being related to one another in the first place.

1297 *là:l “to call out (to someone)”: Kanuri làn “to abuse, curse” and Songhay láalí “to curse” are obviously from Arabic la`an- لعن; Kunama lal- “to denigrate” might be from the same source. That only leaves Uduk “to persuade, incite to do something” and Proto-Central-Sudanic “to call out”.

718 *t̪íwm “to finish, complete”: almost certainly Songhay tímmè “to be finished”, very likely Uduk t̪ím “to finish”, Ocolo t̪um “to finish”, and maybe even Fur time “total”, are from Arabic tamm- تمّ (impf. -timm-), as Bender (ibid:177) considers probable. That leaves Proto-Central-Sudanic, Kunama, and Maba “all”, Kanuri “ideophone of dying animal” (!), and Proto-Kuliak “buttocks”. The “all” set looks rather promising – the whole etymology, not so much.

There are plenty of other Arabic loanwords in Ehret's “Proto-Nilo-Saharan” – a particularly egregious example is Kanuri zàmzàmíyɑ̀ “leather bottle-shaped water vessel for journeys” (#1223 *zɛ̀m “to become damp, moist”), and other especially clear-cut cases include #1173 < sawṭ, #1185 < šamm – but the fact that they include a significant proportion of the best cognate sets is what really strikes me. If a reconstruction attempt can't distinguish a widely distributed recent loan from a cognate set that split more than eleven thousand years ago, any information it gives about readily diffused items like technologies is completely unreliable. For another review from a similar perspective, try Blench 2000 (not sure why it appeared a year before the book's nominal publication date...)

The more I read about Nilo-Saharan, the less convinced I am that it exists (much less that Songhay belongs to it.) That means the classification of the languages of quite a lot of Africa is basically up for grabs. It would be great to have a reexamination of the area.

Tuesday, September 29, 2009

Why would "qaswarah" be claimed to be Ethiopic?

In the Qur'ān, 74:51, an interesting word occurs:

{ كَأَنَّهُمْ حُمُرٌ مُّسْتَنفِرَةٌ } * { فَرَّتْ مِن قَسْوَرَةٍ }
ka'annahum ħumurun mustanfirah * farrat min qaswarah
As if they were wild donkeys. Fleeing from a Qaswarah.

This tends to be rendered as "lion" in English, but the early commentators indicate that that is only one of several possible meanings of the word. al-Ṭabari (d. 310 AH), gives four (all supported by chains of transmitters whose reliability I am not competent to judge): الرماة archers, القُنَّاص hunters, جماعة الرجال a group of men, الأسد a lion. The point of interest here is that two of these explanations are supported by allusions to Ethiopic:
حدثنا هناد بن السريّ، قال: ثنا أبو الأحوص، عن سِماك، عن عكرِمة، في قوله: { فَرَّتْ مِنْ قَسْوَرَةٍ } قال: القسورة: الرماة، فقال رجل لعكرِمة: هو الأسد بلسان الحبشة، فقال عكرِمة: اسم الأسد بلسان الحبشة عنبسة.

...[`Ikrimah] said: "al-qaswarah is archers." Then a man told `Ikrimah: "It is 'lion' in the language of the Ḥabashah (Ethiopians)." Ikrimah said: "The name of the lion in the language of the Ḥabashah is `anbasah."

حدثني محمد بن خالد بن خداش، قال ثني سلم بن قتيبة، قال: ثنا حماد بن سلمة، عن عليّ بن زيد، عن يوسف بن مهران عن ابن عباس أنه سُئل عن قوله: { فَرَّتْ مِنْ قَسْوَرَةٍ } قال: هو بالعربية: الأسد، وبالفارسية: شار، وبالنبطية: أريا، وبالحبشية: قسورة.

...[Ibn `Abbās] said: It is 'asad (lion) in Arabic, and in Persian šēr (شير), and in Nabataean 'aryā (ܐܪܝܐ), and in Ethiopic: qaswarah.
The thing is, it looks like `Ikrimah was right: in Ethiopic, "lion" is indeed `anbasā (ዐንበባ), and no Ethiopic word qaswarah has been found. Qaswarah is most likely an originally Arabic word. But these were intelligent people, and the saying attributed to Ibn `Abbās above is obviously right about Persian and Nabataean; why would they say that qaswarah was the Ethiopic word for "lion" if it wasn't? One obvious possibility is that they were referring to another language of the Ethiopia region. This cannot be ruled out, since many languages of the area have no doubt gone extinct without documentation since then; but it looks as though the words for "lion" in Somali, Oromo, Beja, Agaw, Sidamo, Nubian, Nara, and Kunama are rather different. One might momentarily be tempted to think of Berber, cp. Nafusi war, but that's certainly not long enough.

Could the idea that qaswarah is "lion" in Ethiopic have derived from a misreading of `anbasa at some point? That certainly wouldn't be plausible in Arabic. It doesn't look all that plausible in Ethiopic either: ዐንበባ doesn't look all that similar to ቀስወራ. But there is another alphabet that might conceivably have been involved: the musnad, the Old South Arabian letters that continued to be used in Yemen into the Islamic period. In this alphabet, ` ع is quite similar to q ق, and n to s. The other two letters are rather less similar, but I can imagine b plus the right side of s being miscopied as w, and the remainder of s being reinterpreted as r. Here's roughly how the two words (qswr on the left, `nbs on the right) would have looked (ignoring the possibility of a final feminine -t):


Suppose this is right. Why then would someone at the time have learned an Ethiopic word from a text written in the musnad, rather than by asking an Ethiopian? Histories and travelogues are both genres attested in the Middle East of the time, and might have found occasion to mention in passing the Ethiopian word for "lion", given its cultural importance (it is a common theme in Aksumite art, and in later Ethiopia was adopted as a royal title.) Some Yemeni scholar who's never been to Ethiopia reads a miscopied version of such a history, thinks: ah, this must be the same word as in the Qur'ān, and goes on to tell everyone he knows, including (if the attribution is correct) Ibn `Abbās.

But there's a difficulty here: all that's ever been discovered in the musnad is stone inscriptions and occasional letters. No books have survived at all, much less histories or travelogues. And if there were books, you would think they would be written in the cursive script used in the letters, rather than the monumental script of the inscriptions - which reduces the similarity of the two words even more (see the table on p. 13 of History of the Arabic Script for cursive forms.)

On the other hand - anyone have a better idea?

Tuesday, September 22, 2009

Ibn Hazm again, and Cypriot Arabic

I just found a full translation online of the fifth chapter of Ibn Hazm's 11th-century work Iħkām fī Uṣūl al-Aħkām, discussed previously - a chapter remarkable for anticipating the ideas of a language instinct and of conlanging, and for clearly stating the relationship between Arabic, Hebrew, and Syriac. Enjoy! The Origins of Language: Divine Providence or Human Codification.

Not long before Ibn Hazm's time, some Arabic-speaking Maronites fled the Levant for Cyprus. In the village of Kormakiti, they have kept their language up to the present. YouTube being what it is, you can hear some on a program called Sanna (ie لساننا - our language) - go straight to 2:40, 5:00, 7:04 to hear the language itself. (Ignore the video's ill-informed claims that this is descended from Aramaic, by the way.) If you speak Greek, there are even lessons at Hki Fi Sanna. This is far more incomprehensible to me than any mainstream Arabic dialect I've ever heard, including the Levantine Arabic from which it presumably derives - a remarkable case study in how much isolation from related varieties speeds up language differentiation.

Monday, September 07, 2009

BBC Berber report

A couple of people have forwarded me this BBC article: Trail-blazing for Morocco's Berber speakers. It's a rare instance of Anglophone media noticing North African developments - in this case, the gradual establishment of Berber as a subject in Morocco's educational system. The phenomenon is rather interesting, and their efforts to create a common Tamazight "Fusha" would be a great subject for debate. But this article, sadly, is a pretty poor effort. Some of the errors of fact:

"previously oral-only language": Berber has been written, on and off, for 2500 years or more. The biggest single source of surviving Berber manuscripts (in the Arabic script) is southern Morocco. While Arabic has been - and still is - the main language of literacy for Berber speakers, Berber has not been "oral-only" in Morocco for millennia.

"an alphabet based partly on the mystical signs and symbols of the Tuareg found inscribed on tombs and monuments" - the Tifinagh characters of the Tuareg, on which Moroccan Neo-Tifinagh is based, are not "mystical signs and symbols", they're a perfectly normal consonantal alphabet, used mainly for graffiti and short letters.

"Berbers, until recently excluded from jobs in education and government": no. Their language has been excluded from both, but Berbers have held posts in both positions for as long as Morocco has existed. (The first prime minister of independent Morocco, Mbarek Bekkai, is one of many examples.) Negative attitudes towards Berber language and culture can disadvantage Berbers, but a statement like this one is frankly dishonest.

"young Moroccans either listen to Western music, or to rap in Amazigh" - I won't swear this is wrong, but that sure isn't the impression I got last time I was in Morocco. As far as I could tell, most popular Moroccan Berber music is not rap (thankfully), and certainly much (probably most) Moroccan popular music - including rap - is in Arabic.

Also, they quote Abdallah Aourik saying "Most Moroccans grow up speaking Berber" - this is possible, but is probably no longer true. Most recent-ish estimates on Berber speakers for Morocco (like within the past 50 years) hover around a third. (Wikipedia, for once giving reasonable references.)

For a more opinionated/less polite takedown, try Lounsbury. I guess the lesson is the usual one that the past decade has really drummed in: treat all reporting with scepticism.

Tuesday, September 01, 2009

Child language acquisition and constructions

A memorable line from a talk by Ewa Dabrowska that I went to recently:

"It is generally agreed that the representations assumed by generative theories cannot be learned from the input. For generative linguists, this fact is a fundamental premise of arguments for the innateness of at least some aspects of these representations: since they cannot have been learned from the input, they must be available a priori. An alternative conclusion, of course, is that we need a better theory - one that does not assume representations that are unlearnable."

Her answer is construction grammar: kids first learn individual low-level constructions like "What's ___ doing?" as unanalysed units, and only later come up with higher-level schemas of which these constructions are special cases (the next stage in this case would be "What's ___ ___ing?") Judging from the evidence she presented, showing that the vast majority of a 3 year old's utterances could be accounted for solely on the basis of simple substitutions within sentences they are known to have already heard, "children's [linguistic] creativity seems to involve superimposing and juxtaposing memorised chunks." This view of language more or less inverts the usual grammarian's perspective: the most general rules are developed only after specific cases have been learned, and the specific cases presumably continue to be stored independently. It strikes me as a rather promising way of thinking about historical syntax.

Tuesday, August 25, 2009

The Piraha discussion continues

Via Language Log/John Cowan: Dan Everett's finally gotten around to publishing a few more examples of his claims about Piraha - notably, that they have no recursion, and in particular no subordinate clauses Even quoted speech and conditionals, he claims, are not embedded. Here it is: Pirahã culture and grammar: A response to some criticisms.

Now, recursion means being able to embed a given kind of phrase within another example of the same kind of phrase, as many times as you want. In "the door of the house", one noun phrase ("the door") is embedded within another one ("the door of the house"); in "I will visit you when it stops raining", a clause "it stops raining" is embedded within a larger one ("I will visit you when it stops raining"). You can also keep doing this ("the edge of the handle of the door of the house", "I will visit you when I know whether Khaled said that James is right about the forecast that it will rain tomorrow.") In Piraha, Everett reports that for noun phrases you can only do this once (no more than one possessor), and for clauses that you can't do it at all (he insists that all the examples that look like subordinate or adverbial clauses are actually separate sentences whose linkage is left for the listener to interpret, and in this paper presents some arguments for this.)

The thing is, a language with such properties has obvious potential to be expanded into a language like English or Arabic. For possessors, all it would take is a little analogical expansion - that's what allows us to interpret a phrase like "my brother's wife's cousin's friend's cat's teeth" as grammatical, even though you may well never have heard a noun phrase with six possessors before. For subordinate clauses, all it would take is grammaticalising some kind of erstwhile adverb or intonation pattern or quotative marker into a signal that these two clauses are more closely bound than others; such changes occur all the time in languages that already have subordinate clauses (eg "with what" > "in order to" in Algerian Arabic.) If the Piraha haven't done this, then why not? If they used to speak a language with multiple possessors and subordinate clauses in the past, why and how did they abandon these features - and if they never have, then why have most languages gained these features? In short, what motivates the expansion of grammar, and how does it happen?

One place (doubtless not the only one) where I think you can see expansion of grammar in action is technical terminology; consider mathematics. "The set of all p/q such that q!=0 and p, q are integers" is perfectly clear mathematical English, but is rather unlikely to be heard in everyday English (? "the set of all couples such that the husband is not an accountant and both the husband and wife are from Belgium"). The needs of mathematical communication have motivated the use of a kind of relative clause, with a complementiser and neither a gap nor a resumptive pronoun nor a relative pronoun, which is at best marginal in normal English; if enough people were trained as mathematicians, it might get used more widely. Maybe multiple possessors and subordinate clauses are technical features to cope with the demands of socialising with large numbers of people. Or maybe Piraha has a little more embedding than Everett reports. Speculation is fun, but a nice big, searchable, publicly available corpus would be a lot more convincing.