Wednesday, June 04, 2025

Eastern Sudanic subgroup reconstructions

This is basically a note to myself, and may be updated.

Eastern Sudanic is generally taken to embrace most of the languages of Sudan, including the following families:

  • Nubian
  • Nara
  • Taman
  • Nyima
  • Jebel
  • Daju
  • Surmic
  • Nilotic
  • Temeinic

Its existence, however, remains debatable (cf. Güldemann 2022). A reconstruction of Eastern Sudanic (much less anything above it, such as Nilo-Saharan) remains out of reach. If it is possible at all, it will most likely need to be based on prior reconstructions of each of these subgroups. It is therefore useful to outline what has been done in terms of reconstruction.

Rilly's (2010) monograph identifies a clearer family consisting of Nubian, Nara, Taman, and Nyimang (along with the extinct Meroitic), which he labels North Eastern Sudanic ("soudanique oriental du nord"), and for which he proposes some 200 lexical reconstructions. In the process, he also offers 200-word reconstructions of proto-Nubian and proto-Taman, finding it necessary for the former to amend Bechhaus-Gerst's reconstruction of 97 items significantly, and drawing for the latter primarily on Edgar (1991).

Nara is a single language, whose dialectal diversity is not sufficiently well documented to make even internal reconstruction feasible.

Nyima consists of two languages, both poorly documented; Rilly gives provisional reconstructions.

For (Eastern) Jebel, Bender (1998) proposes an extremely provisional reconstruction of 100 items, outlining major sound correspondences.

Proto-Daju is reconstructed in the Ph.D. thesis of Thelwall (1981), who provides more than 300 lexical reconstructions along with the principal sound correspondences, but keeps discussion of morphology and syntax to a minimum.

Proto-Surmic has yet to be reconstructed; Yigezu (2001), however, reconstructs 200-300 words for each of two of its three subgroups, Southwest and Southeast. (The third is a single language, Majang.)

For Proto-Nilotic, Dimmendaal (1988) provides a "first reconnaissance", giving 204 items and ignoring tone; the work of Hall et al. (1975) and Hieda (2006) also deserves notice. Much more elaborated monograph-length reconstructions are available for Eastern Nilotic (Vossen 1982) and Southern Nilotic (Rottland 1982); each of these provides about 200 items for the relevant proto-language along with quite a few more for lower-level subgroups. Western Nilotic has not been reconstruced, but one sub-subgroup, Southern Luo, has been reconstructed in Heusing (1983).

Temein, with three poorly documented members, has not been reconstructed.

In brief: out of nine primary Eastern Sudanic families, none has yet been reconstructed in detail. Where reconstructions at this level exist, they cover a limited number of sound correspondences (usually segmental, ignoring tone), and a couple of hundred basic words; discussion of morphology is limited to a few prominent affixes.

Wednesday, December 11, 2024

More Mabaan pharyngeals

Thomas Anour has posted a number of Bible extracts: Mark 10:13-18, John 1:1-13, and James 4:1-3. Comparing these to a published translation from 2002 (from which he sometimes diverges slightly) and to the anonymous dictionary linked in the previous post makes it possible for a beginner to parse much of the text. No more examples of /ħ/ were heard; but another pharyngeal, /ʕ/, was. This phoneme is absent from the online audio version of this Bible translation, but can be heard clearly in Thomas Anour's pronunciation of at least three frequent words, despite occasional variation, and seems to contrast with the glottal stop /ʔ/, as illustrated by the the last few lines of the following table. While one of the words with /ʕ/ is an Arabic loan, the rest clearly are not.

Unfortunately, I don't know yet where it's coming from. I have yet to find any useful cognates to the words with the pharyngeal in the rest of Nilotic, or even in the meager Jumjum dictionary. "We" corresponds to Nuer <kɔn> and (probably?) Dinka /wɔ̂ɔk/.

English Mabaan
(Anour)
Mabaan
(anon)
Mabaan (Anderson)
and [ʕɔ́sì] ɔci ʔɔ́cé
so that [ʕáŋkàː] aŋ-ka ʔáŋkà
because (< Ar.) [ʕásàan] acaan
where [ʔáŋɛ̀] aŋɛ
quotative particle [ʔàgɪ́] agi ʔàgē
we [ʔɔ̂ːn] ɔɔn ʔɔ̆ɔn

Tuesday, December 10, 2024

Mabaan pharyngeals

The least well documented subgroup of West Nilotic is the Burun group, spoken around the borders between Sudan, South Sudan, and Ethiopia. The largest language in this subgroup is Mabaan, spoken in South Sudan, for which there exists at least one dictionary (available without bibliographic information on Roger Blench's site), and several very interesting articles by Torben Andersen. But we are no longer in the era where a non-field linguist could be content to look at printed sources alone; there is a fair amount of Mabaan content on YouTube, including a channel by a BA-trained linguist and first language speaker of Mabaan, Thomas Anour: Learn Maban, African Language with Thomas Anour. (Like and subscribe, or whatever it is you're supposed to do on YouTube to encourage creators.) Between these, that makes enough material to observe an interesting phonological difference.

In Mabaan as described by Torben Andersen and in the aforementioned anonymous dictionary, /h/ seems to show up only in interjections or loans, and /ħ/ is not mentioned at all. The variety spoken by Thomas Anour, however, features a number of words with initial [ħ] (occasionally varying with [h]). A single cognate in a North Burun language, Mayak, suggest that this is the reflex in his variety of *r, which otherwise becomes a semivowel in Mabaan; more would be desirable.

English Mabaan
(Anour)
Mabaan
(anon)
Mabaan
(Andersen)
Jumjum
(Fadul et al.)
Mayak
(Andersen)
sorghum field (?) <hill> [ħîl] <yielo>
"field for dura grain"
- <yiil>
"field, farm"
-
rat <heeñ> [ħéːɲ] <yyeño> "rat" yiiêɲ-ʌ̀
"~, mouse"
<yiiñ> rii-nit̪
sausage tree <heeṭṭa> [ħétà] <wyeṭṭa>
"pod of ~"
- - -
desert <hong> [ħʌ̂ːŋ] <wɔɔŋ>
"wilderness, desert"
- - -
salmon (sic) <hitta> [ħítàː] - - - -
excuse (Ar. izin) <honda> [ħʌ̀ndá] - - - -

Edit (12/12/2024): The Elenchus comparativus (von Hurter, 1800) records, s.v. "souris" (mouse), <hén> for "Abugonos Burun" vs. <rine> for "J. Kurmuk". This is the only word in the list transcribed with initial h - and the only word on the list corresponding to any of the ones above - but seems sufficient to suggest that this pronunciation is indeed old. Among words with *r, one notes Abugonos <yonga> "meat" and <ímaghi> "blood" (Kurmuk <rin>), which do not support the hypothesis of *r > ħ, but, given the imprecise transcription, do not disprove it either. My thanks to Shuichiro Nakao for sending me a link to this exceptionally early source.

Thursday, September 26, 2024

Tlemcen: medieval folk etymologies and their implications

In the mid-14th century work Bughyat al-ruwwād fī dhikr il-mulūk min banī ʕAbd al-Wād, Yaḥyā Ibn Khaldūn (brother of the more famous Ibn Khaldūn) ventures two possible etymologies for the name of Tlemcen (Standard Arabic Tilimsān, dialectal Arabic Tləmsān):

تسمى بلغة البربر تلمسنين كلمة مركبة من تلم ومعناه تجمع وسين ومعناه اثنان اي الصحراء والتل فيما ذكر شيخنا العلامة ابو عبد الله الابلي رحمه الله وكان حافظا بلسان القوم ويقال ايضا تلشان وهو ايضا مركب من تل ومعناه لها وشان اي لها شان
In the Berber language it is called "T.l.msīn", a word composed of t.l.m, meaning "she/it gathers", and sīn, meaning "two" - i.e. the Sahara and the Tell - according to our shaykh the most learned Abū ʕAbd Allāh al-Ābilī, may God have mercy on him, who was well-versed in the people's tongue. It is also said "T.l.šān", which is also a compound, of t.l., meaning "she/it has", and šān, i.e. "it has status".

Both etymologies are easy enough to interpret in the light of comparative Berber data. In the nearest (barely) surviving Berber variety - Beni Snous (Aṯ Snus), some 40 km west of the town - "Tlemcen" is indeed Tləmsin, not Tləmsan (cf. Destaing's Etude, pp. 368, 370, 371, etc.) This variety, however, does not use the word sin for "two" - it uses ṯnayən, like the Rif to its west (cf. Destaing, Dictionnaire, p. 98). The closest varieties to preserve a Berber word for "two" - geographically and genetically - use sən, in common with the rest of the Zenati subgroup to which Beni Snous belongs. The nearest varieties using the form sin are Kabyle, far to the east, and Middle Atlas Tamazight and Tashlḥiyt, far to the west. For the verb, one might consider t-əlləm "she/it spun", but the gloss given better matches a widespread dialectal Arabic word that could well have been borrowed into Berber: t-ləmm "she/it gathers". The second is obviously a compound of Arabic ša'n "affair, rank, status" and the Berber verb t-la "she/it has". Today this verb survives in Beni Snous, as in Kabyle, only residually, in the construction wi-h y-il-ən "who does it belong to?" (Destaing, Grammaire, p. 88). But it may have been more productive at that time, as it still is in Middle Atlas Tamazight.

Obviously, the first of these etymologies is implausible, while the second is a self-aggrandising play on words rather than an attempt to explain the name. But the fact that the first one could seriously be suggested is strong evidence that the meaning of Tlemcen was no more transparent to 14th century Berber speakers than it is to 21st century ones - as is not unusual for placenames. A better etymology can be proposed by taking into account comparative data - and allows us to explain the cross-linguistics differences in the final vowel - but I'll leave that for another day.

Sunday, September 15, 2024

"Berber" language in early Arabic texts

Searching Shamela, I recently realised that the earliest references to a language of (al-)Barbar in Arabic go back further than I had assumed, to the second century AH. While these are unlikely to shed much light on actual linguistic practices, they are worth a look.

Two occur in the Qur'ānic commentary (tafsīr) of Mujāhid ibn Sulaymān (d. 150 AH = 767 AD); one in his discussion of sūrat al-Isrā':

فقال عبد الله بن الزبعري السهمي: إن الزقوم بلسان بربر التمر والزبد. قال أبو الجهل: يا جارية ابغنا «٣» تمرا فجاءته. فقال لقريش وهم حوله تزقموا من هذا الزقوم الذي يخوفكم به محمد «٤» .
And ʕAbd Allāh ibn al-Zabʕarī al-Sahmī said: Zaqqūm means 'date' and 'butter' in the tongue of Barbar. Abū Jahl said "Slave-girl, find us some dates", and she came to him, and he said to Quraysh as they were about him: "Have some of this zaqqūm that Muhammad is scaring you with".

And one, almost identical, in his discussion of sūrat al-Dukhān:

بلسان بربر وأفريقية الزقوم يعنون التمر والزبد، زعم ذلك عبد الله بن الزبعري السهمي، وذلك أن أبا جهل قال لهم: إن محمدا يزعم أن النار تنهت الشجر وإنما النار تأكل الشجر، فما الزقوم عندكم؟ فقال عبد الله بن الزبعري: التمر والزبد. فقال أبو جهل بن هشام: يا جارية، ابغنا تمرا وزيدا. فقال: تزقموا.
In the tongue of Barbar and Ifrīqiyah, zaqqūm they mean 'date' and 'butter'. So claimed ʕAbd Allāh ibn al-Zabʕarī al-Sahmī, on the basis that Abū Jahl said to them "Muhammad claims that the Fire grows trees, yet fire consumes trees! So what is zaqqūm according to you?" And ʕAbd Allāh ibn al-Zabʕarī said: "Dates and butter". And Abū Jahl ibn Hishām said: "Slave-girl, find us some dates and butter!" And he said: "Have some zaqqūm."

One is vaguely reminded of Siwi a-zəggar "a (single) date"; but if indeed this were a word of "Barbar and Ifrīqiyah", one would hardly expect it to trip from the lips of Abū Jahl, and still less to be familiar to his audience. (For those unfamiliar with the context: Abū Jahl was the foremost enemy of early Islam, and ʕAbd Allāh ibn al-Zabʕarī was an anti-Islamic poet; their assertion that in some foreign language zaqqūm means "dates and butter" was almost certainly intended as mockery of the Qur'ān, not as serious lexicography.) The juxtaposition of "Barbar and Ifrīqiyah", however, seems to corroborate that the intended reference is indeed to the Berbers rather than to any East African groups.

Another is to be found in the earliest Arabic dictionary - the Kitāb al-ʕAyn, by al-Khalīl ibn Aḥmad:

والقَيطونُ: المخدع في لغة البربر ومصر.
qayṭūn: chamber, in the language of the Barbar and Egypt.

The ultimate origin of this word, unlike that of zaqqūm, is perfectly clear: it comes from Greek κοιτών, probably via Aramaic. The term is used in Coptic too: ⲕⲟⲓⲧⲱⲛ. In modern Berber varieties its form (e.g. aqiḍun) has a q, suggesting a more recent borrowing from Arabic, but one may reasonably suspect that Berbers in Cyrenaica and the Western Desert (most of whom have since switched to Arabic) would have been familiar with some version of the Greek term before Arabic influence.

A number of references to the "Barbar" are to be found in the works of Imām Mālik ibn Anas. None of these refer to the language, but one, in al-Mudawwanah, makes a clear reference to skin colour in the context of what counts as a legally punishable insult, confirming (unless this was a later scribe's addition) that the term's reference was indeed to North rather than (as Rouighi's hypothesis might suggest) East Africa:

بَلَغَنِي أَنَّ مَالِكًا قَالَ فِي الْمَوَالِي كُلِّهِمْ: مَنْ قَالَ لِبَرْبَرِيٍّ يَا فَارِسِيُّ أَوْ يَا رُومِيُّ أَوْ يَا نَبَطِيُّ أَوْ دَعَاهُ بِغَيْرِ جِنْسِهِ مِنْ الْبِيضِ كُلِّهِمْ فَلَا حَدَّ عَلَيْهِ فِيهِ، أَوْ قَالَ يَا بَرْبَرِيُّ وَهُوَ حَبَشِيٌّ فَلَا حَدَّ عَلَيْهِ وَهُوَ قَوْلُ مَالِكٍ.
It has reached me that Mālik said of mawlās in general: "Whoever says to a Berber 'Hey, Persian!' or 'Hey Roman!' or 'Hey Nabaṭī!', or calls him by any other nation of the whites - he is not to be punished for it. Or if he says 'Hey Berber!' when he is really Ethiopian, he is not to be punished for it. That is Mālik's statement.

Monday, August 19, 2024

Miscellaneous Darja notes, 2024

When I wrote my paper on the Arabic dialect of Dellys a couple of decades ago, I described it as using the participle "going (to)" - ṛayəħ رايح (m.) / ṛayħa رايحة f. / ṛayħin رايحين pl., depending on the gender/number of the subject - to form the future. In more recent years, I've started to notice women speakers reducing feminine ṛayħa and plural ṛayħin to ħa حا and ħin حين respectively. On this trip, I heard a young woman say waš ħa-yəqṛa? واش حايقرا؟ "what will he study?", with ħa unmistakeably generalised even to the masculine, as in Egyptian.

Another probable innovation in progress is the spread of -ti- as a possessive linker between CCaC nouns and pronominal suffixes. I was already familiar with forms like ṣbaʕ-ti-k صباعتيك "your fingers", but ṛwaħ-ti-na رواحتينا for "ourselves" and even šɣal-ti-hŭm شغالتيهُم "their tasks" (alongside šɣalat شغالات "tasks"!) are harder to motivate.

I alluded in my last post to a sort of possessive perfect construction; the reference was to ʕənd- ma..., meaning "have Xed plenty". It can quantify the event, as in ʕənd-u ma lbəs-hŭm عندهُ ما لبسهُم "he's worn them plenty", or the object, as in ʕənd-u ma lbəs عندهُ نل لبس "he's worn plenty", or ʕənd-u ma šaf عندهُ ما شاف "he's seen a lot".

The usual "whatchamacallit" filler word in this region is laxŭṛ لاخُر "the other (thing)". I heard one example showing that it can follow the passive prefix, and therefore substitute for verb roots as well as full stems: yə-t-laxŭṛ - yə-t-rigla يتلاخُر - يتريڨلا "get whatsited - fixed".

As in Standard Arabic, the past imperfective is regularly formed with kan "be" plus the imperfective. It's worth noting, however, that either of the two verbs can be negated: yəmma kanət ma tŭxrəj-ši يمّا كانت ماتُخرجشي "Mom used to not go out".

Codeswitches always raise the question of complement selection. In siṛaṛ win nərgŭd سيرار وين نرڨُد "rarely do I fall asleep", the loan c'est rare would take the complementiser que "that" in French, but shows up here with win "where" instead, corresponding not to French but to the construction normally used with the corresponding Arabic word, qlil قليل "few".

From an elderly aunt, I heard a double-object form that intuitively seems completely impossible in Darja to me: ila ma tḏəkkəṛnihaš إيلا ما تذكّرنيهاش "if you don't remind me of it". I think it's a religious classicism (unusually, she's literate despite having grown up before the Revolution), but noting it here in case more examples turn up.

There's a lot to be said about feminine -a in plurals. Usually it corresponds to final nisba -i in the singular, but it consistently shows up in the plural of family names (e.g. ṣwawga صواوڨة "Souags"), and I noticed it on a non-nisba profession noun: šifuṛ شيفور "driver", pl. šwafṛa شوافرة, where one might otherwise have expected *šwafəṛ. I don't recall hearing any of these words in the construct state, but for "brothers" (xiwa خيوة or xawa خاوة), forms like xiwətna خيوتنا "our brothers" seem to confirm that this really is morphologically identical to the feminine singular ending, rather than just being a different morpheme with the same vowel.

Conflicting evidence on the phonological representation of the French loanword ṣak صاك "purse, bag": "my bag" is ṣakki, with a geminate, but "bags" is (or can be) ṣikạn صيكان, with no geminate, and with an unexplained emphatic [ɑ] in the second syllable.

Secondary gemination in central Algerian Arabic is too large a topic to cover here - I have a draft paper on it I really ought to publish - but I was amused to hear it applied to the English loanword bəznəs "do business (esp. shady)": 3pl. impf. ibəzzənsu يبزّنسوا "they do business".

A couple of idioms: ma tkəssəṛ-š ṛaṣ-ək ما تكسّرش راصك "don't bother yourself, don't go to the trouble" (lit. "don't break your head"); ma fiha walu ما فيها والو "no problem, it's not an issue" (lit. "there's nothing in it"). "Easier than easy" isn't an idiom as such, but a construction worth attention: əlqur'an sahəl fuq əsshuliyya u waʕər fuq əlwʕuriyya القرآن ساهل فوق السهولية وواعر فوق الوعورية "the Qur'an is easier than easy and harder than hard."

Some proverbs: duga duga təbbəʕ əṭṭṛig əlməgduda دوڨا دوڨا تبّع الطريڨ المڨدودة "little by little, follow the level path"; kŭll ṯqil fəlmizan xfif كُلّ ثقيل في الميزان خفيف "any load is light in the balance"; ərrʷgad ṣəlṭan الرّڨاد صلطان "sleep is a despot". A few Darja words I've learned this summer - etymologically obscure, except the last:

  • gŭrgab ڨُرڨاب - fishing-ground, spot in the sea with lots of fish.
  • ẓəṛtiṭa زرطيطة - grape stalk (pedicel), subset of ʕənqud عنقود "cluster"; also refers to the last grapes of the season
  • mčəʕčək متشعتشك - (of hair) tangled, messy
  • ṭəṛṛaħ طرّاح - mattress maker

Wednesday, August 14, 2024

Spengler and morphology

While Spengler is better known for his efforts (in Decline of the West) to establish a historical morphology of cultures, he also briefly branches out there into linguistic historical morphology:
Instead of sum, Gothic im, we say ich bin, I am, je suis; instead of fecisti, we say tu habes factum, tu as fait, du habes gitân, and again, daz wîp, un homme, man hat. This has hitherto been a riddle because families of languages were considered as beings, but the mystery is solved when we discover in the idiom the reflection of a soul. The Faustian soul is here beginning to remould for its own use grammatical material of the most varied provenance. The coming of this specific ‘‘I’’ is the first dawning of that personality-idea which was so much later to create the sacrament of Contrition and personal absolution. This “ego habeo factum,” the insertion of the auxiliaries ‘‘have’’ and ‘‘be’’ between a doer and a deed, in lieu of the "feci" which expresses activated body, replaces the world of bodies by one of functions between centres of force, the static syntax by a dynamic. And this “I” and “Thou” is the key to Gothic portraiture. A Hellenistic portrait is the type of an attitude — a confession it is not, either to the creator of it or to the understanding spectator. But our portraits depict something sui generis, once occurring and never recurring, a life-history expressed in a moment, a world-centre for which everything else is world-around, exactly as the grammatical subject ‘‘I’’ becomes the centre of force in the Faustian sentence. (Atkinson translation, 1926, pp. 262-3)
A linguist - or a scientist - is not particularly well-positioned to judge airy intuitions about "the dawn of a new life-feeling" or the emergence of a "Faustian soul"; how would one even go about testing such claims rigorously? But the emergence of forms like these is rather better studied. On a world scale, there is nothing unusual about fecisti - plenty of languages collapse the subject pronoun, tense/aspect/mood, and the verb into a single word. In fact, a good 70% of languages in Siewierska's WALS survey mark subject agreement on the verb one way or another. Nor is their completely analytic separation all that rare (22% of the same sample). However, Siewierska's work reveals that there really is something unusual about a form like ich bin, where the independent pronoun is obligatory yet the verb still agrees with it:
My cross-linguistic investigations of verbal person markers reveal that person markers which require the presence of accompanying independent nominals or pronominals are very rare. In a sample of 272 languages I found only two such markers, in Dutch and Vanimo, a New Guinea language of the Sko family. The only other languages that I have come across which display such markers are: English, German, Icelandic, Faroese, some Rhaeto-Romance dialects, Standard French and perhaps Labu, an Austronesian language of New Guinea, and Anejom a language of Vanatu. (Siewierska 2001:219
Perfect marking based on "have" also turns out to be a European feature with almost no parallels elsewhere, as shown by Dahl and Velupillai's WALS chapter on the perfect; they emerged and spread there only in the Middle Ages as an areal innovation. (Bridget Drinka has worked on this question in more detail.)

For Spenglerians, then, these two would at first sight seem to be very promising features to focus on - two globally very rare features, known to have emerged in Europe only after the fall of the Roman Empire, equally innovative in Romance and Germanic and prominent in both. However, there are naturally a couple of hitches.

Person marking requiring independent pronominals is essentially a North Sea feature; though found in French, it never made it far enough south to be shared by Italian and Spanish. Given the prominent role of Italy in Spengler's account of the emergence of Faustian culture, a Spenglerian would presumably be forced to dismiss this feature or to find some workaround. If he retained it, the obvious next task would be interesting: to examine the scattering of South Pacific languages which share this feature and see if their speakers' attitudes to the ego (?) show any relevant parallels to Western European ones.

Perfect marking with "have" shows a better match with the hypothesised Faustian culture-area - but extends a little beyond it, to Albania and to some extent Greece. (Indeed, even Algerian Arabic shows a rather marginal possessive perfect construction.) This presumably reflects Romance influence on these languages in the context of Western Europe's rise to power in the Mediterranean, though one wonders why nothing similar happened outside the Mediterranean. Not a problem for a historical linguist; but would a Spenglerian be forced to take this as evidence for a change in ego-conceptions in those regions, and seek corroboration for it in literature and painting?

Saturday, May 18, 2024

Three Mubi proverbs via YouTube

In an episode of "Chadian Wisdom and Proverbs", Yaqub Muhammad Musa discusses three Mubi proverbs, providing the Mubi versions along with translation and extensive commentary in Arabic. Mubi is comparatively well-documented as East Chadic languages go, thanks primarily to Jungraithmayr's excellent 2013 monograph; but even this miniscule corpus reveals some aspects of the language that I was unable to find there.

Proverb 1:

njólĭí-lògùbá,ùfáadàmìlá
personifhave.PFV-NEGbig.M,fall.IMPFinwell
"If a person has no elder, s/he will fall into a well."
الما عنده كبير، يقع في البير

Every element of this proverb is attested in Jungraithmayr, but the phonology contains some surprises: I hear a secondary gemination after the preposition which is not discussed, and in ùfáad we observe that regular final devoicing fails to hold, presumably due to the following vowel. Moreover, mà is glossed there as "et, puis" (and, then), which is clearly not the intended sense here.

Proverb 2:

sín-dábáagìɗíil-ít,ísìwèelít-ɗòàagŏ
brother-2Mif[remaining]sweet-ADJ,refuse(?)lick.PFV-NEGa.lot
"If your brother is sweet, don't lick him a lot"
أخوك كان بقي عسل، ما تلحسه زيادة

Based on the Arabic version of the proverb, I assume báagì is a loan from Arabic, corresponding to classical bāqī "remaining", and functioning as a sort of copula; it's not in Jungraithmayr. ísì is not there either; in fact, J doesn't discuss the formation of negative imperatives at all that I can see. But it matches well with the attested verb íisí "to refuse", and presumably reflects grammaticalisation from that (cf. Latin noli). So this gives us a new construction.

Proverb 3:

írín-dàmáarsùsúm-gīísìhéegín-nò
eye-2Meven.if(?)2Mitch-3Frefuse(?)scratch.PFV-NEG
"Even if your eye itches you, don't scratch it"
عينك دي كان قاعدة تاكلك دا ما تحكّه

Here, máar is just a guess informed by context; no such form is found in J. But we find a second instance of the prohibitive construction, as well as an otherwise unattested verb sùsúm "itch". Note the OV-S construction, which alternates with SV-O under circumstances which seem to remain a mystery.

The fact that new things can be discovered about Mubi from a recording this short illustrates just how much remains to be done in terms of describing this morphologically fascinatingly complex language. At the same time, it tends to confirm the high accuracy of J's transcriptions. I hope further study will be possible.

Tuesday, February 20, 2024

Loanwords examined via Pozdniakov's Proto-Fula-Sereer

I recently finished Pozdniakov's Proto-Fula-Sereer, freely available through Language Science Press. This is obviously a very welcome and valuable contribution to West African historical linguistics, an area where much remains to be done. I have little experience of Atlantic languages as such, and therefore not much useful to say about most of the book (though it made me want to also read Merrill's work, with which much of it is in dialogue.) However, while proto-Fula-Sereer is dated by the author to 2000 years ago or more, some of the comparisons are relevant for studying contact with other regional families. Two forms are particularly interesting to me for exploring contact with Berber:

  • *xiris "slay (vb)": Sereer xiris 'couper le cou, décapiter, égorger' (Merrill: 'slit the throat') ~ Fula hirsa 'égorger; sacrifier (un animal, pour en rendre licite la consommation)' [p. 63]
    Sereer x- : Fula h- is a very well represented regular correspondence; however, Fula -r- in -rC- would normally be lost in Sereer (p. 173), and no regular pattern of vowel elision is given in the book. The word also looks like Soninke xùrùsi "to kill by cutting the jugular vein", yet the vowel correspondence is difficult there as well. The explanation is to be found in their common source as a loanword from widespread (non-Zenaga!) Berber əɣrəs, with the same meaning. The religious importance of slaughtering an animal for meat in this precise manner is sufficient to motivate the borrowing, which would thus have spread with Islam - presumably through northern Saharan travellers rather than Zenaga scholars, given the form.
  • *Guf "foam": Sereer kuf 'gonfler, écumer en bouillant', kuf a...al / kuf a... ak 'écume de la mer, à la marée montante' ~ Fula ngufo / (n)gufooji 'mousse, écume' (cf. Fula ƴufa 'mousser, écumer (trans.)', ƴufo 'mousse, écume) (Laala kuuɓ 'mousse', Nyun Gubaher gʊ-gʊfʊri 'mousse', Nyun Guñamolo tɪ-gʊf / tɪ-gʊf-ɔŋ 'écume, mousse', Joola Fonyi ka-gʊf 'bave, écume de mer, mousse du savon'). [p. 102]
    The correspondence of Sereer k to Fula ŋg (let alone ƴ) is completely irregular, with no other examples cited. A comparison to Berber forms such as Tamasheq tə-kuffe, Tamazight a-kuffi, Zenaga tu-ʔffukkaʔ-n "froth" is thus not ruled out, although the other Atlantic forms make it more likely that the resemblance is coincidental. Cp. also Zarma kùfú "écumer" and related forms in Songhay, which probably do derive from Berber.

Other forms are interesting to examine in the context of Songhay and Mande:

  • *bon "bad (svb)": Sereer bon 'être mauvais, être méchant, être maigre', ponu l / ponu k "le mal [la chose mauvaise]' ~ Fula bona 'être mauvais, être mal; être méchant', mbonki / bonkiji 'méchanceté ; malfaisance ; perversité' (widespread root in Atlantic and Mel) [p. 86]
    Also widespread well beyond; looks originally Atlantic, but the suffixed vowel in Bambara bɔ̀nɛ and Zarma bòné betrays a borrowing path via Soninke rather than directly from Fula.
  • *bul "blue (svb)": Sereer bule 'bleu' ~ Fula bula 'rincer au bleu (du linge blanc); passer au bleu de lessive; colorer en bleu pâle' (The root *bulu is common for Atlantic and Mel languages. It is not a European borrowing). [p. 86]
    If so, then this is also the source for Bambara búla and Zarma búlà "blue", and other forms across the region. But this is a widespread Wanderwort, and one wonders how a European source was ruled out.
  • *mbedd "road, path": Sereer mbed o...ong/ped k 'petit chemin laissé entre deux champs à l'hivernage, ruelle, rue, allée" ~ Fula mbedda / mbeddaaji 'grand route' (Wolof mbedd 'rue', Jaad mbɛdɛ 'grand route'; Manjaku umbɛra 'chemin carrossable, route'). May be an ancient Soninke borrowing: < béddè 'rue principale, route'. [p. 87]
    Gao Songhay has albedda / mbedda, with an interesting prefix alternation; Heath very tentatively suggests a link to Arabic blṭ, but that probably doesn't work.
  • *Birq (mb-/w-) "manure": Sereer mbiqi n 'fumier, tas de fumier' ~ Fula wirga 'labourer le sol en éparpillant la terre (en luttant au sol ou pour la mélanger ou encore pour brouiller des traces...); disperser du fumier (sur un champ)' [p. 88]
    The correspondence mb:w is not regular, arguably reflecting differences in consonant mutation; only four examples are found, although they look like plausible retentions. The loss of r in Sereer would be regular (p. 173). The correspondence of q to g does not appear regular either (p. 192), unless this is related to the preceding r; one would expect q:kk. It's just as well that the correspondence is irregular, since the Fula term is clearly at least in part a borrowing from Songhay, not vice versa: it reflects a merger of two tonally distinct verbs, found in Zarma as bírjí "fumer le sol; fumier" and bìrjí "mélanger, embrouiller", used in the expression laabu birji "mélanger la terre". Conceivably the "spread manure" sense could be original to Fula, with only the "mix" sense being borrowed; but it strains credulity to imagine Zarma borrowing the same verb but giving it two different tonal patterns depending on the intended meaning. Soninke boroko "manure" is suspiciously similar, but the vowels rule it out as an intermediary.
  • *gaw "hunt (vb); throw (vb)": Sereer xaƴ 'lancer, envoyer un projectile, tirer une arme à feu; lancer un dard, pêcher au harpon', nGawlax n / qawlax k ~ nGaƴlax n / qaƴlax k 'la chasse [gibier]' ~ Fula gawoo 'chasser, être chasseur (professionel)'. [p. 111; poorly justified correspondences - 5 words for x:g]
    The Fula term is certainly the same root as (Songhay) Zarma găw "hunter", gáwáy "hunt (v.)". The term doesn't seem to be used in Mande, from a quick look. If the Sereer form is related to the Fula one, then the direction of borrowing must be Fula to Songhay. However, the correspondence looks rather poorly justified. For x-:g-, only 5 correspondances are given, including such eminently borrowable words as "indigo" and "okra". For -ƴ:-w, the expected regular correspondence is rather ƴ:ƴ (p. 192), cf. "limp" (p. 180), "lick" (p. 174). The question of borrowing direction thus remains open.

The following cases may be only coincidentally similar, but perhaps they reflect contact at a much earlier period in prehistory, related to the spread of the practice of milking:

  • *Gang "chest": Sereer ngang n / kang k ~ Fula gannde / ganndeeje (Fula < gang-nde?) [p. 103; irregular initial correspondence with only two other examples found)
    Cp. Zarma gàndè "chest".
  • *gand "nipple": Sereer hand 'être pleine (femelle), être en gestation, porter [femelle]', hand l / qand a...ak 'mamelle (des animaux), pis', and l / and a...ak 'mamelle (des animaux), pis, téton, tétine' (to note a variety of Sereer forms: h-,q-,Ø-) ~ Fula ʔenndu ~ ʔenɗi 'sein, mamelle; pis, trayon'
    Cp. Zarma gánì "udder".

The Fulani abstract noun formative -(aa)ku is analysed (p. 231) as an "extension suffix" -aa- plus a class suffix -ku explained as a taboo-motivated allomorph of -ngu, citing Koval 2000:230 (a source in Russian). This requires further investigation; it certainly cannot be unrelated to Soninke -aaxu with the same function, but what was the direction of borrowing?

Efforts to exclude Arabic loanwords were largely successful, but even so, one crept in: Fula waabiliire "pluie d'orage" is from Arabic waabil rather than proto-Fula-Sereer *(b)waam/b (p. 79). On the other hand, Sereer tuɓaaɓ and Fula tuubako "European, white man" are derived from nonexistent Arabic *tubaab (pp. 115-116), following a long if poorly evidenced tradition connecting this to the real Arabic word ṭabiib "doctor".

"Punching up/down" in comedy: dating a lexical innovation in English

Any educated English speaker nowadays is likely to be familiar with the idea that comedy should punch up, not punch down: i.e., that it's okay to make fun of people more powerful than yourself, but not of people less powerful. But I remember being struck by the novelty of this expression when I first encountered it, well into adulthood. Notwithstanding the recency illusion, a bit of research suggests that my impression was correct. The earliest attestations I've been able to track down online go back to July 2012, in connection with a controversy about rape jokes made by some comedian named Daniel Tosh:

"Kilstein trots out the old trope that all comics are victims who have been bullied and that’s why we’re doing standup. Total bullshit, of course, but he uses the tired cliche to glorify himself and others– who are “punching up”– and characterizes Tosh and others as tyrants or bully comics who are now punching down." (Brian McKim & Traci Skene, Tosh.Opus, 16 July 2012)
"The answer is that in both cases, the comedians were “punching down.”
Punching down is a concept in which you’re assumed to have a measurable level of power and you’re looking for a fight. Now, you can either go after the big guy who might hurt you, or go after the little guy who has absolutely no shot. Either way, you’ve picked a fight, but one fight is remarkably more noble and worthwhile than the other. Going after the big guy, punching up, is an act of nobility. Going after the little guy, punching down, is an act of bullying." (the pseudonymous "Kaoru Negisa", Punching Up, 19 July 2012)

All three writers are, naturally, American, and at least two of them are standup comedians themselves. Presumably the expression would already have been in use in some circles - perhaps backstage in standup comedy - for some years before that. But internal evidence suggests that it was still not assumed to be familiar to a general audience; both sources feel the need to put it between quotation marks on first use, and one even provides a definition, treating it as a metaphorical extension of a meaning used in the context of fights rather than as a familiar term in the context of comedy. (As further evidence, one may point to its complete absence from this 2012 Jezebel article about the same controversy; had it been written a few years later, it would seem unthinkable not to use the term "punching down" in expressing these ideas.) The term's use on MSNBC (as mentioned in the first source) would have been a good first step towards making the term familiar to a wider audience. By 2014, it was already appearing in The Atlantic (""We like standing up for the little guy, we like punching up," Bolton said."). On Google Books, however, the earliest hits in the relevant sense show up only in 2016, at which time the "'punching up' vs. 'punching down' dichotomy" could still be described as a way in which this tension has "recently been encoded" (Taboo Comedy.) Before that date, the object of "punching down" mostly seems to have been bread dough.

Can anyone find an attestation predating July 2012? And does this new terminology represent a new concept of comedians' moral duties, or just relabel an older one? If the latter, what did earlier American comedians call it?


Via @sanddorn on Twitter and Matt Farthing, a 2011 attestation - once again by a stand-up comedian, but from England this time.

"And a lot of comedians do jokes that I think aren’t funny enough to justify what they are about, and there’s plenty of ways you can be offensive without ‘punching downwards’. When FB does jokes about Palestine or black people there’s much more of a point behind it really. But it’s difficult because that’s his job, that’s how he sees himself – as this comedian who’ll say anything and make jokes about anything." (Richard Herring, 18 January 2011, )

And using this, I find that Ben Zimmer managed to discover an even earlier attestation, in a good discussion of this term's origins: a blogpost, also by Richard Herring, in December 2010. Note that, in these earliest attestations, it appears as part of a broader metaphor of likening satire to punching rather than as a preset cliché: "the weak punching the strong, rather than the strong bullying the weak", "Though there are no rules, comedy, I feel, should be siding with the weak and the oppressed and punching either inwards (at the comedian him or herself) or upwards (at the powerful or the oppressors)."

The metaphor derives, as Zimmer notes, from the world of boxing: "If you’re punching up, you’re taking on an opponent who might be taller or perhaps in a higher weight class, while punching down would be for an opponent who’s shorter or in a lower weight class." But its transfer to comedy doesn't appear to have been direct: the earliest relevant metaphorical uses found by Zimmer reflect power differentials in the contexts of British football (2002), then American politics (2006).

Saturday, February 10, 2024

Abu'l-Atahiya in Korandje

A friend in Tabelbala just posted a translation of some lines from Abu'l-Atahiya into Korandje. Given the general unreliability of Facebook, I think this deserves to be recorded elsewhere, and re-translated into English:
آغَمْفْ بَا قُوُخْ * نَمْڨَآنَا لَكْوَانْكَا
الْكَاسْفْ نِيرْ بَا يَّا نَوْ * نَمْنِينَانَا اَيْصْفَا
مَسْفْكَا اَيْضِيقْ * مْبَاڨُّوكَا نَنْفُونِي
وَلاَّ تَمْزْڨِيدَافْ بَايَعْزَرْ * آطَّرَّف بَيْكاَ
نَمْتْيُوَ آكَا الْقُرْءَانْ * اْمْبَا سَّنَّدْ السَّراْيَتْكَا
نَمْفَكَّرْ أُغُوڎِي اَدْرِى * نْڎَا الْقُرُونْڎِي اَيفُوتْ
أَبَّغْ السَّاعَاتڎِي اِيفُوتْ نِيكَا * اْمْبَا يْعِيشْ ڨَآ بَيَيْكَا
إِڎْ بَكَّا لَخْبَآڎْ امْبَنْضَا * نَمْطَوْ نْڎَانَا أُورْكُورْكَا
أَيْتَا عَلاَّوْصَايَتَ اَغُوڎِي * آبْتْبَآ نْڎِيسْ عَلْحَالَتْ
ءَاسَعْدْ أُغْ مَّاوَانَا * غَارْ إِڎَ اَمَّوْ أَمْطَاسِي
وَامَّوْ أُغْسْ ءآيْشْفَقْ اْنْڎِيكَا * ءآمَّآ أَبُو الْعَتَاهِيَةْ
aɣəm=fʷ ba-qqux * nə-m-ɣ-ana ləkwan=ka
əlkas=fʷ n-ir ba-yyanəw * nə-m-nin-ana a-yəṣfa
mməs=fʷ=ka a-yḍiq * n-ba-ggʷạ=a-ka nə-n=funi
wəlla taməzgida-fʷ ba-yəʕzər * a-ṭṭəṛṛəf bạ=y=ka
nə-m-tyuw=a-ka lqurʔan * ən-ba-ssənnəd əssaryət=ka
nə-m-fəkkəṛ uɣudzi a-dri * ndza lqṛun=dzi a-yfut
a-bbəɣ əssaʕat-dzi i-yfut=nika * ən-ba-yʕiš gạ bya=y=ka
idz ba-kka ləxbạ=dz ən=bənḍạ * nə-m-ṭəw ndz-ana ur=kʷər=ka
əyta ʕə-n=ləwṣayət a-ɣudzi * a-b-tbạ=ndzi-s ʕə-n=lħalət
a saʕd uɣ mmạw-ana * ɣar idz a-mmə̣w a-m-ṭ=a-si
wə-mmə̣w uɣ=s a-yəšfəq əndzi-ka * a-n=ma abu lʕatahiya
"A dry piece of bread * you eat in a corner,
A glass of cool water * you drink pure,
In a room that's narrow, * in which you sit alone;
Or an isolated mosque, * remote from people,
In which you read the Quran, * leaning on a column
As you remember what is gone * and centuries that have passed -
It's better than those hours that passed for you * living in great houses.
What came after that stuff, * you'd burn for it in a hot fire.
Look, this is my advice, * which shows you your situation;
Happy is he that hears it * what he hears will be enough for him.
Listen to one who pities you, * whose name is Abu'l-Atahiya."

Thursday, January 04, 2024

Ngər "die out"

In Algerian Arabic, ngər نڨر means "to perish, to die out, to become extinct", used primarily of patrilineal families; nəgru نڨرو "they died out" typically means they died leaving no descendants bearing the family name. I've usually heard it in reference to small families that had no sons, but it can also be caused by mass killing, as recent events horribly remind us; expressions used in the news, alongside "wiped out", include the oddly bureaucratic formulation "erased from the civil registry".

This verb has no connection to Arabic نقر naqara "peck, hollow out, etc.", as its non-emphatic r betrays. It is a denominal verb formed within Arabic from the Amazigh (Berber) noun anəggaru "end, latter", derived from the verb gʷri "remain behind" (originally *ăgrəβ; forms cited are from Kabyle). Nevertheless, it has been been reborrowed from Arabic into a wide range of Amazigh languages, e.g. Kabyle ngər, glossed by Dallet as "die leaving behind neither descendants nor relatives; die out (family); be exterminated".

This concept, unambiguously expressed by a single word in most North African languages, doesn't seem to be lexicalised in English. Is it lexicalised elsewhere?

Tuesday, December 26, 2023

"The Sound of Music" across three languages

You may well be familiar with The Sound of Music, an American musical from the 1950s loosely based on the von Trapp family's memoirs. It features a neat little song for teaching musical notes, "Do, a Deer", which has been translated into a number of languages. Let's contrast three versions - English, Japanese, and Arabic - and see what they suggest.

EnglishJapaneseArabic
Do, a deer, a female deer, ドはドーナツのド
Do is for "donut" (dōnatsu),
دو دروب ومعاني
Do is "paths" (durūb) and meanings,
Re, a drop of golden sun; レはレモンのレ
Re is for "lemon" (remon);
ري ربيع الأغنيات
Re is a "spring" (rabīʕ) of songs;
Mi, a name I call myself,ミはみんなのミ
Mi is for "everyone" (minna);
مي مـوسيقى وأغاني
Mi, "music" (mūsīqā) and songs;
Fa, a long long way to run; ファはファイトのファ
Fa is for "fight" (faito);
فا فـجر الذكريات
Fa, a "dawn" (fajr) of memories;
So, a needle pulling thread;ソは青い空
So is blue "sky" (sora);
صوتنا ملء الفضاء
Our "sound" (ṣawt) is a filling up of space;
La, a note to follow So; ラはラッパのラ
Ra is for "trumpet" (rappa);
لم يزل فينا الوفاء
In us is "still" (lam tazal) loyalty;
Ti, a drink with jam and bread; シは幸せよ
Si is "happiness" (shiawase)
سوف تبقى يا غناء
You, O song, "shall" (sawfa) remain;
That will bring us back to Do! さぁ歌いましょう
So let us sing!
لنغنّي نغنّي.. لحن الحياة
Let us sing, sing... the tune of life!

As should be obvious, the Arabic version is derived from the Japanese one (via a popular anime of the 1990s) rather than directly from the English one. However, it contrasts sharply with both in the choice of note-mnemonics. In English, each note name (well, except "la") is mapped directly to a near-homophonous monosyllabic word, taking advantage of English's relatively short minimal word length; most of these are widely familiar, high-frequency items. In Japanese, the word choices are necessarily longer and perhaps more obscure (the syllable fa is found only in relatively recent loanwords anyway), but in each case the note is mapped perfectly to the first syllable of a single word, usually referring to something readily visualisable. In Arabic, the note is again mapped (increasingly approximatively) to the first syllable, not of a word, but of a 2-4 word phrase; not a single one of these phrases refers to anything concrete enough to visualise. High-flown slogans replace the original's homely whimsy.

I have no way of proving it, but I believe this is symptomatic - certainly of the Arabic dubbing in the cartoons I used to watch in the early 1990s, and plausibly of Modern Standard Arabic discourse in general: an imagination based on recitation rather than visualization, preferring stirring abstractions to concrete details. After all, concrete details travel poorly in this diglossic context.

Saturday, December 02, 2023

Latin authors from Algeria

The modern borders of Algeria had no existence or meaning in the Roman era, but for any potential Algerian classicists, it may be interesting to consider which of the Latin texts that have come down to us were written by people born in Algeria. So far, I've found the following:

  • Suetonius, born in Hippo Regius (modern Annaba), ca. 70 AD; historian
  • Fronto, born in Cirta (modern Constantine), ca. 100 AD; grammarian
  • Apuleius, born in Madaura (modern M'daourouch), ca. 124 AD; author of Metamorphoses, a comic-mystical proto-novel, along with various philosophical and rhetorical works.
  • Lactantius, born perhaps in Cirta, ca. 250 AD; a Christian apologist
  • Nonius Marcellus, born in Thubursicum (modern Teboursouk), perhaps late 200s AD; a lexicographer
  • Augustine, born in Thagaste (modern Souk Ahras), ca. 354 AD; a Christian saint notable especially for his autobiographical Confessions
  • Martianus Capella, born in Madaura, late 300s AD; author of a formerly influential allegorical curriculum of the liberal arts, De nuptiis Philologiae et Mercurii
  • Cassius Felix, born in Cirta, late 300s AD; author of a medical handbook
  • Priscian, born in Caesarea (modern Cherchell), late 400s AD; Latin grammarian

Conspicuously, all but one of them were born in the east, in what was then Numidia, and all but three date to the late Roman Empire, after Roman citizenship had been extended to all free men under Roman rule but before the Vandals' arrival. It is no doubt misleading to treat such authors separately from their (probably more numerous) counterparts born just across the modern border in Tunisia.

Literary works, of course, are just a small subset of what was written in Latin. For a wider selection of much shorter texts written in Algeria, the Corpus Inscriptionum Latinarum covers the area in Volume VIII. Even the Albertini Tablets, a set of legal documents found near Tebessa and mostly dating to 493-496 AD, are online now.

No doubt I'm missing a few authors; who else belongs on the list above?

Friday, October 20, 2023

Being "upon the truth"

It's not too hard to think of words that are characteristically used in English almost exclusively by Muslims - salat, namaz, wudu, shahada, masjid... There are even a few such words that aren't borrowings from Arabic or Urdu: circumambulation comes to mind. It is much more difficult, at least for me, to think of characteristics of "Islamic English" that go beyond the lexicon.

I was recently struck, however, by the expression "upon the truth". Searching for "upon the truth" yields plenty of mainstream English examples like "hit upon the truth", "lay hold upon the truth", "an essay upon the truth of the Christian religion"... However, searching for "be upon the truth", "are upon the truth", "is upon the truth", etc. yields a very different picture. Suddenly almost every single search result is specifically Islamic:

You get the idea. The rare exceptions, like "their ultimate dependence is upon the truth", reflect quite a different construction, as the inanimate subject shows. In English, referring to people or groups being "upon the truth" appears to be unique to Islamic discourse (perhaps even to some genres thereof; most of the hits seem to have a vaguely Salafi vibe).

While this construction uses only well-known English vocabulary, it literally translates the Arabic expression على الحق ʕalā l-ḥaqq "on the truth/right". Within Arabic, this expression has a bit of an archaic ring to it, but is familiar from a number of hadith, e.g:

فَجَاءَ عُمَرُ فَقَالَ أَلَسْنَا عَلَى الْحَقِّ وَهُمْ عَلَى الْبَاطِلِ
At that time `Umar came (to the Prophet) and said, "Aren't we on the right (path) and they (pagans) in the wrong?" (Bukhari 65.365)

Being "upon the truth" is thus a calque into Islamic English from Arabic. No doubt a wider investigation would reveal other such cases.

Thursday, October 12, 2023

Chenoua and the rectification of names

According to Ethnologue - or even to the HCA - Chenoua (Tacenwit) is one of the larger Berber/Amazigh languages of Algeria, spoken west of Algiers from Tipasa almost to Tenes. Unfortunately, no one seems to have told the speakers, who call their own language Haqḇayliṯ or Haqḇayləḵṯ - i.e. Kabyle. Chenoua is the name of one particular area, a mountain near Tipasa, and speakers from other areas are often entirely unfamiliar with the term; I recently learned of a first-language speaker who had reached her twenties without ever hearing of it.

This is not to say that they speak the same language in Tipasa as in Tizi-Ouzou! In fact, "Chenoua" is much more closely related to Chaoui than to what is usually called "Kabyle". But "Kabyle" is just an Anglicisation of Arabic qbayǝl - "tribes". It came to be applied to mountain-dwelling groups like this in the Ottoman period as a broad ethno-political category, not a linguistic one; around Jijel, communities who have spoken Arabic for many generations still call themselves Kabyle.

What should you call a language in a situation like this? "Chenoua" takes a part for the whole, and as such is confusing, as well as privileging one group of speakers over others. "Kabyle" matches speakers' traditional self-understanding, but misleads linguists, who are accustomed to using this for the much larger, not very closely related Berber variety spoken further west. "Western Algerian Berber" is potentially too broad; perhaps "Dahra Berber" is better, after the low-lying mountain range where most speakers live, but it presupposes a distinction from "Ouarsenis Berber" that is probably not linguistically justified.

But neither "Berber" nor the currently preferred term "Tamazight" correspond to traditional usage among speakers. "Berber" has never been used in any Berber variety; it has always been a term used by outsiders to label them, and in traditional coastal Algerian usage bǝṛbṛiyya actually referred to colloquial Arabic, not to Berber. And before the Amazigh identity movement gained ground in the late 20th century, most speakers in northern Algeria had never heard of "Tamazight".

In contexts like this, it makes no sense for a linguist to insist on using the name speakers use. Folk categories simply don't divide languages up at the same level as the one the linguists are interested in, nor for the same purposes. (In Bechar, šəlħa "Shilha" refers not only to several very different Berber varieties, but to the completely unrelated Songhay language Korandje). That doesn't mean denying the validity of folk categories; people can call whales "fish" if they want to. It does mean making sure not to get misled by them.

Thursday, October 05, 2023

Nilotic father tongues

Back in the late 1990s as human genetic data started piling up, it became increasingly clear that there were a lot of language families where most speakers shared relatively recent common male-line ancestry, visible by looking at Y-haplogroups. George van Driem memorably turned this observation into the Father Tongue Hypothesis: that language expansions are typically male-led, with children often raised to speak their father's language rather than their mother's. Berber is one of the many families where this holds true; Afroasiatic, on the other hand, shows several quite different dominant Y-haplogroups depending on the subgroup, indicating a more complex story at an earlier stage. What about Nilotic?

Nilotic, the most geographically widespread family within the rather questionable "Nilo-Saharan" phylum, divides into three primary subgroups:

  • West Nilotic was originally concentrated around the White Nile, in modern South Sudan, including such languages as Dinka, Nuer, and Shilluk. Medieval-era expansions brought Luo speakers as far south as Kenya.
  • East Nilotic languages are spread from southern South Sudan down to Tanzania, including such languages as Bari, Turkana, and Maasai.
  • South Nilotic languages are concentrated in mountainous areas of Kenya and Tanzania, including languages like Nandi and Kipsigis.

It turns out that each of these subfamilies has a reasonable correlation with a Y-haplogroup. West Nilotic shows high rates of A1b1b2b-M13 (62% Dinka, 53% Shilluk, 50% Kenya Luo, 38% Nuer, 22% Alur). Its northern members also have a high frequency of B (54% Nuer, 27% Shilluk, 23% Dinka), which is nearly absent from the more southerly ones (6% Kenya Luo, 0% Alur). A1b1b2b-M13 is also frequent, to a lesser extent, in East Nilotic (33% Karimojong, 28% Maasai and Turkana, 17% Samburu - but 0% Camus), though significant rates of B are recorded only for Karimojong (33%). In South Nilotic, on the other hand, A1b1b2b-M13 is much less frequent (13% Pokot, 10% Marakwet, 8% Ogiek, and so on down to 2% Datog and 0% Sabaot), with B even rarer (11% Pokot), and the plurality of lineages usually belong to E1b1b1-M35 - a Y-haplogroup otherwise notably associated with Cushitic and Nubian speakers (50% Ogiek, 46% Datog, 45% Marakwet, 38% Sengwer...) - or to E2. E1b1b1-M35 is not unknown further north, but is far rarer (20% Shilluk, 15% Dinka, 8% Nuer).

None of this looks much like the result of a single male-led expansion. An obvious interpretation would be that South Nilotic primarily reflects communal language shift, probably from Cushitic judging by the well-studied stratum of Cushitic vocabulary in these languages. One might reasonably postulate a classical male-led expansion to explain the spread of West Nilotic within South Sudan; but, if so, one is led to the conclusion (already plausible on linguistic and historical grounds) that the Luo expansion southwards involved considerable assimilation of local men, notably Bantu-speaking (the Bantu-associated E1b1a1-M2 accounted for 33% of Kenya Luo sampled). Such assimilation also appears probable in East Nilotic, for which I unfortunately lack data from South Sudan.

In a broader perspective, A1b1b2b-M13 is frequent in several far-flung "Nilo-Saharan" groups along the southeastern fringes of the Sahara whose languages are only very distantly related, if at all, to Nilotic: Fur (31%), various Sudanese Maban (26%), and even Cameroon Kanuri (27%). It does not, however, seem to be frequent among Nubian speakers, much closer at hand.


I won't attempt to exhaustively reference this post, which is basically open notes on work in progress, but key sources include Wood et al. 2005, Tishkoff et al. 2007, Hassan et al. 2008, Gomes et al. 2010, and Hirbo 2011. Note that I've combined different samples for Nuer and Dinka.

Tuesday, October 03, 2023

Feynman's Father's Fallacy

The first time I read this quote from Richard Feynman, I was quite convinced by it:

The next Monday, when the fathers were all back at work, we kids were playing in a field. One kid says to me, "See that bird? What kind of bird is that?" I said, "I haven't the slightest idea what kind of a bird it is." He says, "It's a brown-throated thrush. Your father doesn't teach you anything!" But it was the opposite. He had already taught me: "See that bird?" he says. "It's a Spencer's warbler." (I knew he didn't know the real name.) "Well, in Italian, it's a Chutto Lapittida. In Portuguese, it's a Bom da Peida. In Chinese, it's a Chung-long-tah, and in Japanese, it's a Katano Tekeda. You can know the name of that bird in all the languages of the world, but when you're finished, you'll know absolutely nothing whatever about the bird. You'll only know about humans in different places, and what they call the bird. So let's look at the bird and see what it's doing-that's what counts." (I learned very early the difference between knowing the name of something and knowing something.)

And it would be true - in a world where no one else knows anything about birds. (That's probably not so far from the world you or I or Feynman grew up in as children.) If you don't know what nightingales are called, and neither does anyone else, then you can still learn about them - if you have the time and patience to go deep into the countryside to places where they live, and spend cold nights with a pair of infra-red goggles, or set clever traps deep in the countryside or something.

On the other hand, if you do know what a nightingale is called, you can find out enormous amounts about it by simply asking. You can scour Google Scholar for papers by people who did the hard part already; you can get birdwatchers talking about it; you can look it up in a reference manual; in short, you can benefit from the accumulated experience of many generations of observers, instead of having to reinvent the wheel yourself, only to have your knowledge perish with you in the end. If you know what it's called in other languages, you can find out what other communities of observers had to say about it - which, in some cases, may reflect much longer observation than English speakers have been able to undertake. Having found all this out, you can understand your own observations better. Maybe you've discovered something new! Or maybe you've misunderstood what you saw because you lacked a broader context. Either way, you'll know much more with the name than you're ever likely to be able to discover individually without it.

Wednesday, September 27, 2023

Two Bambara words in Gnawa songs of Meknes

Across North Africa, small groups dominated by descendants of slaves brought from the Sahel preserve musical traditions, with ritual and medical functions, usually called Gnawa in Morocco, Diwan in Algeria, and Stambeli in Tunisia. Aguadé's Die Lieder der Gnawa aus Meknes provides the lyrics of an extensive corpus of Gnawa songs from Meknes in northern Morocco. These songs are primarily in Arabic, but characteristically include a number of words with no plausible Arabic or Berber source, presumed to derive from languages of the Sahel. Their identification, however, is generally difficult, although Aguadé ventures a few suggestions drawn from Hausa. Anyone can comb dictionaries for sound-alikes, but similar forms may be found across unrelated languages of the Sahel with very different meanings. It would be much easier if the meanings were certain, but the singers do not necessarily know the meaning of such words, and the context often hardly narrows it down. Nevertheless, some cases can be identified more confidently than others.

Aguadé's song number 88, Lalla l-Batul "Lady Virgin" (pp. 128-129), is dedicated to a female genie whose song cycle corresponds to the colour yellow. Its refrain (accounting for 5 out of its 8 lines) is a lalla l-batul, saysay "Oh Lady Virgin, saysay". The word saysay has no meaning in Arabic or in Berber. In Bambara, however, sáyi means "yellow"; the refrain would then be "Oh Lady Virgin, yellow, yellow".

In his song number 90 (pp. 130-132), the refrain is fufu dənba ya sidi "fufu dənba, oh master" (repeated 14 times, including the opening line of the song). Bambara dénba means "mother". The first verse after the initial refrain is ma bɣatək kda ya sidi "she didn't want you like that, oh master"; no feminine singular subject to which this could refer appears anywhere in the Arabic text of the song, but the Bambara interpretation allows this line to be better understood. I'd like to relate the preceding fufu to Bambara fò "greet" and/or fɔ́ "say, speak" - "greet Mother" would seem contextually appropriate - but I can't quite see how the grammar would hang together.

Addendum: In song 5, Sidi Gangafu "Mr. Gangafu", almost every couplet ends in Bambaṛa or shortened ya Mbaṛa, so a Bambara etymology seems worth considering (although an allusion to Hausa is also found). As Aguadé notes, Ganga is simply a kind of drum used by the Gnawa, whose name is shared across most of the Sahel, so one would expect this name to mean something like "drum-player" or "drum-maker". In fact, Gangafu can readily be interpreted as Bambara gàngan-fɔ̀ "play the ganga-drum". "Drum-player" should properly be something like gàngan-fɔ̀-la, but it doesn't seem like much of a stretch to suppose that the Bambara used by slaves among themselves would have had some non-standard features, given that for many of them it would have been a second language to begin with.

Wednesday, September 20, 2023

Some Dellys manuscripts

(Not linguistics, just history - possibly self-indulgent at that.)

Quite a few years ago in Dellys, I was allowed to photograph a bundle of pages from different manuscripts grouped together in a single detached cover, labelled as belonging to my great-uncle (رحمه الله). (I wasn't very good with metadata at the time, so I apologise in case anything ended up in the resulting folder from a different source.) Both the internet and my ability to read premodern Arabic handwriting have advanced a lot since then, and I can now identify (more or less) six of the works which these were taken from:

  • A commentary on al-Nawawī's Forty Ḥadīth - a selection of key sayings of the Prophet Muḥammad (SAWS)
  • Muhammad Mayyāra's commentary on Ibn ʕĀshir's Guiding Helper - a condensed summary in verse of essential Mālikī fiqh (religious jurisprudence)
  • Abū al-Layth al-Samarqandī's Warning to the Neglectful, a book of religious exhortation
  • Ibn Ghānim al-Maqdisī's Decipherment of the Symbols and Keys of the Treasures, explaining Sufi concepts and terms
  • A linguistically focused commentary on al-Būṣīrī's Mantle - a poem in praise of the Prophet Muḥammad (SAWS)
  • Ibn Mālik's Thousand-Liner - a condensed presentation of Arabic grammar in verse to facilitate memorisation
  • A commentary on al-Abharī's Isagoge - an introduction to Aristotelian logic

Apart from these, there were a few pages of rhymed dua (supplication to God), which I can't find a source for online.

I still can't identify most of the commentators; it seems that plenty of commentaries have yet to be properly digitised. But the geographic spread of the authors is noteworthy, covering almost the whole span of the former territories of the Umayyad Caliphate: al-Samarqandī from Uzbekistan, al-Abharī from Iraq or Iran, al-Nawawī from Syria, Ibn Ghānim al-Maqdisī from Palestine, al-Būṣīrī from Egypt, Mayyāra and Ibn ʕĀshir from Morocco, Ibn Mālik from Spain. The chronological spread, on the other hand, is notably more concentrated: 10th c. (al-Samarqandī), 13th c. (al-Abharī, al-Nawawī, al-Būṣīrī, Ibn Mālik), 16th/17th c. (Ibn Ghānim al-Maqdisī, Ibn ʕĀshir). The 13th century doesn't necessarily spring to mind as a golden age of Islamic thought, but for the early 20th century curriculum this notebook presumably reflects, it was at least a golden age of school texts. (On the other side of the Mediterranean, it was also the age of Thomas Aquinas and Dante.) The absence of 19th century texts here might be accounted for by the rise of printing, but that cannot explain the paucity of texts from other recent centuries; even the 16th/17th century texts seem to be intended to open the door to understanding older works. The common purpose of these works should also be clear: all of them either relate directly to religion or are ancillary to the religious sciences.

The texts themselves accordingly therefore cast only a very indirect light on the context where they were being studied. A note carefully added in pencil on the inside cover sometime in the early/mid-20th century, however, is much more eloquent:

WARNING: The earth is a dark planet, lit by the moon at night and by the sun in the day. The earth is suspended in space by the power of Allah SWT; He made a gravitational power in the stars that attracts the earth towards them just as a magnet attracts iron. The earth is not carried on the horn of a bull, as claimed on p. 36 of this book in a ḥadīth of `Abd Allāh ibn Sallām when he asked the Messenger of Allāh SAWS about the earth "What was it created from?" and so on until he asked him "And what do these seven earths rest upon?" He replied "On a bull." He asked "And what is the bull like?" He said "A bull with 40,000 heads", etc. This ḥadīth has no basis, and has been deemed fabricated, and none of the learned have confirmed this ḥadīth - and Allah knows best.

This short comment feels like the entire modernist era in a nutshell - that late 19th/early 20th century moment of collision with the West, when this vast storehouse of traditional knowledge, stabilised over centuries by mnemonic verses and long insulated from external criticism, is suddenly confronted with an urgent need to sift out the grain from the chaff and go back to first principles, or risk losing intellectual as well as physical battles. We're still living through the aftermath; one result is a widespread suspicion of works formerly treated as unimpeachable, including some of those above.