Jabal al-Lughat

Wednesday, July 25, 2007

Writing codas, from Sylhet to Winnipeg

In Greek-based scripts (like Latin or Cyrillic), unless a consonantal letter is followed by a vowel letter, it is assumed not to be followed by a vowel. This seems natural enough if you're used to it; but if you look at it differently, it's rather wasteful. The commonest sound to follow any given consonant is usually a vowel, not another consonant, so if you allow a single letter to represent a consonant plus a vowel you're saving space and effort.

But if you do that, then how do you represent the fact that a consonant is not followed by a vowel? Different writing systems use different solutions. In alphabets that have stuck more closely to their Canaanite prototype, like Arabic, Hebrew, Syriac, or (traditional) Tifinagh, you normally don't bother: a consonant may be followed by a vowel or may not, and you rely on the reader to figure it out. However, sometimes the reader needs additional cues: maybe the word you're writing is obscure, or two words have the same consonants, or it's very important that the text be read exactly right with no possibility of error. In that case, in Arabic, Hebrew, and Syriac, you mark what follows each consonant with a little sign above or below the letter - one sign for "a", say, another for "i", and another to indicate that nothing follows it. Such a sign is necessary if you're still mainly using the system with no vowel marking, because if you left the letter unmarked it would mean not that the letter had no vowel but that what vowel, if any, followed the consonant should be deduced from context.

Typical Indic scripts, such as Devanagari (the script used for Hindi and Nepali), adopt a rather different solution. A consonant letter on its own is to be read with a default vowel, short a ([ʌ]); a consonant followed by a consonant is written as a single "conjunct" letter, formed in any of several ways, but usually by either putting the second letter underneath the first or taking away a line on the right of the first letter and joining it to the second. On the plus side, this yields much of the compactness of a vowel-optional system without any of the ambiguity, and means that each letter is pronounceable on its own; on the minus side, this means fonts have to include a much larger number of letter forms.

Sylheti Nagri is an Indic script formerly (up to the 1950s or so) in use in the district of Sylhet, in eastern Bangladesh. Like Devanagari, it represents consonant-consonant sequences using conjuncts. However, its users were often also familiar with the Arabic script, where letters could be combined into ligatures whether or not they had vowels between them. This may have inspired them to do something rather unusual for an Indic script: develop vowel-consonant conjuncts, such as a+m, a+l, i+n... and consonant-vowel-consonant conjuncts, like pi+r, mo+t... In fact, judging by the examples in the Unicode proposal, it seems that, for at least some historic users, Sylheti did not have a conjunct system at all, just a ligature system.

One very nice solution is that adopted in Canadian Syllabics, the family of writing systems used by a number of Native American tribes in Canada. The name is potentially misleading: I prefer to reserve the term "syllabary" for writing systems like hiragana, where different syllables differ from each other unpredictably. In Canadian Syllabics, for example Cree, the shape of a symbol represents the consonant, while its orientation represents the vowel that follows it, and length or labialisation may be represented by dots. If no vowel follows the consonant, then the base shape is simply written small and superscripted, using the a-orientation, or for labialised consonants the u-orientation.

Monday, July 23, 2007

Baccouche on Darja

Just found an interview with Taieb Baccouche, preparing a Linguistic Atlas of Tunisia - the interview isn't actually that interesting, but I definitely want to see that atlas.

Monday, July 16, 2007

Language endangerment in Yorkshire

Several members of the British Parliament took a few minutes out from worrying about issues like Iraq, the housing shortage, and global warming to put together an Early Day Motion expressing their concern about the fate of the Yorkshire dialect:

That this House is concerned at the recently published research indicating that words are disappearing from the Yorkshire dialect because of the influence of the internet, social mobility and globalisation; and furthermore supports the work of the Yorkshire Dialect Society in continuing to promote what is, after all, the best English regional accent in the world.

The amendments proposed are also worth a look, featuring such phrases as "a slow national convergence towards the monochrome mush of effete estuarial English". For what it's worth, I am rather inclined to agree that Yorkshire may have "the best English regional accent in the world" - although two MPs proposed to amend this to "after the Lancashire accent" - and I'm glad to see a bit of appreciation for dialectologists, but I find it difficult to be all that concerned about the loss of a few well-documented local words (supposedly due to people watching national media and getting out more) in a fairly widely spoken dialect of one of the world's most flourishing languages, when whole languages are disappearing virtually undocumented every month due to factors like kidnapping children or beating them when they speak their language.

Thursday, July 12, 2007

Harun ar-Rashid and the Golden Apples of the Hesperides

I recently heard a rather good folk tale from my father about the adventures of (a completely mythologised) Hārūn ar-Rashīd during his foreordained seven years of hardship, living as a poor man dressed in goatskin nicknamed Bou-Krisha (بو كريشة). One element of the story fits nicely with the previous two posts' theme of cultural survivals from the classical era. The king gathers his sons-in-law and his would-be son-in-law Bou-Krisha, and tells them that he is terribly ill, and to cure him they must go and bring him:

ət-təffaħ ən-nifuħ التّفّاح النّيفوح
əlli yṛədd əṛ-ṛuħ اللي يردّ الرّوح
m-əs-səb`a jbal مسّبعة جبال

the fragrant apple
that restores the soul
from the Seven Mountains

For the tale's purposes, of course, all that matters about this evocative phrase is that it refers to something that it will take a long and arduous quest to get. But the historically minded listener may be excused for speculating on the phrase's origin.

Etymologically, the phrase is mildly interesting. nifuħ is unexpected, and possibly distorted to fit the rhyme - a more normal term, with obvious Classical Arabic origins, would be nəffaħ; it might have arisen by contamination from əlli yfuħ “which smells” (especially since əlli in Kabyle is ənni.) But it may be possible to look deeper.

Ceuta (Arabic səbta سبتة) is an ancient Moroccan port town at the edge of the Straits of Gibraltar which has been part of Spain since 1668. Its name derives from a longer Latin one - Septem Fratres, the Seven Brothers, said to be a reference to seven hills around the city; it was a wild area, among the last places in North Africa where elephants were found (as noted by Pliny.) And the region around the Straits of Gibraltar is where the gardens of the Hesperides were supposed to be located - where the Golden Apples grew. Is ət-təffaħ ən-nifuħ one of the Golden Apples?

Friday, July 06, 2007

Berberised Afro-Latin speakers in Gafsa

One reader of my last post asked how late Latin (or some descendant thereof) continued to be spoken in North Africa. The answer is, pretty late: the latest attestation I came across on short notice seems to be in the major medieval geographer Al-Idrisi (12th century) who, describing Gafsa in southern Tunisia, notes that:

وأهلها متبربرون وأكثرهم يتكلّم باللسان اللطيني الإفريقي.
Its inhabitants are Berberised, and most of them speak the African Latin tongue.

He even gives one word of their dialect:

ولها في وسطها العين المسماة بالطرميد.
In the middle of the town is a spring called the ṭarmīd (perhaps to be related to Latin thermae).

One interesting thing to note about this statement is that he said that the town was Berberised - in other words, that, in the very century when the Banū Hilāl were rapidly spreading through Tunisia and Libya (a subject he has fairly harsh things to say about), Berber culture was prestigious enough to be adopted by members of other cultures, in particular the remaining Roman or Romanised towns, in the area. Gafsa, of course, speaks Arabic now, but several nearby villages still spoke Berber in the 1800s, and two, Sened and Majoura, well into the 1900s.

Wednesday, July 04, 2007

Chenanith b'Libya - in the 11th century AD?

Anyone interested in North African languages who doesn't speak Dutch should immediately check out Bulbul's posting on Latino-Punic. The Phoenicians brought their language with them to North Africa when they founded Carthage and other cities. Carthage was destroyed, of course, but many other cities continued to speak Phoenician for longer; however, like Arabic in more recent times, it changed a lot under Berber influence, and this later dialect is usually called Punic. This language was spoken by St. Augustine, who quotes a number of Phoenician words, such as salus (< shalu:sh < shalo:sh < shala:sh < thala:th) "three", in his works. In eastern Libya, as it happens, Punic continued to be written even after the Phoenician alphabet was forgotten; this body of inscriptions, using the Latin alphabet to write Punic, is called (logically enough) Latino-Punic, and a comprehensive database of such inscriptions is available from Leiden. Recently, as Bulbul points out, a thesis was submitted at Leiden on Latino-Punic and its Linguistic Environment; I would love to read it.

The twist in this tale is that Phoenician may have survived into the 11th century AD! Al-Bakri (whom I've mentioned before) enigmatically says of the inhabitants of Sirt in Libya that:

لهم كلام يراطنون به ليس بعربي ولا عجمي ولا بربري ولا قبطي ولا يعرفه غيرهم
‍They have a speech in which they jabber which is neither Arabic nor Ajami (by which he probably means Latin ~~but might mean Persian~~) nor Berber nor Coptic, which no one but them knows.

The location (in eastern Tripolitania) is about right for it to be Punic, and if it were Greek you would expect him to know, considering he cites (more or less correctly) the Greek etymology of طرابلس (Tripoli) in the next page. So was Punic still spoken in the 11th century? Your guess is as good as mine, but it looks plausible.

Tuesday, July 03, 2007

Galileo's sociolinguistics and free software

Just came across an interesting quote from a law professor in the free software movement on Europe's shift away from diglossia:

[W]ith the name Galileo Galilei, we associate two of the most important cultural responses to the quandary of possessed physics.

The first is an insistence upon freedom from censorship, that is "e pur si muove" -- determination to prohibit the ownership of physics by an entity rich enough and powerful enough to define its physics as the only permissible physics, the only available physics, for most ordinary people. And second, the first significant attempt in the history of the West to write scientific literature at the state of the art in a vernacular language, accessible to everyone.

Galileo Galilei's decision to publish in Italian is as important as his decision to risk confrontation with the Church, for what it says about the fundamental pillars of free science in the history of the West. Not merely, in other words, an insistence upon the freedom of ideas to work their will in skilled hands, but a determination that the ideas which motivate the world, which explain its behavior and which render it controllable, should be universally accessible to people regardless of their ability to acquire enough social surplus to have Latin.

I'm not sure whether the details of his account are accurate, but this has always been one of the strongest arguments against diglossia. The availability of universal free education goes a long way to mitigating the problem; but it's still not cost-free, since all the time devoted to learning the high language is time that could have been devoted to learning something else. (Of course, there are also practical issues regarding the quality of teaching provided - but that's another story.)

Wednesday, June 20, 2007

Is Omotic Afroasiatic?

Omotic, a small group of non-Cushitic, non-Semitic languages spoken in the highlands of Ethiopia, has always been the odd one out in Afroasiatic; by anyone's tree it is the first to have split off, and the noted Chadicist Paul Newman expressed scepticism about its membership in the family. I know little about Omotic, or Cushitic for that matter, but after reading a few sketch grammars in Omotic Language Studies , I found it very difficult to imagine these languages as Afro-Asiatic; with Berber or Hausa or Beja or Semitic the cognates are instantly visible, but none of the most familiar grammatical morphemes or lexical items seemed to be present. However, a paper I just came across by Rolf Theil is the first I've seen to present an argument against the hypothesis, and a pretty good one at that. There are parts I would question - for example, the suggestion that pronouns are unreliable (they are conspicuously unreliable in regions where extensive politeness systems have developed, like East and Southeast Asia, but I didn't think highland Ethiopia fell in that category) - but the overall argumentation seems good. In particular, the attempt to show that a roughly equal number of similarities can be observed between Omotic and families other than Afro-Asiatic is on the right track - if Omotic were to have more similarities with Afro-Asiatic than with any other family, then merely pointing out problems with some of those similarities would be inadequate. I'll be interested to see the reactions of people better acquainted with the family.

On another note, I passed my upgrade presentation yesterday - yay!

Sunday, June 17, 2007

Ugaritic inscription

Last weekend I got a chance to indulge my longstanding passion for ancient Semitic languages at the Louvre. The Ugaritic collection was, as you might expect, especially good; I took many photographs, including this particularly clear one here, a ceremonial axe from the 13th or 12th century BC. Since the Ugaritic alphabet only contained some 30 letters, it's easy enough to read the inscription (turn it 90 degrees counterclockwise), although no word dividers are present:

xrṣn rb khnm

which the museum caption translates as "la hache du Grand Prêtre". xrṣn is presumably "axe"; I can't find it in my small dictionary, but it looks like it might be related to xurāṣ "gold" (itself cognate not only to Hebrew ḥarūṣ, but also to Greek chrysos, a Semitic loanword.) rabb- means "great one", identical to Arabic ربّ "lord" and cognate to Hebrew rav "great one; rabbi". kāhin- is "priest", identical to the Arabic كاهن "soothsayer" and cognate to Hebrew kohen "priest" - yes, the same word from which the surname "Cohen" comes from. -īm is the oblique plural, identical to Hebrew -īm (which however is no longer inflected for case) and cognate to Arabic -īn. Once you start looking, it's so easy to spot the connections between Semitic languages; no wonder people a thousand years ago noticed.

Wednesday, June 13, 2007

Brothers in Law

Reading a Language Log post I just noticed a rather laughable argument apparently being used in the Jose Padilla trial (according to AP):

FBI wiretaps played in court for jurors contain frequent references to "brothers," which prosecutors say means mujahedeen fighters looking for a battle. Defense lawyers contend the term is a common expression among male Muslims.

If the AP article has correctly represented the prosecution argument, they must be either absolutely desperate or thoroughly unqualified. As the defense correctly states, "brother" is fairly commonly used between male Muslims, in accordance with a hadith saying that "A Muslim is the brother of a Muslim"; it carries absolutely no implication of being a fighter. Google will turn up numerous examples; for an illustrative sample, consider this slightly frivolous MPAC forum discussion about finding motivational speakers, where other Muslims are called by the terms "brs", "bros", "akhee" (Arabic for "my brother"), and "Brother".

However, in fairness, other reports indicate that the prosecution claims that the defendants used some kind of system of codewords: they claim innocuous words like "picnics", "football", and "marriage" were used with much more sinister intended meanings. If their claim is that "brother" meant "fighter looking for battle" in this alleged code, as opposed to in normal usage, then that might not be completely absurd; I haven't found any transcripts, so I can't attempt to evaluate the plausibility of such a claim.

The usual arguments about the correct translation of "jihad" and "Allah" apparently came up as well - I don't think I'll bother adding to the thousands of web pages discussing that issue.

Saturday, June 02, 2007

Popper, Sapir, and international auxiliary languages

I've been reading some of Karl Popper's work lately, and found it quite interesting (and clearly written, which one doesn't always expect of philosophers.) Both his political and his scientific writings are dominated by the same important theme: no one can get closer to the truth without being willing to put their beliefs to the test, and the more different people doing the testing, the less likely they are to overlook a flaw in the idea. Thus dictatorship and censorship - in any power structure, governmental or academic - are not just bad, but intrinsically prone to get worse results. I noticed that he took this view to have implications for language policy too:

The adoption of rationalism implies, moreover, that there is a common medium of communication, a common language of reason; it establishes something like a moral obligation towards that language, the obligation to keep up standards of clarity and to use it in such a way that it can retain its function as a vehicle of argument. That is to say, to use it plainly; to use it as an instrument of rational communication, of significant information, rather than as a means of 'self-expression', as the vicious romantic jargon of most of our educationists has it. (It is characteristic of the modern romantic hysteria that it combines Hegelian collectivism concerning 'reason' with an excessive individualism concerning 'emotions': thus the emphasis on language as a means of self-expression instead of a means of communication.) (Karl Popper, The Open Society and its Enemies vol. II: Hegel and Marx, Routledge 1945/2003, p. 264)

While this quote is mainly about how you should use a given language, rather than which language to use, it clearly suggests the desirability of some international language, and Sapir's idea of how such a language should be built happens to be rather Popperian in spirit:

It [the international auxiliary language] must, ideally, be as superior to any accepted language as the mathematical method of expressing quantities and relations between quantities is to the more lumbering means of expressing these quantities and relations in verbal form. This is, undoubtedly, an ideal which can never be reached, but ideals are not meant to be reached; they simply indicate the direction of movement. (p. 51)... National languages are all huge systems of vested interests which sullenly resist critical enquiry... (p.60) Intelligent men should not allow themselves to become international language doctrinaires. They should do all they can to keep the problem experimental, welcoming criticism at every point and trusting to the gradual emergence of an international language that is a fit medium for the modern spirit. (p. 64, Edward Sapir, "International Auxiliary Language" in Culture, Language, and Personality, Berkeley: University of California)

So it is all the more ironic to find that Popper's paragraph continues with this:

And it implies the recognition that mankind is united by the fact that our different mother tongues, in so far as they are rational, can be translated into one another. It recognizes the unity of human reason.

This seems to imply that linguistic diversity is worthless: if something is rational, it can be explained in any language, and if it can't be explained to me in my language, it must be irrational. I rather suspect that the Sapir-Whorf hypothesis was partly intended as a rebuttal to this sort of argument. If your language tends to blind you to the differences in logical form between sentences with superficially identical structures (his example in the very essay quoted above is the perfective/imperfective distinction in English) and makes it easy to spot the differences between ones with different structures, then the ideal auxiliary language should allow you to express logical form as unambiguously as possible; and to be able to make a language free of your own linguistic biases and blind spots, you will have to carefully study many languages of as many different types as possible.

Of course, that begs the question: is there such a thing as an overall better language, or is that whole approach misconceived? Algebraic notation is unquestionably superior to English for describing physical laws, but it's not a very effective way to make a grocery list. In practice, people use different languages, and different technical vocabularies embedded in the same language, for different purposes.

Monday, May 28, 2007

Talk at SOAS: The typology of number borrowing in Berber

Just a quick note for London readers: I'm going to be giving a talk on Wednesday in room B111 at SOAS, on "The typology of number borrowing in Berber" - basically the same talk I gave in Cambridge, expanded a bit (for example, I've added a section on Northern Songhay.)

Sunday, May 27, 2007

Why people say silly things about historical linguistics

I recently realised that a lot of popular misconceptions about language evolution derive from uncritical use of the "family" metaphor. In families, a person has kids and then stays around, alongside the kids, for many years... they may live to see their great-grandchildren. The parent and the child may show a family resemblance, but will certainly be separate individuals. If you're told that languages come in "families", and "descend" from past languages, then it seems perfectly reasonable to imagine those ancestor languages lingering on alongside their descendants, and to imagine that the minor changes occurring daily within the language you speak are completely different from the sharp discontinuities that would have to occur for a new language to emerge.

But languages don't work that way at all: a language's "descendants" are (with rare exceptions) simply the various results of its own changes in the mouths of various communities. It's usually meaningless to talk about one living language being the "ancestor" of another one; in such cases, both are descendants of the same ancestor, even if (as infrequently happens) one has changed significantly less than the other. (Revived languages, like Sanskrit, are arguably an exception.) The same mistake is frequently made in popular understandings of biology, for the same reason; people imagine that chimpanzees (say) are humans' ancestors, when in reality the very fact that chimpanzees exist alongside humans proves that, while both species share a common ancestor, that ancestor was neither of them (or, looking at it another way, has equal right to be described as either of them.)

Friday, May 25, 2007

Songhay materials

Songhay is a close-knit family of languages in West Africa, spread by the medieval Songhay Empire, that happens to be rather relevant to my PhD. It has no close relatives; the best guess is that it's Nilo-Saharan, but if it were spoken in the Americas, it would undoubtedly be classed as having no relatives whatsoever, and the resemblance to other languages is not strong. It has some rather interesting syntactic patterns. Throughout the family, NPs are organised as follows: possessor - head - adjective - determiner - plural marker, eg Kwarandzie adra kedda gh yu (mountain small this pl) "these small mountains", Sidi L`arbi n iz yu n targa (Sidi Larbi 's child pl 's canal) "Sidi Larbi's children's canal". While at least one other West African family, Mande, has this NP order, the only case I am aware of offhand outside Africa is Ulwa in Nicaragua. Even rarer worldwide is a feature found in a number of centrally located Songhay varieties: having two distinct classes of verb, one - the vast majority - requiring SOV word order (ie preverbal objects), and the other (including such verbs as "follow", "marry", "want", "see", "fear", "bring", at least in Gao) requiring SVO order (ie postverbal objects.)

Anyway, for me this has been a great week for finding materials on Songhay. Jeffrey Heath has updated his webpage, adding work in progress on the nearly undocumented dialect of Humburi as well as several others (I will find the Tadaksahak wordlist especially useful; aside from Songhay, readers may also want to check out his Dogon materials.) On a missionary website (though I strongly disapprove of such work, it does have useful byproducts), I found a good hour of fairly comprehensible audio in Tadaksahak, an inadequately documented Northern Songhay language important for my purposes; and SOAS library just informed me that the copy of Ousseina Alidou's unpublished dissertation on Tasawaq, an even more important language for comparisons to Kwarandzie, has at long last arrived from Hamburg. Above all, I got an email from a kind contact from Tabelbala, with some more Kwarandzie audio files.

For other Songhay materials, try Relative Clauses in Tadaksahak, Some Verb Morphology Features of Tadaksahak, Northern Songhay Languages in Mali and Niger, Southern Songhay Speech Varieties in Niger, The Zarma Website, Zarma Dictionary, Notions élémentaires pour apprendre le Zarma, La dénomination en Zarma, Lexique kaado-français...

Tuesday, May 22, 2007

Prenominal adjective borrowed into Arabic from Persian?

A major interest of mine lately is the way in which lexical borrowings can affect syntax, dragging bits of the source language's word order with them. I came across what looks like a nice example of this in a book on Gulf Arabic. In Kuwaiti dialect, as in all dialects of Arabic, adjectives normally follow the noun. However:

The (Persian) adjective kooš precedes the noun it qualifies. It does not occur in association with defined nouns. It is not inflected for gender or number. Thus:
kooš walad, bint a good boy, girl
(T. M. Johnstone, Eastern Arabian Dialect Studies, London: Oxford University Press 1967, p. 147.)

Only trouble is, my Persian grammar doesn't say anything about the Persian adjective in question being pre-nominal, and virtually all adjectives in Persian are post-nominal. Does anyone know more about this?

Wednesday, May 16, 2007

Learn Oneida!

Came across a great new site, the Oneida Language Revitalisation Program, consisting mainly of an extensive audio phrasebook of Oneida, the Iroquoian language native to upstate New York. There's also a teaching grammar and dictionary at Oneida Language Tools, and some video at Tracy Williams' site. It's great to see this much material online for a language with less than two hundred speakers; this should make it a lot easier for would-be speakers to make a good start at learning it.

Translation and propaganda

Horrifying news from Palestine - a Hamas Mickey Mouse is telling Palestinian kids to "annihilate Jews"! Or not. In fact - after running on a wide range of media, few of whom I suspect will bother to correct their story - this story was independently quickly exposed by several sources, such as Angry Arab, Ali Alarabi, and Brian Whitaker; MEMRI (the Israeli secret services-linked outlet that provided it) made the mistake of providing a video allowing any Arabic-speaker to confirm their mistranslations. With just a bit of spin, the kids' show in question was turned from merely propagandistic to verging on Bond-villain-esque:
* nqāwim, "we will resist", is rendered as "we will fight";
* biṭuxxūnā l-yahūd "the Jews shoot us", is rendered as "we will kill the Jews" (!);
* 'astašhid "I will be a martyr" as "I will commit martyrdom" (I don't think that's even an English expression, but never mind);
* 'ustāđiyyat al-`ālam, literally "professorship of the world" (in context, they clearly mean being at the intellectual forefront of the world), is rendered as "masters of the world".

When challenged on the translation of "biṭuxxūna l-yahūd", the ex-colonel in Israeli military intelligence who runs MEMRI, Yigal Carmon, apparently resorted to insisting that because "yahūd" (Jews) comes at the end, it must somehow be the object! ("Even someone who doesn't know Arabic would listen to the tape and would hear the word 'Jews' is at the end, and also it means it is something to be done to the Jews, not by the Jews.") It is rather difficult to imagine someone running an organisation dedicated to translating Arabic being unaware that subjects in Arabic commonly follow the verb (especially when a pronominal object suffix (-nā "us") is present, as here).

The moral in all this for English-language media is clear: when some helpful organisation sends you a free translation of some foreign-language article or program, do look a gift horse in the mouth, and check the translation with an independent source first. As for readers/viewers of the media in any language - caveat lector! But you no doubt already knew that.

Tuesday, May 08, 2007

Agflation

I'm a bit busy getting my core chapter ready to hand in, so just a quick post on an English word I spotted lately: agflation. The term confirms the ever-increasing productivity of "-flation" as a suffix; the phenomenon is rather alarming.

Sunday, April 29, 2007

Who has more than 40 words for camels?

Geoffrey Pullum is annoyed to hear a reporter state that "Arabic famously has over 40 terms for different types of camel" - not so much for whether it's true or not as because "they are presented as if profound and significant and clearly supportive of exoticizing claims about far-away nomadic peoples like Arabs and Eskimos, when in fact even if they were true they would be utterly unsurprising." I suppose I should point out that it is true - unsurprisingly. I don't know much more than three (Classical) Arabic words for "camel" ('ibil camels in general, jamal male camel, nāqah female camel); but people I can only describe as camel geeks have taken the trouble to post lists of terms for camels of various ages, sexes, colours, and breeds - and there appear to be 38 more terms for female camels classified by their breeding status alone, and another 14 for different Saudi camel breeds (and I'm ignoring at least another 6 lists of specialised terms for camels just on that site.) If I were a professional camel breeder or something (perish the thought!) I would no doubt know all these terms; but otherwise, who needs them?

But "Arabic famously has over 40 terms for different types of camel" is nonetheless misleading. People have a habit of thinking of technical vocabularies as aspects of a language - English has n terms for types of dog, Japanese has x terms for types of seaweed, etc. But that doesn't really work. It's not English speakers that have more than thirty terms for places of articulation; it's linguists working in a certain tradition. If they publish in a different language but studied in the same place, they'll just calque or borrow the words; if they cut their teeth on Panini or Sibawayh - in the original or in English translation - they will use a differently organised vocabulary even if they're writing in English. Likewise, an English camel breeder (if such a thing exists) will most likely just borrow the terminology of whichever region he got his camels from wholesale, as sure as an English sushi restaurant will borrow Japanese sushi terminology. Some Fulani tribes have shifted to Songhay - primarily a language of town-dwellers, with few native words for livestock types - but kept their cattle-herding lifestyle; unsurprisingly, they've also kept Fulani's enormous set of words for different types of cow, and not suddenly forgotten how to tell one cow from another. If practically every speaker of a language knows a given technical terminology, then it might make sense to view it as a property of the language; but that certainly isn't the case here.

Friday, April 27, 2007

Tamghazinut

I'm writing my core chapter at the moment on Kwarandzie (Korandje), the Northern Songhay language of Tabelbala. (Ethnologue and basic historical common sense notwithstanding, it is specifically Northern Songhay in ancestry, sharing common innovations with the language of places like In-Gall rather than the city it used to trade extensively with, Timbuktu.) It is very heavily influenced by Berber, like other Northern Songhay languages, and I found a great example the other day: the word for "old woman" is tamghazinut. Amghar is a Berber word meaning "old man"; zinu is a Songhay word meaning "old"; and ta-...-t is a Berber circumfix forming the feminine, which, even though Kwarandzie doesn't have gender agreement of any kind, seems (judging by this remarkable case) to be marginally productive as a derivational affix. (Postvocalic r is regularly lost in Korandje.)

On the map below (which I put together for my thesis using Google Earth and GIMP), you can see something of the geographic improbability of the situation: