Jabal al-Lughat: Chinese

Showing posts with label Chinese. Show all posts

Saturday, February 09, 2019

Abdurehim Heyit's "Mother Tongue"

While I was doing my PhD at SOAS, I found myself one term helping teach a field methods class focusing on Uyghur, a Turkic language closely related to Uzbek spoken in Xinjiang in far western China (textbook here). At the time, as far as I gathered, it was a sleepy borderland region; these days, it's best known for the massive reeducation camps into which the Chinese government has thrown a substantial proportion of the population, in what appears to be an ambitious effort to eradicate their language, culture, and religion. ("Kill the Indian to save the man" was the American version.) Today, it's being reported that the talented Uyghur musician Abdurehim Heyit (ئابدۇرېھىم ھېيىت, equivalent to Arabic عبد الرحيم عيد), from Kashgar, died in detention at the age of 55, after two years in the camps. [UPDATE: It now seems that he's alive and still being imprisoned without trial.]

One of his best-known songs, originally a poem by Qutluq Shewqi, is a good fit for this blog: Ana til (ئانا تىل), "Mother Tongue" (lyrics, translation). When he sang it, language was still a relatively politically acceptable element of Uyghur identity to emphasise; traditional Communist Party policy for officially recognised ethnic minorities emphasised development of their languages. Now, with hundreds of thousands of people arbitrarily imprisoned, the rapid loss of language rights is the least of anyone's worries.

ئانا تىل بىلگەن كىشىنىڭ ئىززىتىن قىلغۇم كەلۇر،	I salute the people who speak my mother tongue,
ئانا تىلنى ئاغزىدىن ئالتۇن بەرىپ ئالغۇم كەلۇر.	I am willing to pay in gold for the words they speak.
بۇ ئانا تىل بولسا گەر ئامەرىكا-يۇ ئافرىقىدا،	Wherever my mother tongue is found, be it Africa or America,
سەرپ ئەتىپ مىڭلارچە تىللا ئاندى مەن بارغۇم كەلۇر.	I would go there, whatever the cost and expense.
ئانا تىل بىلگەن كىشىنىڭ ئىززىتىن قىلغۇم كەلۇر،	I salute the people who speak my mother tongue,
ئانا تىلنى ئاغزىدىن ئالتۇن بەرىپ ئالغۇم كەلۇر.	I am willing to pay in gold for the words they speak.
ئەي ئانا تىل بىزگە سەن قالغان ئۇلۇغلاردىن نىشان،	Oh, my mother tongue, you are the sacred bequest to us from our great ancestors,
سەن بىلەن روھىي زىمىندا ئىپتىخارلانغۇم كەلۇر.	With you, I desire to share my pride in you in the spiritual world.

Sunday, May 10, 2015

How to remember numerals better

In all the debate around "Whorfian" effects of language on cognition, one relatively well-known case has received oddly little attention among linguists, despite being widely discussed by psychologists and popularised by Malcolm Gladwell: the effect of word length on short-term memory (Baddeley et al. 1975). Basically, all other things being equal, it's easier to remember a sequence of short words than a sequence of long words. This suggests that our short-term memory for words (what psychologists confusingly call phonological memory) has a capacity limited by length - specifically, the amount that can be pronounced in about 2 seconds (Schweickert & Boruff 1987). That should suggest, in particular, that numbers presented orally will be easier to remember in a language with short numerals than in one with long numerals. (Note that this affects, among other things, IQ test results, since IQ tests typically include tests of numeral recall.)

Psychologists followed up on this by attempting to test this hypothesis with a number of language pairs (for an overview, see Baddeley (1997). Disclaimer: I'm not a psycholinguist, and the following references are certainly not exhaustive). The best-tested and most consistent result concerns Chinese. Mandarin and Cantonese numerals take shorter to say than English ones, and a number of psychologists have accordingly confirmed that Chinese speakers can remember longer numerals than English speakers (Stigler, Lee, & Stevenson (1986), Hoosain & Salili (1987)), even at 4 years old Chen and Stevenson (1988)), and that this applies even when bilinguals are tested across their two languages (Hoosain 1979). It goes further than that, in fact: Chincotta & Underwood (1997) find that, out of Cantonese, English, Greek, Finnish, Swedish, and Spanish, only Cantonese speakers remember significantly more digits than speakers of other languages - and that this difference disappeared if the subjects were prevented from rehearsing the numbers auditorily by being asked to keep repeating "la-la" while being tested, proving its linguistic nature. The difference ranges around 2 digits, with the exact figure depending on the experiment.

Data for other languages is less clearcut. Welsh numerals take longer to say in isolation than English ones, and Ellis & Hennelly (1986) accordingly found that English-Welsh bilinguals can on average remember longer numerals in English than Welsh. Naveh-Benjamin & Ayres (1986) simultaneously tested the hypothesis for university students in Israel speaking English, Spanish, Arabic, and Hebrew natively (but excluding the digits "seven" and "zero"). They found that the average number of digits recalled was highest in English (7.21), followed by Hebrew (6.51), then Spanish (6.37), and lowest in Arabic (5.77); the ordering by average number of syllables per digit, or by average time taken to read a digit, was English, Spanish, Hebrew, Arabic. However, the difference in number of digits recalled was smaller than predicted by the time taken to read a digit in each language, suggesting that other factors were also relevant.

A proviso is necessary: some recent work, without disputing the differences observed, has made a strong case that they relate not simply to length ( Lovatt, Avons, & Masterson 2000), but crucially to phonological factors (Service 2010, Lethbridge, Hinton & Nimmo 2002). This has been argued for Welsh numerals vs. English ones by Murray & Jones (2002), who find that Welsh digits take longer to say in isolation but actually take less time to say in connnected speech than English ones, and that changes of place of articulation at word boundaries negatively affect memory.

The research is curiously selective in terms of languages examined, and many of the experiments don't control for all possible confounding factors, such as diglossia and social status in the case of Welsh or Arabic. Nevertheless, it does at least seem well-established that speaking Chinese gives a short-term digit memory advantage over speaking major European or Semitic languages. So, if for some reason you regularly need to remember long numerals, and your preferred language doesn't happen to be Chinese, how do you compensate for this handicap?

There are two obvious ways to get around this (assuming you care enough about remembering numerals to want to, which depends very much on your tastes and circumstances). One is to remember the number visually (as a sequence of written digits) or even kinesthetically (as a sequence of typing actions), in which case this particular constraint no longer applies (cf. eg Olsthoorn, Andriga, & Hulstijn 2012). This only helps, however, if you remember numerals better visually or kinesthetically than auditorily, and my impression is that most people don't.

A probably more helpful alternative is to establish a code that lets you turn long numerals into much shorter words by identifying digits with single letters or single phonemes. This solution has a very long history in Arabic and Hebrew, in which each letter of the alphabet can be used to represent a digit: 'a is 1, b is 2, etc. (the first 9 digits are units, the second 10 are tens, and the rest are hundreds). Since short vowels are not letters, the resulting word can be given whatever vowels the user sees fit to give it. A common game of later poets using the Arabic script was to encode the date of their poem within the poem as a chronogram; more practically, Moroccan schoolchildren used to memorise the multiplication tables as a series of meaningless words formed by this encoding (Meakin 1905). Chronograms have been formed using Roman numerals, but for memorisation, at least, they are rather ill-adapted to such a system - think how much padding would be required to turn a number like MDCCCLXXXIII into words.

However, the spread of Hebrew studies in Western Europe following the Renaissance, and the increasing importance of memorising statistics there, encouraged European mnemonists to look for ways of emulating this encoding without having to learn a Semitic language. Doing so at a time when place notation was widely used, they introduced a crucial improvement: each consonant represented a digit in a place notation system, rather than a number in an additive notation system. After various cumulative efforts at improvement, this culminated in the early 19th century with the so-called Major system: 0=s/z, 1=t/d, 2=n, 3=m, 4=r, 5=l, 6=š/ž/č/j, 7=k/g, 8=f/v, 9=p/b, with vowels, semivowels, and laryngeals ignored. To remember 94801 (LACITO's zip code), for example, one would turn it into "professed". This system apparently remains in use among professional mnemonists to this day, despite being virtually unknown to wider society.

Perhaps this is why linguists haven't paid more attention to the word-length effect in the context of the Whorfian debate: it's a clear-cut effect of language on cognition, but not a very profound one, in that it should be fixable by some very simple hacks (or even just by borrowing some one else's numerals). But I'm not aware of any experimental work testing the effect of this particular hack on digit recall...

Friday, September 27, 2013

Minkaohan - a Chinese word Algerians need

I read a fascinating and depressing article recently (The Strangers) - in which a linguist plays a lead role - about the worsening situation in Xinjiang. The author makes comparisons to Algeria at one point, but not for the following, which will surely strike a chord in anyone familiar with North African educational policy:

"Among the Uighur, however, the policy has created two distinct groups: the minkaohan, minorities educated in Mandarin, and the minkaomin, educated in their own language. Minkaomin education is not taken seriously by non-Uighur employers, and not speaking Mandarin shuts minkaomin graduates out of jobs. In turn, they often resent minkaohan students as opportunistic and unfaithful to their own heritage."

There seems to be a fair amount of scholarship on this issue, judging from a quick skim. The minkaohan have been analysed as a "hybrid identity", sometimes feeling a "sense of shame regarding their ethnic background" and often seen by their minkaomin peers as irreligious or potentially disloyal - but, of course, ambitious parents who want their children to be middle-class often see minkaohan education as the only way forward. Chinese is required for university, although 82% of Uyghur adults can't read Chinese, and students often have difficulty adjusting to the Chinese-speaking world of the university.

Sounds like a remarkably effective way to exacerbate social tensions, right? The irony is that, in North Africa, both governments and employers expanded or even created a very similar system after independence!

It hasn't escaped the Chinese government's notice that this is problematic, so they're addressing the problem by cutting way down on Uyghur teaching, in the hope of eventually making everyone "minkaohan": "'bilingual' classes in many areas have already developed from using Mandarin to teach math, physics, and chemistry to the new model of using Mandarin for all classes except for mother-tongue [language arts] classes." North Africa hasn't quite reached that second stage for Arabic, I'm glad to say - although that's actually the best it's ever managed for Berber - but that "solution" does have some proponents.

It's often been noted that Chinese has contributed surprisingly few loans to English. I think I'd nominate "minkaomin" and "minkaohan" for borrowing: they have no commonly used English equivalent, and are relevant to describing post-colonial situations in many countries.

Saturday, February 23, 2013

"Ya-chü-lo" (Kufa) and other confusing transcriptions

Translators of Chinese histories generally have a very inconvenient habit of transcribing foreign names according to the modern Mandarin pronunciation of the characters used. Mandarin is almost the least suitable of all the modern Chinese topolects for this purpose, since it's lost all the final stops, turned final m to n, and palatalised initial velars. Cantonese or Hakka forms would at least be more helpful; but reconstructions exist now, and those could be given directly.

A brief passage in Tongdian, a Tang dynasty encyclopedia, reports a Chinese soldier's impressions of life in early Abbasid Iraq (where he found himself captive after the Battle of Talas): Tongdian, chapter 193. A translation, taken from Hoyland (1997), is available online: T'ung tien. Unlike many of the other foreign accounts of early Islamic history that Hoyland gathers, which tend towards the bitter, this text is in parts rather charming, particularly where based on first-hand observation. However, the transliterations are as unhelpful as usual. Here are the appropriate Middle Chinese forms, based for convenience on Starostin's database, with y substituted for j:

T'a-shih: 大食 thầyźik, a reasonable transcription of Persian Tājīk, itself originally from Arabic Ṭā'ī (member of the tribe of Ṭayy) - see Language Hat)

mo-shou: 摩首 mwâśǝ́w – no idea what this alleged title of the caliphs might be; probably not Arabic, so maybe Persian? Any ideas?

Po-ssu: 波斯 pwâsye "Persia", presumably from Pārs.

Fulin: 拂菻 phütlim "Byzantium", apparently somehow from Armenian Rhôm (ie "Rome").

Ya chü-lo: 亞俱羅 ʔạ̀külâ "Kūfah", a pretty good transcription of the town's Syriac name, ʕAqūlā.

mumen: 暮門 mòmon (given as a title of the caliph), ie Arabic mu'min "believer"; the vowel choice in the second syllable is interesting, suggesting that the short i was already pronounced in a rather schwa-like way.

Shan: 苫, ie Arabic šām "Syria, the Levant, Damascus". This character isn't in Starostin's list, but its Korean and Vietnamese readings make it clear that the final letter was m, not n (CJKV-English dictionary).