- Superintelligent machine translation
- Writing differences between Corsican and Gallurese
- What are the conditions for a given endangered language to be a candidate for rule-based machine translation?
- Quandu da la forza à la raghjoni cuntrasta Tandu vinci la forza è la raghjoni ùn basta
- How rule-based and statistical machine translation can help each other
- Why rule-based translation is (presently) best suited to endangered languages
- Rough typology of remaining errors
- Enhancing French to Italian translation
- Very first draft on French to Italian
- Improvement in grammatical structures: another 100% hit
- Percentage of ambiguous words in French sentence (from French to Corsican translation perspective)
- Disambiguation of two consecutive ambiguous words: ‘plusieurs mois’
- Disambiguation of ‘vie’
- A virsioni 1.1 hè dispunibuli
- Light version 1.1 is available
Italian source text
Tagsadjective accordance cismuntincu conjugation Corse Corsica Corsican 'sartinesu' Corsican language corsu dependency parser dependency parsing disambiguation false positive Feigenbaum hit Feigenbaum test francese-corsu francesu-corsu français-corse French into Corsican French to Corsican French to English gaddhuresu gallurese language Italian Italian language langue corse machine translation numbers grammatical type past participe accordance preposition reference reference language shift rule-based machine translation sartinesu self-reference semantic disambiguation statistical machine translation taravesu traduction traduction automatique traduttore traduttori translation corpora translation corpus translator word-sense dismbiguation
A jeweler examines an emerald. "Aha," he says, "another green emerald. In all my years in this business, I must have seen thousands of emeralds, and every one has been green." We think the jeweler reasonable to hypothesize that all emeralds are green. Next door is another jeweler having equally comprehensive experience with emeralds. He speaks only the Choctaw Indian language. Color distinctions are not as universal as might be thought. The Choctaw Indians made no distinction between green and blue—the same words applied to both. The Choctaws did make a linguistic distinction between okchamali, a vivid green or blue, and okchakko, a pale green or blue. The Choctaw-speaking jeweler says: All emeralds are okchamali. He maintains that all his years in the jewelry business confirm this hypothesis. (William Poundstone, Labyrinths of reason)
The Corsican language is currently considered by Unesco as a "definitely endangered language". This site's aim is to help reviving the Corsican language by providing translation into Corsican. It translates French and Italian into one of the three main Corsican variants: 'cismuntincu', 'sartinesu' or 'taravesu'.
Most illustrations are from Wiki Commons
Category Archives: blog
What is the average percentage of ambiguous words in a French sentence (from a French to Corsican translation perspective). In the above example, this percentage amounts to 20/99 words = approximately 20%. Not all semantic ambiguities are taken into account here, so … Continue reading
Testing improved disambiguation engine. This is a special case of disambiguation of two consecutive ambiguous words. French ‘au terme de plusieurs mois’ translates into à u capu di parechji mesa (at the end of several months) in Corsican (taravese … Continue reading
We face here a special case of disambiguation: ‘un général byzantin du vie siècle’ (a Byzantine general of the sixth century) should translate: un generali bizantinu di u 6esimu seculu. French ‘vie’ is ambiguous between vita and 6esimu or VIesimu (life/sixth). In … Continue reading
Okchakko Traduttori: a virsioni 1.1 hè dispunibuli. Ci sò i nuvità: – traduci da u francesu à i trè varietà maestri di a lingua corsa: cismuntincu, sartinesu, taravesu – migliuramentu riguardu à u schidariu d’aiutu – migliuramentu riguardu à l’elisioni … Continue reading
Light version 1.1 is available. New features: translates from French to one of the three main variants of Corsican language: cismuntincu, sartinesu, taravesu some improvements made to the help file improvements on elision additional vocabulary
Here is a series of words that are masculine in French and feminine in Corsican language (taravese variant).
French to Corsican (taravese variant): here is a list of words that are changing from feminine to masculine.
Now beginning to handle adjective order in French to English translation: ‘un peintre russe juif’: un pittori russiu ghjudeiu (a Russian Jewish painter)
Now testing the new engine with the semantically ambiguous French ‘échecs’ = fiaschi/scacchi (failures/chess). What is interesting here is that semantic disambiguation transfers successfully into English (although the French/English engine is still in its infancy as there are still a … Continue reading
Now it is patent that there cannot be successful Feigenbaum test (i.e. not only occasional Feigenbaum hits, but regular and average performance) without an adequate treatment of semantic disambiguation. Arguably, it is one hard problem of machine translation. Here are some … Continue reading