Tag Archives: machine translation

Writing differences between Corsican and Gallurese

Here are some writing differences between Corsican and Sardinian Gallurese that result from historical writing habits. These differences hold even when the words are the same: ghj is replaced by gghj: acciaghju (corsu), acciagghju (gallurese), ‘steel’; chj is …
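The one rule fully quoted in this excerpt (Corsican ghj written gghj in Gallurese) can be sketched in Python. This is only an illustration: the function name `to_gallurese_spelling` is hypothetical, and the sketch assumes the rule applies wherever ghj occurs, which the excerpt does not confirm.

```python
import re

# Assumption: the rule applies to every "ghj" that is not already doubled.
# The lookbehind keeps an existing "gghj" from becoming "ggghj".
_GHJ = re.compile(r"(?<!g)ghj")


def to_gallurese_spelling(word: str) -> str:
    """Rewrite Corsican 'ghj' as Gallurese 'gghj' (hypothetical helper)."""
    return _GHJ.sub("gghj", word)


print(to_gallurese_spelling("acciaghju"))  # acciagghju
```

Only the ghj rule is covered here, since it is the only one the excerpt gives in full.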


Quandu da la forza à la raghjoni cuntrasta Tandu vinci la forza è la raghjoni ùn basta

Quandu da la forza à la raghjoni cuntrasta Tandu vinci la forza è la raghjoni ùn basta. This is a rare Corsican proverb. In French, literally: “Lorsque la force et la raison s’opposent, alors la force gagne car la raison …


How rule-based and statistical machine translation can help each other

Here are a few suggestions on how rule-based and statistical machine translation can help each other (this is a follow-up to the previous post). To begin with, rule-based and statistical machine translation are often contrasted and compared: it would be …


Why rule-based translation is (presently) best suited to endangered languages

Here are some arguments in favor of rule-based translation for the machine translation of endangered languages (this relates to the philosophy of language policy): at present there is no reliable corpus between the given endangered …


Word-sense disambiguation: first test of new engine

Now testing the new engine with the semantically ambiguous French ‘échecs’ = fiaschi/scacchi (failures/chess). What is interesting here is that the semantic disambiguation carries over successfully into English (although the French/English engine is still in its infancy as there are still a …


A first 100%!

Now scoring 1 – 0/124 = 100%. Translated into Corsican ‘sartinesu’. Another Feigenbaum hit.
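The score above can be reproduced with a small Python sketch. It assumes that 124 is the number of words in the test text and 0 the number of translation errors; the post does not spell this out, and the function name `translation_score` is hypothetical.

```python
def translation_score(errors: int, words: int) -> float:
    """Return 1 - errors/words, the score formula used in the post.

    Assumption: `words` is the size of the test text and `errors`
    the number of mistranslated words in it.
    """
    if words <= 0:
        raise ValueError("words must be positive")
    return 1 - errors / words


print(f"{translation_score(0, 124):.0%}")  # 100%
```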


French to English: first experimental test

Now testing the translation from French to English. There are a lot of grammatical errors (roughly 75%). To mention but a few of them: adjective + noun inversion: ‘localities Alsatian’ should read ‘Alsatian localities’; date format inversion: ‘4 March’ should read: March …


French ‘fin’ followed by a year number: fixed

Tagger improvement: this issue is now fixed. French ‘l’Empire allemand’ now translates properly into l’Imperu alimanu (the German Empire). The French word ‘fin’ is now identified as a preposition when followed by a year number. The above excerpt is translated into the ‘sartinesu’ …
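The ‘fin’ rule described above could look roughly like the following Python sketch. It is not the engine's actual code: the function name `tag_fin`, the tag labels, and the assumption that a year number is a four-digit token are all hypothetical, and the sample input ‘fin 1918’ is made up for illustration.

```python
import re

# Assumption: a "year number" is a token made of exactly four digits.
YEAR = re.compile(r"\d{4}")


def tag_fin(tokens):
    """Tag French 'fin' as PREP when the next token is a year number.

    Other occurrences of 'fin' are left as UNRESOLVED, and all other
    tokens get no tag (None); a real tagger would handle them elsewhere.
    """
    tagged = []
    for i, token in enumerate(tokens):
        if token.lower() == "fin":
            following = tokens[i + 1] if i + 1 < len(tokens) else ""
            tag = "PREP" if YEAR.fullmatch(following) else "UNRESOLVED"
            tagged.append((token, tag))
        else:
            tagged.append((token, None))
    return tagged


print(tag_fin(["fin", "1918"]))  # [('fin', 'PREP'), ('1918', None)]
```

Looking only one token ahead keeps the rule cheap, at the cost of missing cases where something stands between ‘fin’ and the year.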
