Tag Archives: machine translation

Superintelligent machine translation

Let us consider superintelligence related to machine translation. To fix ideas, we can propose a rough definition: machine with the ability to translate with 99% or above accuracy from one of the 8000 languages to another. It seems relevant here … Continue reading

Posted in blog | Tagged , , , , , | Comments Off on Superintelligent machine translation

Writing differences between Corsican and Gallurese

Here are some writing differences between Corsican and Sardinian gallurese, that result from historical writing habits. These writing differences prevail, even when the words are the same: ghj is replaced by gghj: acciaghju (corsu), acciagghju (gallurese) , steel chj is … Continue reading

Posted in blog | Tagged , , , , , , , | Comments Off on Writing differences between Corsican and Gallurese

What are the conditions for a given endangered language to be a candidate for rule-based machine translation?

What are the conditions for a given endangered language to be a candidate for rule-based machine translation? For a given endangered language to be a candidate for rule-based machine translation, some requirements are in order. There is notably need for: – … Continue reading

Posted in blog | Tagged , , , , , | Comments Off on What are the conditions for a given endangered language to be a candidate for rule-based machine translation?

Quandu da la forza à la raghjoni cuntrasta Tandu vinci la forza è la raghjoni ùn basta

Quandu da la forza à la raghjoni cuntrasta Tandu vinci la forza è la raghjoni ùn basta. This is a rare Corsican proverb. In French, litterally: “Lorsque la force et la raison s’opposent, alors la force gagne car la raison … Continue reading

Posted in blog | Tagged , , , , , , , , , , , | Leave a comment

How rule-based and statistical machine translation can help each other

Here are a few suggestions on how rule-based and statistical machine translation  can help each other: (This is a follow-up to the previous post) to begin with, rule-based and statistical machine translation are often contrasted and compared: it would be … Continue reading

Posted in blog | Tagged , , , , , , | Leave a comment

Why rule-based translation is (presently) best suited to endangered languages

Here are some arguments in favor of the choice of rule-based translation concerning machine translation of endangered languages (it relates to the philosophy of language policy): there does not exist at present time a reliable corpus between the given endangered … Continue reading

Posted in blog | Tagged , , , , , , , | Leave a comment

Rough typology of remaining errors

French to Corsican: performing on French wikipedia sample test currently amounts to 93% on average. Below is a rough typology of remaining errors (presumably an average of 95% performance should be attainable on the basis of correction of ‘easy’ tagged errors): … Continue reading

Posted in blog | Tagged , | Comments Off on Rough typology of remaining errors

Download Light version for Windows

Download the light version 1.1 for Windows of OKCHAKKO TRANSLATOR French to Corsican (cismuntincu-sartinesu-taravesu) for Windows (R) version 1.1 translates French into one of the three main variants of  Corsican language (cismuntincu, sartinesu, or taravesu)    

Tagged , , , , , , , , , , , , , , , , , | Leave a comment

Word-sense disambiguation: first test of new engine

Now testing the new engine with the semantically ambiguous French ‘échecs’ = fiaschi/scacchi (failures/chess). What is interesting here is that semantic disambiguation transfers successfully into English (although the French/English engine is still in its infancy as there are still a … Continue reading

Posted in blog | Tagged , , , , , | Comments Off on Word-sense disambiguation: first test of new engine

A first 100%!

Now scoring 1 – 0/124 = 100%. Translated into Corsican ‘sartinesu’. Another Feigenbaum hit.

Posted in blog | Tagged , , | Leave a comment