Author Archives: pilinu

Superintelligent machine translation

Let us consider superintelligence related to machine translation. To fix ideas, we can propose a rough definition: machine with the ability to translate with 99% or above accuracy from one of the 8000 languages to another. It seems relevant here … Continue reading

Posted in blog | Tagged , , , , , | Comments Off on Superintelligent machine translation

Writing differences between Corsican and Gallurese

Here are some writing differences between Corsican and Sardinian gallurese, that result from historical writing habits. These writing differences prevail, even when the words are the same: ghj is replaced by gghj: acciaghju (corsu), acciagghju (gallurese) , steel chj is … Continue reading

Posted in blog | Tagged , , , , , , , | Comments Off on Writing differences between Corsican and Gallurese

What are the conditions for a given endangered language to be a candidate for rule-based machine translation?

What are the conditions for a given endangered language to be a candidate for rule-based machine translation? For a given endangered language to be a candidate for rule-based machine translation, some requirements are in order. There is notably need for: – … Continue reading

Posted in blog | Tagged , , , , , | Comments Off on What are the conditions for a given endangered language to be a candidate for rule-based machine translation?

Quandu da la forza à la raghjoni cuntrasta Tandu vinci la forza è la raghjoni ùn basta

Quandu da la forza à la raghjoni cuntrasta Tandu vinci la forza è la raghjoni ùn basta. This is a rare Corsican proverb. In French, litterally: “Lorsque la force et la raison s’opposent, alors la force gagne car la raison … Continue reading

Posted in blog | Tagged , , , , , , , , , , , | Leave a comment

How rule-based and statistical machine translation can help each other

Here are a few suggestions on how rule-based and statistical machine translation  can help each other: (This is a follow-up to the previous post) to begin with, rule-based and statistical machine translation are often contrasted and compared: it would be … Continue reading

Posted in blog | Tagged , , , , , , | Leave a comment

Why rule-based translation is (presently) best suited to endangered languages

Here are some arguments in favor of the choice of rule-based translation concerning machine translation of endangered languages (it relates to the philosophy of language policy): there does not exist at present time a reliable corpus between the given endangered … Continue reading

Posted in blog | Tagged , , , , , , , | Leave a comment

Rough typology of remaining errors

French to Corsican: performing on French wikipedia sample test currently amounts to 93% on average. Below is a rough typology of remaining errors (presumably an average of 95% performance should be attainable on the basis of correction of ‘easy’ tagged errors): … Continue reading

Posted in blog | Tagged , | Comments Off on Rough typology of remaining errors

Enhancing French to Italian translation

Some improvements made to French to Italian translation: fixed several contractors (della, dello, …) the nice thing is that semantic disambiguation is working: ‘échecs’ = fallimenti/scacchi (failures/chess) and translates properly into scacchi

Posted in blog | Tagged , , , | Leave a comment

Very first draft on French to Italian

Now testing French to Italian translation: it is the very first draft. A rough 80%. A lot of things to fix.

Posted in blog | Leave a comment

Improvement in grammatical structures: another 100% hit

Progress on grammatical structures: some improvements to be included in future 1.2 version yield another Feigenbaum hit: 100%. In the present case, the Corsican language variety is taravese.  

Posted in blog | Comments Off on Improvement in grammatical structures: another 100% hit