Tag Archives: machine translation

Rough typology of remaining errors (updated march 2018)

French to Corsican: performing on French wikipedia sample test currently amounts to 94% on average. Below is a rough typology of remaining errors (presumably an average scoring of 95% on the open test should be attainable on the basis of correction … Continue reading

Posted in blog | Tagged , | Leave a comment

Evaluation of machine translation: why not self-evaluation?

Evaluation of machine translation is usually done via external tools (to cite some instances: ARPA, BLEU, METEOR, LEPOR, …). But let us investigate the idea of self-evaluation. For it seems that the software itself is capable of having an accurate … Continue reading

Posted in blog | Tagged , , , , | Leave a comment

Semantic disambiguation of French ‘femme’: in the mud, gold is still shining

  In Corsican language, French word ‘femme’ can be translated, depending on the context either into donna (woman) or into moglia (wife) The above sample still contains a lot of vocabulary and grammatical disambiguation errors (easy/medium difficulty), but it handles … Continue reading

Posted in blog | Tagged , , , | Leave a comment

A Special Case of Anaphora Resolution

Anaphora resolution usually refers to pronouns. But we face here a special case of anaphora resolution that relates to an adjective. The following sentence: ‘un vase de Chine authentique’ (an authentic vase of China) is translated erroneously as un vasu … Continue reading

Posted in blog | Tagged , , , , , | Leave a comment

Four consecutive ambiguous words

Translating the following sentence: ‘ce fait est unique’ is not as easy as it could seem at first glance. In effect, it is made up of four consecutive ambiguous words: ‘ce’: ‘ssu (demonstrative pronoun, this) or ciò (it, relative pronoun) … Continue reading

Posted in blog | Tagged , , | Leave a comment

What are the conditions for a given endangered language to be a candidate for rule-based machine translation?

What are the conditions for a given endangered language to be a candidate for rule-based machine translation? For a given endangered language to be a candidate for rule-based machine translation, some requirements are in order. There is notably need for: a … Continue reading

Posted in blog | Tagged , , , , , , , , , | Comments Off on What are the conditions for a given endangered language to be a candidate for rule-based machine translation?

Solving fivefold ambiguity: translation for French ‘poste’

French word ‘poste’ has (at least) fivefold ambiguity. For it can designate: ‘poste’ (masculine singular noun) : postu, masculine singular noun (set, i.e. television set) ‘poste’ (masculine singular noun): posta, feminine singular noun (position): erroneously translated as postu in the present case … Continue reading

Posted in blog | Tagged , , , | Leave a comment

Another case of firstname ambiguity: ‘Noël’

Translation of the French word ‘Noël’ yields another case of ambiguity. For ‘Noël’ can translate: either into Natali (Christmas, Christmas Day): the annual festival commemorating Jesus Christ’s birth or into, identically, Natali (‘Noel‘): the firstname Now it seems there is no case of … Continue reading

Posted in blog | Tagged , , , , | Leave a comment