Category Archives: blog

6/7: sixth open test

Today we are conducting the sixth open test. The result is 1 – (4/123) = 96.7%. There are three errors. The current, provisional average is: (98.61 + 93.75 + 93.93 + 95.34 + 99.42 + 96.74)/5 = 96.29%.

Posted in blog | Leave a comment

5/7: fifth open test

Today we are conducting the fifth open test. The result is 1 – (1/175) = 99.42%. There is only one error (level of difficulty: medium) due to the incorrect translation of the proper name: ‘del Monte’. Otherwise, several sentences are … Continue reading

Posted in blog | Leave a comment

4/7: fourth open test

Today we are conducting the fourth open test. The result is 1 – (8/172) = 95.34%. There is one error (level of difficulty: easy) due to lack of vocabulary (‘escaladées’, ‘valaisan’). There are also some errors (level of difficulty: medium) … Continue reading

Posted in blog | Leave a comment

3/7: third open test

Today we are conducting the third open test. The result is 1 – (8/132) = 93.93%. There are several errors (level of difficulty: easy) due to lack of vocabulary (‘alunissage’, ‘plaqué”). There is also an error (level of difficulty: medium) … Continue reading

Posted in blog | Leave a comment

2/7: Second open test

Today we are conducting the second open test. The result is 1 – (7/112) = 93.75%. There are several errors (level of difficulty: easy) due to lack of vocabulary (‘même’, ‘raillerie’, ‘elocution’). There is also an error (level of difficulty: … Continue reading

Posted in blog | Leave a comment

1/7: Evaluation of the translator’s current performance: start of a series of 7 open tests

We will evaluate the translator’s current performance, using a series of seven open tests. The aim is to translate the first 100 words of the article of the day from the wikipedia encyclopedia into French for seven consecutive days. Today, … Continue reading

Posted in blog | Leave a comment

500 users for the Traduttore corsu application for Android

There are now 500 users for the Traduttore corsu application for Android. There are also 200 users for the Traduttore corsu for windows application.

Posted in blog | Leave a comment

Palindroms in Corsican language

Here are a certain number of palindromes, in each of the main variants of the Corsican language: cismuntincu: apa (bee), ala (wing), anilina (aniline), elle (them), ebbe (did have), esegese (exegesis), issi (you hoist), inni (hymns), usu (usage) sartinesu: abba … Continue reading

Posted in blog | Leave a comment

Further reflexions on the status of “I love you” in Corsican language

Let us briefly recall the problem: translating ‘I love you’ might sound trivial, but it’s not. In fact, ‘ti amu‘ is not the best translation. The best translation is ‘ti tengu caru‘ when addressed to a male person, or ‘ti … Continue reading

Posted in blog | Tagged , , , , , , , , | Comments Off on Further reflexions on the status of “I love you” in Corsican language

Update to priority pairs for endangered languages

If we were to update the priorities for language pairs to be achieved, from the point of view of endangered languages, the result would be as follows: Corsican language: French to Corsican (already done) Sardinian Gallurese: Italian to Gallurese Sardinian … Continue reading

Posted in blog | Tagged , , , , , | 1 Comment

The enigmatic grammatical status of “I love you” in Corsican language

Translating ‘I love you’ might sound trivial, but it’s not. In fact, ‘ti amu‘ is not the best translation. The best translation is ‘ti tengu caru‘ when addressed to a male person, or ‘ti tengu cara‘ when addressed to a … Continue reading

Posted in blog | Tagged , , , | 1 Comment

The taxonomy optimization problem

Let us add further reflexions on the remaining 1% problem. As hinted at previously, the remaining 1% problem may only be solved by general AI (GAI). Let us sketch in a series of posts what features are required for general … Continue reading

Posted in blog | Tagged , | Leave a comment

One hundred users for the Traduttore corsu app for Android

The Traduttore corsu application for Android has now more than a hundred users. Moving on…

Posted in blog | Tagged , , | Leave a comment

More on polymorphic disambiguation…

Let’s take another look at polymorphic disambiguation. We shall consider the French word sequence ‘nombre de’. The translation into Corsican (the same goes for English and other languages) cannot be identical, because ‘number of’ can be translated in two different … Continue reading

Posted in blog | Tagged , , , | Leave a comment

Performing our first open test of the year

Let us comment on the remaining errors encountered in the above open test: French ‘carrière’ remains undisambiguated: either carriera (career) or cava (quarry): two occurrences ‘de’: French ‘de’ is perhaps the most difficult word to translate into another language, due … Continue reading

Posted in blog | Tagged , , , | Leave a comment

More on the remaining 1% problem

The analysis of the Wikipedia article of the day in French is interesting, in the sense that it sheds light on the skills that will be necessary for a machine translation system to achieve a 100% accurate translation. The error … Continue reading

Posted in blog | Tagged , | 1 Comment

More on two-sided grammar

Let us expand the idea of two-sided (from the analytic/synthetic duality standpoint) grammatical analysis: consider, for example, ‘beaucoup et souvent’ (a lot and often) in the sentence ‘il mange beaucoup et souvent’ (he eats a lot and often). Analytically, ‘beaucoup … Continue reading

Posted in blog | Tagged , , , | Leave a comment

New version on Android Playstore

The application now changes its name on the Android Playstore, and becomes “Traduttore corsu”: the name is not very original, let’s face it, but at least it is easy to understand. “Traduttore corsu” is dedicated especially to the translation from … Continue reading

Posted in blog | Tagged , , , , , , , , | Leave a comment

Some advance in polymorphic disambiguation

Just powered the new engine (prototypal, not yet transferred to the API which is used both by the current site translator and the Android application) and made a few tests: it works! Let us take an example with French ‘en … Continue reading

Posted in blog | Leave a comment

Follow-up to Autonomous MT system

Closely related to my previous post on autonomous MT systems is the article on the fact that Researchers have developed a machine-learning system capable of deciphering lost languages.

Posted in blog | Leave a comment

Autonomous MT system

Let us speculate about what could be an autonomous MT system. In the present state of MT we provide rules and dictionary to the software (rules-based translation) or we feed it with a corpus regarding a given pair of languages … Continue reading

Posted in blog | Tagged , , | 1 Comment

On the statistical/rule-based divide regarding MT

The classical divide with regard to MT separates statistical from rule-based MT. But this divide is not as clear-cut as one could think at first glance. For rule-based MT can operate statistically. Let us take an example, concerning the disambiguation … Continue reading

Posted in blog | Tagged , | Leave a comment

Priority pairs for endangered languages

Let us discuss the question of priority pairs with regard to endangered languages. It consists of the most wanted translation pairs for a given endangered language, in keeping with the main language with which it is associated. To take an … Continue reading

Posted in blog | Tagged , | 1 Comment

More on two-sided grammatical analysis

Let us give some further examples of two-sided grammatical analysis: “à dessein” (purposedly), “à volonté” (at will), “à tort” (mistakenly): from an analytical standpoint, these are prepositions followed by a singular noun. From a synthetical viewpoint, they are adverbs (adverbial … Continue reading

Posted in blog | Tagged , , | 1 Comment

Two-sided grammatical analysis

Let us call two-sided grammatical analysis the type of grammatical analysis that will be described below. Two-sided grammatical analysis contrasts with one-sided analysis, which sees a sequence of words either as a locution type (adverbial locution, verbal locution, noun locution, … Continue reading

Posted in blog | Tagged , , | 1 Comment

A hard case for disambiguation: polymorphic disambiguation

Let us investigate an issue that relates to disambiguation. It is a hard case that needs to be addressed: I shall call it in what follows, for reasons that will become clearer later, polymorphic disambiguation. Let us take an example. … Continue reading

Posted in blog | Tagged | 1 Comment

Some ethics for MT related to endangered languages

Let us sketch what could be some ethical requirements related to machine translation regarding endangered languages. Perhaps a first requirement would be: don’t publish translation pairs regarding an endangered language until the success rate has reached at least 90%. Because … Continue reading

Posted in blog | Leave a comment

Word sense disambiguation: a hard case

Let us consider a hard case for word sense disambiguation, in the context of French to Corsican MT. But the same goes for French to English MT. It relates to French words such as: ‘accomplit’, ‘affaiblit’, ‘affranchit’, ‘alourdit’, ‘amortit’. The … Continue reading

Posted in blog | Tagged , , , , , | Leave a comment

More on grammatical type disambiguation

Let us focus on grammatical type disambiguation, which is a subproblem of word disambiguation. General grammatical types are: verbs, nouns, adjectives, adverbs, prepositions, gerundive, etc. But for grammatical type disambiguation purposes, more accuracy is in order: instances of grammatical types … Continue reading

Posted in blog | Tagged , , | Comments Off on More on grammatical type disambiguation

New insight on the issue of pair reversal (updated)

The issue of pair reversal: it goes as follows: Suppose your have a given translation pair A>B that translates language A into language B, how hard is it to build the reverse pair B>A? Now the current instance of this … Continue reading

Posted in blog | Tagged , , | Comments Off on New insight on the issue of pair reversal (updated)