How does language recognition work?

There is more than one way to identify the language of a text. In a small number of cases, the script itself helps, for example if it uses an alphabet unique to only one language (e.g. Greek letters for the Greek language, Georgian script for the Georgian language).

If this doesn’t help identify the language, Neticle’s own vocabulary lists come in handy, which are unique to each language. These lists contain inflected forms of words as well, and an exact match is required to consider them a hit in the text. If the majority of the words in a given text belong to the same language list, it is safe to assume that we’ve identified the language used.