This class can identify predominant character set in a string.
It can take a string of text in UTF-8 and analyzes the character codes to determine which is the predominant character set that the is used based on the frequency of the characters that are typically of certain languages.
Currently it can identify the character sets of Latin, Greek, Cyrillic. Armenian, Hebrew, Arabic, Devanagari, Bengali, Gujarati, Tamil, Malayalam, Sinhala, Thai, Lao, Tibetan, Burmese, Georgian, Korean, Khmer, Japanese, and CJK.
A string in Unicode may contain text of multiple character sets.
This class can identify predominant character set in a string of many possible character sets.