answersLogoWhite

0

What languages are used in Unicode?

Updated: 10/26/2022
User Avatar

Wiki User

11y ago

Best Answer

There are very few modern languages that cannot use Unicode. Unicode can also be used from some obsolete languages as well, such as ancient Akkadian, Ugaritic, Old Persian. and so forth. The most basic Egyptian hieroglyphics will be added in the next version, 5.1 along with

The Unicode website at http://www.unicode.org/standard/WhatIsUnicode.html provides the English phrase "What is Unicode" translated into 48 languages and there are far more than that not do not appear here, for example all current American Indian languages that use a version of the Latin script or which use the Unified Canadian Aboriginal Syllables are covered by Unicode.

We are told at http://www.unicode.org/versions/Unicode5.1.0/ that Version 5.1 of Unicode due out next week "extends support for languages in Africa, India, Indonesia, Myanmar, and Vietnam, with the addition of the Cham, Lepcha, Ol Chiki, Rejang, Saurashtra, Sundanese, and Vai scripts."

Accordingly when this question appears in the Unicode FAQ at http://unicode.org/faq/basic_q.html#16 , the consortium's answer is:

"It's hard to say. Many scripts (especially Latin) are used for a very large number of languages. The easiest answer is that Unicode covers all the languages that can be written in the following scripts: Latin, Greek, Cyrillic, Armenian, Hebrew, Arabic, Syriac, Thaana, Devanagari, Bengali, Gurmukhi, Oriya, Tamil, Telugu, Kannada, Malayalam, Sinhala, Thai, Lao, Tibetan, Myanmar, Georgian, Hangul, Ethiopic, Cherokee, Canadian-Aboriginal Syllabics, Ogham, Runic, Tagalog, Hanunóo, Buhid, Tagbanwa, Khmer, Mongolian, Limbu, Tai Le, Han (Japanese, Chinese, Korean ideographs), Hiragana, Katakana, Bopomofo, Yi, Linear B, Old Italic, Gothic, Ugaritic, Deseret, Shavian, Osmanya, and Cypriot.. See also the list of Languages and Scripts. [MD]"

Note that this leaves out some of the scripts in Unicode 5.0 (such as some Philippino scripts) as well as some of the scripts to come in version 5.1. Modern languages still missing would those of small minority communities with unique scripts where no-one has come forth the present the encoding and the Unicode consortium has not got around to doing so themselves as yet. I some cases there is dispute about how a script should be encoded.

The intention is that almost very language known should be encoded in Unicode, with the exception of some modern invented languages of dubious importance. Essentially, if those using such languages don't themselves use a script different from Latin or Arabic or such, even though one may have been invented, then it is not going to be encoded. The only such script which has been rejected (for the moment) from Unicode is Klingon, because the supposed alphabet is rarely used even by those who known Klingon and was not used with the supposed proper values in the flms themsleves. This is the only that I know about where a script was rejected.

User Avatar

Wiki User

11y ago
This answer is:
User Avatar

Add your answer:

Earn +20 pts
Q: What languages are used in Unicode?
Write your answer...
Submit
Still have questions?
magnify glass
imp
Related questions

Which Indian languages are included in unicode?

All of the major languages of india and most of the minority languages are included in unicode.


What is the difference between ASCII and Unicode?

ASCII is a set of digital codes widely used as a standard fromat in the transfer of text. Unicode is an international encoding standard for used with different languages and scripts


What is different between unicode and ascii?

ASCII is a set of digital codes widely used as a standard fromat in the transfer of text. Unicode is an international encoding standard for used with different languages and scripts


What are the advantages of unicode?

Unicode is intended to account for as many languages and symbols as possible while ASCII only covers a small subset. (English and some European languages).


This coding system is designed to support international languages like Chinese and Japanese?

unicode or ansic


Coding scheme capable of representing almost all the worlds current written languages?

That sounds like a quiz question asking for the answer Unicode.


Is unicode limited to languages that do not use the English alphabet?

No. Unicode includes (or has the capability to include) every language on Earth, including English.


Coding scheme capable of representing almost all the world's current language?

Unicode is a coding scheme that can represent almost all of the world's current languages. It includes characters for a wide range of scripts, symbols, emojis, and special characters used in various languages worldwide. Unicode allows for consistent text representation across different platforms, devices, and applications.


Why was the UNICODE system invented?

The Unicode system was invented to create a universal character encoding standard that could support multiple languages and scripts. This standard allows for the representation of text in different languages and writing systems across various platforms and devices. Unicode helps to ensure consistency and interoperability in text encoding.


How many bits are in a Unicode character?

Depends on what you refer to as Unicode. Typically the ones you will see is UTF-8 which uses from up to one to three bytes per character (the two or three-byte characters are usually for characters used in various other languages that are not already covered under the ASCII codepage). Otherwise, the convention states that Unicode is UTF-16.


Preeti To Unicode?

Preeti To Unicode COnverter is one of the most widely used tool to convert nepali traditional roman font to unicode and vice versa.


Write short note on ASCII and unicode?

In computer memory, character are represented using predefined character set. Historically 7 bit American Standard Code for Information Interchange (ASCII) code, 8 bit American National Standards Institute (ANSI) code and Extended Binary Coded Decimal Interchange Code(EBCDIC) were used. These coding scheme represents selected characters into 7 or 8 bit binary code. These character schemes do not represent all the characters in all the languages in uniform format. At present Unicode is used to represent characters into the computer memory. Unicode provides universal and efficient character presentations and hence evolved as modern character representation scheme. Unicode scheme is maintained by a non-profit organization called Unicode consortium. Unicode is also compatible with other coding scheme like ASCII. Unicode use either 16 bits or 32 bits to represent a character. Unicode has capability represent characters from all the major languages in use currently across the world.