What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 誤??韋?????檍??宥??蟻??億??陰 1000110011101011001111110011111111101000111010000011111100111111001111110011111100111111100111101111100000111111001111111001011101000111001111110011111110001011011000010011111100111111100010011010110100111111001111111000100101000001 8ceb3f3fe8e83f3f3f3f3f9ef83f3f97473f3f8b613f3f89ad3f3f8941
EUC-JP 誤??韋??洧??檍??宥??蟻??億??陰 10111000111011010011111100111111111100001110101000111111001111111000111111000111101101000011111100111111110111001111101000111111001111111100110110101000001111110011111110110101110000100011111100111111101100101010111100111111001111111011000110100010 b8ed3f3ff0ea3f3f8fc7b43f3fdcfa3f3fcda83f3fb5c23f3fb2af3f3fb1a2
UTF-8 誤곸룆韋귝궇洧룸짎檍됰콅宥욇쉬蟻숇쾴億됱뼇陰 111010001010101010100100111010101011001110111000111010111010001110000110111010011001111110001011111010101011011110011101111010101011011010000111111001101011010010100111111010111010001110111000111011001010011110001110111001101010101010001101111010111001000010110000111011001011110110000101111001011010111010100101111011001001101010000111111011001000100110101100111010001001111110111011111011001000100010000111111011001011111010110100111001011000010010000100111010111001000010110001111010111011110010000111111010011001100110110000 e8aaa4eab3b8eba386e99f8beab79deab687e6b4a7eba3b8eca78ee6aa8deb90b0ecbd85e5aea5ec9a87ec89ace89fbbec8887ecbeb4e58484eb90b1ebbc87e999b0
UHC 誤곸룆韋귝궇洧룸짎檍됰콅宥욇쉬蟻숇쾴億됱뼇陰 1110100010100110100000011110110010001111100001011110101011011111100000101110011010000010101000001110101011111011101101111110101110100011100110101110010111100101100010011110101110110001100000011110101011101001100111101110100110111101101011001110101111111100100110011110101110110010100010101110010111100010100010011110110010010110100100011110101111100100 e8a681ec8f85eadf82e682a0eafbb7eba39ae5e589ebb181eae99ee9bdacebfc99ebb28ae5e289ec9691ebe4

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
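
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation: the function name encode_table and the mapping from the table's charset labels to Python codec names are assumptions (in particular, SJIS-WIN is mapped to "cp932" and UHC to "cp949"). Characters a charset cannot represent are replaced with "?" (0x3f), which is assumed to mirror the 3f bytes seen in the table.

```python
# Map the table's charset labels to Python codec names (assumed mapping).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("誤"):
        print(label, bits, hex_string)
```

For the single character 誤, the UTF-8, SJIS-WIN, and EUC-JP rows should begin with the same bytes as the corresponding rows in the table (e8aaa4, 8ceb, and b8ed).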