To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ëð”ë ²ëð”ë ”ëïˆë  cB 1110101111110000100101001110101110100000101100101110101111110000100101001110101110100000100101001110101111101111100010001110101110100000101000000110001101000010 ebf094eba0b2ebf094eba094ebef88eba0a06342
SJIS-WIN ??????????????????cB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110001101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6342
EUC-JP ëð?ë??ëð?ë??ëï?ë??cB 1000111110101011101100111000111110101001110000110011111110001111101010111011001100111111001111111000111110101011101100111000111110101001110000110011111110001111101010111011001100111111001111111000111110101011101100111000111110101011110000010011111110001111101010111011001100111111001111110110001101000010 8fabb38fa9c33f8fabb33f3f8fabb38fa9c33f8fabb33f3f8fabb38fabc13f8fabb33f3f6342
UTF-8 ëð”ë ²ëð”ë ”ëïˆë  cB 1100001110101011110000111011000011000010100101001100001110101011110000101010000011000010101100101100001110101011110000111011000011000010100101001100001110101011110000101010000011000010100101001100001110101011110000111010111111000010100010001100001110101011110000101010000011000010101000000110001101000010 c3abc3b0c294c3abc2a0c2b2c3abc3b0c294c3abc2a0c294c3abc3afc288c3abc2a0c2a06342
UHC ?ð???²?ð??????????cB 0011111110101001101000110011111100111111001111111010100111110111001111111010100110100011001111110011111100111111001111110011111100111111001111110011111100111111001111110110001101000010 3fa9a33f3f3fa9f73fa9a33f3f3f3f3f3f3f3f3f3f6342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)