To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???瓦?????搖?????瓦?????蜈??B 0011111100111111001111111000101010100010001111110011111100111111001111110011111110011101100010100011111100111111001111110011111100111111100010101010001000111111001111110011111100111111001111111110010110000101001111110011111101000010 3f3f3f8aa23f3f3f3f3f9d8a3f3f3f3f3f8aa23f3f3f3f3fe5853f3f42
EUC-JP 縕??瓦??縕??搖??縕??瓦??縕??蜈??B 10001111110101001100001000111111001111111011010010100100001111110011111110001111110101001100001000111111001111111101100111101010001111110011111110001111110101001100001000111111001111111011010010100100001111110011111110001111110101001100001000111111001111111110100111100101001111110011111101000010 8fd4c23f3fb4a43f3f8fd4c23f3fd9ea3f3f8fd4c23f3fb4a43f3f8fd4c23f3fe9e53f3f42
UTF-8 縕됵슴瓦븝슬縕됵슴搖억쉠縕됵슴瓦븝슬縕됵슴蜈랃쉐B 11100111101110001001010111101011100100001011010111101100100010101011010011100111100100111010011011101011101110001001110111101100100010101010110011100111101110001001010111101011100100001011010111101100100010101011010011100110100100001001011011101100100101101011010111101100100010011010000011100111101110001001010111101011100100001011010111101100100010101011010011100111100100111010011011101011101110001001110111101100100010101010110011100111101110001001010111101011100100001011010111101100100010101011010011101000100111001000100011101011100111101000001111101100100010011001000001000010 e7b895eb90b5ec8ab4e793a6ebb89dec8aace7b895eb90b5ec8ab4e69096ec96b5ec89a0e7b895eb90b5ec8ab4e793a6ebb89dec8aace7b895eb90b5ec8ab4e89c88eb9e83ec899042
UHC 縕됵슴瓦븝슬縕됵슴搖억쉠縕됵슴瓦븝슬縕됵슴蜈랃쉐B 11101000101100101000100111101111101111011011111111101000101111111011101011101111101111011011110111101000101100101000100111101111101111011011111111101000111101001011111011101111101111011010101011101000101100101000100111101111101111011011111111101000101111111011101011101111101111011011110111101000101100101000100111101111101111011011111111101000101001011000110111101111101111011010011001000010 e8b289efbdbfe8bfbaefbdbde8b289efbdbfe8f4beefbdaae8b289efbdbfe8bfbaefbdbde8b289efbdbfe8a58defbda642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)