To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 兀????????v兀????????vB 1001100101011001001111110011111100111111001111110011111100111111001111110011111101110110100110010101100100111111001111110011111100111111001111110011111100111111001111110111011001000010 99593f3f3f3f3f3f3f3f7699593f3f3f3f3f3f3f3f7642
EUC-JP 兀??縕??縕??v兀??縕??縕??vB 11010001101110100011111100111111100011111101010011000010001111110011111110001111110101001100001000111111001111110111011011010001101110100011111100111111100011111101010011000010001111110011111110001111110101001100001000111111001111110111011001000010 d1ba3f3f8fd4c23f3f8fd4c23f3f76d1ba3f3f8fd4c23f3f8fd4c23f3f7642
UTF-8 兀믢옇縕띶걙縕딉숱v兀믢옇縕띶걙縕딉숱vB 111001011000010110000000111010111010111110100010111011001001100010000111111001111011100010010101111010111001110110110110111010101011000110011001111001111011100010010101111010111001010010001001111011001000100010110001011101101110010110000101100000001110101110101111101000101110110010011000100001111110011110111000100101011110101110011101101101101110101010110001100110011110011110111000100101011110101110010100100010011110110010001000101100010111011001000010 e58580ebafa2ec9887e7b895eb9db6eab199e7b895eb9489ec88b176e58580ebafa2ec9887e7b895eb9db6eab199e7b895eb9489ec88b17642
UHC 兀믢옇縕띶걙縕딉숱v兀믢옇縕띶걙縕딉숱vB 111010001011010010010010111001001011111110111000111010001011001010001101111001011000000110000011111010001011001010001010111011111011110110100010011101101110100010110100100100101110010010111111101110001110100010110010100011011110010110000001100000111110100010110010100010101110111110111101101000100111011001000010 e8b492e4bfb8e8b28de58183e8b28aefbda276e8b492e4bfb8e8b28de58183e8b28aefbda27642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)