To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????Fh?????????Fk 00111111001111110011111100111111001111110011111100111111001111110011111101000110011010000011111100111111001111110011111100111111001111110011111100111111001111110100011001101011 3f3f3f3f3f3f3f3f3f46683f3f3f3f3f3f3f3f3f466b
SJIS-WIN ???沃?????Fh???沃?????Fk 001111110011111100111111100101111000000000111111001111110011111100111111001111110100011001101000001111110011111100111111100101111000000000111111001111110011111100111111001111110100011001101011 3f3f3f97803f3f3f3f3f46683f3f3f97803f3f3f3f3f466b
EUC-JP ???沃??縕??Fh???沃??縕??Fk 00111111001111110011111111001101111000000011111100111111100011111101010011000010001111110011111101000110011010000011111100111111001111111100110111100000001111110011111110001111110101001100001000111111001111110100011001101011 3f3f3fcde03f3f8fd4c23f3f46683f3f3fcde03f3f8fd4c23f3f466b
UTF-8 筽띰슐沃얏벤縕딉쉠Fh筽띰슐沃얏벤縕딉쉠Fk 11100111101011011011110111101011100111011011000011101100100010101001000011100110101100101000001111101100100101101000111111101011101100101010010011100111101110001001010111101011100101001000100111101100100010011010000001000110011010001110011110101101101111011110101110011101101100001110110010001010100100001110011010110010100000111110110010010110100011111110101110110010101001001110011110111000100101011110101110010100100010011110110010001001101000000100011001101011 e7adbdeb9db0ec8a90e6b283ec968febb2a4e7b895eb9489ec89a04668e7adbdeb9db0ec8a90e6b283ec968febb2a4e7b895eb9489ec89a0466b
UHC 筽띰슐沃얏벤縕딉쉠Fh筽띰슐沃얏벤縕딉쉠Fk 11101000101001001011011011101111101111011011011011101000101010101011111011100110101110101010010111101000101100101000101011101111101111011010101001000110011010001110100010100100101101101110111110111101101101101110100010101010101111101110011010111010101001011110100010110010100010101110111110111101101010100100011001101011 e8a4b6efbdb6e8aabee6baa5e8b28aefbdaa4668e8a4b6efbdb6e8aabee6baa5e8b28aefbdaa466b

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
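
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("Fh").items():
        print(label, bits, hex_string)
```

For ASCII-only input such as "Fh", every charset listed here yields the same bytes (46 68); the rows differ only for characters outside ASCII.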