To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淨?砥?窪?淨?旨鞨??淨?砥?窪?淨?旨鞨??^ 10011111110001000011111110010011011101010011111110001100010001010011111110011111110001000011111110001110011111001110100011100000001111110011111110011111110001000011111110010011011101010011111110001100010001010011111110011111110001000011111110001110011111001110100011100000001111110011111101011110 9fc43f93753f8c453f9fc43f8e7ce8e03f3f9fc43f93753f8c453f9fc43f8e7ce8e03f3f5e
EUC-JP 淨?砥?窪?淨?旨鞨??淨?砥?窪?淨?旨鞨??^ 11011110110001100011111111000101110101100011111110110111101001100011111111011110110001100011111110111011110111011111000011100010001111110011111111011110110001100011111111000101110101100011111110110111101001100011111111011110110001100011111110111011110111011111000011100010001111110011111101011110 dec63fc5d63fb7a63fdec63fbbddf0e23f3fdec63fc5d63fb7a63fdec63fbbddf0e23f3f5e
UTF-8 淨렍砥렫窪렜淨렞旨鞨렯렞淨렍砥렫窪렜淨렞旨鞨렯렞^ 11100110101101111010100011101011101000001000110111100111101000001010010111101011101000001010101111100111101010101010101011101011101000001001110011100110101101111010100011101011101000001001111011100110100101111010100011101001100111101010100011101011101000001010111111101011101000001001111011100110101101111010100011101011101000001000110111100111101000001010010111101011101000001010101111100111101010101010101011101011101000001001110011100110101101111010100011101011101000001001111011100110100101111010100011101001100111101010100011101011101000001010111111101011101000001001111001011110 e6b7a8eba08de7a0a5eba0abe7aaaaeba09ce6b7a8eba09ee697a8e99ea8eba0afeba09ee6b7a8eba08de7a0a5eba0abe7aaaaeba09ce6b7a8eba09ee697a8e99ea8eba0afeba09e5e
UHC 淨렍砥렫窪렜淨렞旨鞨렯렞淨렍砥렫窪렜淨렞旨鞨렯렞^ 11101111111001001000111010100011111100101011001010001110101110011110100011000001100011101010111011101111111001001000111010101111111100101010100111001010111010101000111010111100100011101010111111101111111001001000111010100011111100101011001010001110101110011110100011000001100011101010111011101111111001001000111010101111111100101010100111001010111010101000111010111100100011101010111101011110 efe48ea3f2b28eb9e8c18eaeefe48eaff2a9caea8ebc8eafefe48ea3f2b28eb9e8c18eaeefe48eaff2a9caea8ebc8eaf5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)