To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert".


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 鄭??醫?鄭??醫?[鄭??醫?鄭??醫?[^ 10010011010000010011111100111111111001111100111000111111100100110100000100111111001111111110011111001110001111110101101110010011010000010011111100111111111001111100111000111111100100110100000100111111001111111110011111001110001111110101101101011110 93413f3fe7ce3f93413f3fe7ce3f5b93413f3fe7ce3f93413f3fe7ce3f5b5e
EUC-JP 鄭??醫?鄭??醫?[鄭??醫?鄭??醫?[^ 11000101101000100011111100111111111011101101000000111111110001011010001000111111001111111110111011010000001111110101101111000101101000100011111100111111111011101101000000111111110001011010001000111111001111111110111011010000001111110101101101011110 c5a23f3feed03fc5a23f3feed03f5bc5a23f3feed03fc5a23f3feed03f5b5e
UTF-8 鄭얘ㄴ醫렫鄭얘ㄴ醫렫[鄭얘ㄴ醫렫鄭얘ㄴ醫렫[^ 111010011000010010101101111011001001011010011000111000111000010010110100111010011000011010101011111010111010000010101011111010011000010010101101111011001001011010011000111000111000010010110100111010011000011010101011111010111010000010101011010110111110100110000100101011011110110010010110100110001110001110000100101101001110100110000110101010111110101110100000101010111110100110000100101011011110110010010110100110001110001110000100101101001110100110000110101010111110101110100000101010110101101101011110 e984adec9698e384b4e986abeba0abe984adec9698e384b4e986abeba0ab5be984adec9698e384b4e986abeba0abe984adec9698e384b4e986abeba0ab5b5e
UHC 鄭얘ㄴ醫렫鄭얘ㄴ醫렫[鄭얘ㄴ醫렫鄭얘ㄴ醫렫[^ 11101111111101111011111011101010101001001010010011101100101000101000111010111001111011111111011110111110111010101010010010100100111011001010001010001110101110010101101111101111111101111011111011101010101001001010010011101100101000101000111010111001111011111111011110111110111010101010010010100100111011001010001010001110101110010101101101011110 eff7beeaa4a4eca28eb9eff7beeaa4a4eca28eb95beff7beeaa4a4eca28eb9eff7beeaa4a4eca28eb95b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
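
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper name `encode_all` is hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes visible in the rows above, although the exact replacement behavior for individual characters may differ from the page's converter.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_all("鄭[^"):
        print(label, bits, hex_string)
```

ASCII characters such as `[` and `^` encode to the same single bytes (`5b`, `5e`) in all five charsets, which is why every row of the table ends in `5b5e`.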