To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 弔?霽?制耿??趙陌工弔?霽?制耿??趙貊? 1001001010100010001111111110100011000111001111111001000010100111111000111101010000111111001111111110011011100010111010001001100110001101010010001001001010100010001111111110100011000111001111111001000010100111111000111101010000111111001111111110011011100010111001101011101100111111 92a23fe8c73f90a7e3d43f3fe6e2e8998d4892a23fe8c73f90a7e3d43f3fe6e2e6bb3f
EUC-JP 弔?霽?制耿??趙陌工弔?霽?制耿??趙貊? 1100010010100100001111111111000011001001001111111100000010101001111001101101011000111111001111111110110011100100111011111111100110111001101010011100010010100100001111111111000011001001001111111100000010101001111001101101011000111111001111111110110011100100111011001011110100111111 c4a43ff0c93fc0a9e6d63f3fece4eff9b9a9c4a43ff0c93fc0a9e6d63f3fece4ecbd3f
UTF-8 弔렟霽렢制耿렕렟趙陌工弔렟霽렢制耿렕렟趙貊뱌 111001011011110010010100111010111010000010011111111010011001110010111101111010111010000010100010111001011000100010110110111010001000000010111111111010111010000010010101111010111010000010011111111010001011011010011001111010011001100110001100111001011011011110100101111001011011110010010100111010111010000010011111111010011001110010111101111010111010000010100010111001011000100010110110111010001000000010111111111010111010000010010101111010111010000010011111111010001011011010011001111010001011001010001010111010111011000110001100 e5bc94eba09fe99cbdeba0a2e588b6e880bfeba095eba09fe8b699e9998ce5b7a5e5bc94eba09fe99cbdeba0a2e588b6e880bfeba095eba09fe8b699e8b28aebb18c
UHC 弔렟霽렢制耿렕렟趙陌工弔렟霽렢制耿렕렟趙貊뱌 1111000011000000100011101011000011110000101110001000111010110011111100001010010011001100111010101000111010101010100011101011000011110000111000011101100011101000110011011110111111110000110000001000111010110000111100001011100010001110101100111111000010100100110011001110101010001110101010101000111010110000111100001110000111011000111001111011100111110010 f0c08eb0f0b88eb3f0a4ccea8eaa8eb0f0e1d8e8cdeff0c08eb0f0b88eb3f0a4ccea8eaa8eb0f0e1d8e7b9f2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)