To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????LB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100110001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4c42
SJIS-WIN 治識セュ襃爾篠セ、柴治識セュ襃爾篠セォ治LB 10001110101000011000111010101111101111101010110111100101111011111000111010100010100011101100001010111110101001001000111011000100100011101010000110001110101011111011111010101101111001011110111110001110101000101000111011000010101111101010101110001110101000010100110001000010 8ea18eafbeade5ef8ea28ec2bea48ec48ea18eafbeade5ef8ea28ec2beab8ea14c42
EUC-JP 治識セュ襃爾篠セ、柴治識セュ襃爾篠セォ治LB 101111001010001110111100101100011000111010111110100011101010110111101010111100011011110010100100101111001100010010001110101111101000111010100100101111001100011010111100101000111011110010110001100011101011111010001110101011011110101011110001101111001010010010111100110001001000111010111110100011101010101110111100101000110100110001000010 bca3bcb18ebe8eadeaf1bca4bcc48ebe8ea4bcc6bca3bcb18ebe8eadeaf1bca4bcc48ebe8eabbca34c42
UTF-8 治識セュ襃爾篠セ、柴治識セュ襃爾篠セォ治LB 1110011010110010101110111110100010101101100110001110111110111101101111101110111110111101101011011110100010100101100000111110011110001000101111101110011110101111101000001110111110111101101111101110111110111101101001001110011010011111101101001110011010110010101110111110100010101101100110001110111110111101101111101110111110111101101011011110100010100101100000111110011110001000101111101110011110101111101000001110111110111101101111101110111110111101101010111110011010110010101110110100110001000010 e6b2bbe8ad98efbdbeefbdade8a583e788bee7afa0efbdbeefbda4e69fb4e6b2bbe8ad98efbdbeefbdade8a583e788bee7afa0efbdbeefbdabe6b2bb4c42
UHC 治識???爾篠??柴治識???爾篠??治LB 1111011010111101111000111101101100111111001111110011111111101100101100111110000111000110001111110011111111100011110000111111011010111101111000111101101100111111001111110011111111101100101100111110000111000110001111110011111111110110101111010100110001000010 f6bde3db3f3f3fecb3e1c63f3fe3c3f6bde3db3f3f3fecb3e1c63f3ff6bd4c42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)