To what bit string is a character (or a string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ?????????? | 00111111001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN | 遏ュ雍育処竏エ閧ゥ鄒 | 1110011110011111101011011110100010110100100010001110011110001111100010001110001010001000101101001110100010000010101010011110011110111110 | e79fade8b488e78f88e288b4e882a9e7be
EUC-JP | 遏ュ雍育処竏エ閧ゥ鄒 | 1110111010100001100011101010110111110000101101101011000011101001101111011110100011100011111010001000111010110100111011111110001010001110101010011110111011000000 | eea18eadf0b6b0e9bde8e3e88eb4efe28ea9eec0
UTF-8 | 遏ュ雍育処竏エ閧ゥ鄒 | 111010011000000110001111111011111011110110101101111010011001101110001101111010001000001010110010111001011000011110100110111001111010101110001111111011111011110110110100111010011001011010100111111011111011110110101001111010011000010010010010 | e9818fefbdade99b8de882b2e587a6e7ab8fefbdb4e996a7efbda9e98492
UHC | ??雍育?????鄒 | 00111111001111111110100010111100111010111100000000111111001111110011111100111111001111111111010111011011 | 3f3fe8bcebc03f3f3f3f3ff5db

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
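
A conversion of this kind can be approximated in a few lines of Python. This is only a minimal sketch, not the code behind this page. The function name `encode_report` is hypothetical, and the mapping from the charset labels above to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. It also assumes that the `?` (0x3f) shown for unencodable characters corresponds to Python's `errors="replace"` behavior. Individual byte values may differ from the table wherever the two implementations use different mapping tables.

```python
def encode_report(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary string, hex string) for text in each charset.

    The codec names are Python's own; treating them as equivalents of the
    labels in the table above is an assumption.
    """
    rows = []
    for name in charsets:
        # Characters the charset cannot represent become "?" (byte 0x3f).
        raw = text.encode(name, errors="replace")
        # Render every byte as 8 binary digits, concatenated without spaces.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexstr in encode_report("あ"):
        print(name, bits, hexstr)
```

Because ISO-8859-1 has no Japanese characters, every such character collapses to the single byte 3f in that row, which is the same pattern the first row of the table shows.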