To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 霆??耘?毋證?齬? 111010001011101100111111001111111110001111001111001111111001111101111000111001101001101000111111111010101001011100111111 e8bb3f3fe3cf3f9f78e69a3fea973f
EUC-JP 霆??耘?毋證?齬? 111100001011110100111111001111111110011011010001001111111101110111011001111010111111101000111111111100111111011100111111 f0bd3f3fe6d13fddd9ebfa3ff3f73f
UTF-8 霆면섦耘띔毋證렧齬냄 111010011001110010000110111010111010100110110100111011001000010010100110111010001000000010011000111010111001110110010100111001101010111110001011111010001010110110001001111010111010000010100111111010011011110110101100111010111000001110000100 e99c86eba9b4ec84a6e88098eb9d94e6af8be8ad89eba0a7e9bdaceb8384
UHC 霆면섦耘띔毋證렧齬냄 1110111111111101101110001110100110111100101101001110100111111100101101101110101011011001111011001111000111111011100011101011011011100101111000011011001110111111 effdb8e9bcb4e9fcb6ead9ecf1fb8eb6e5e1b3bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)