Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 獄?????音⑤? 100011011001011000111111001111110011111100111111001111111000100110111001100001110100010000111111 8d963f3f3f3f3f89b987443f
EUC-JP 獄?????音?? 1011100111110110001111110011111100111111001111110011111110110010101110110011111100111111 b9f63f3f3f3f3fb2bb3f3f
UTF-8 獄멸퀎兩볠섕音⑤젶 111001111000110110000100111010111010100110111000111011011000000010001110111011111010010110111000111010111011001110100000111011001000010010010101111010011001111110110011111000101001000110100100111011001010000010110110 e78d84eba9b8ed808eefa5b8ebb3a0ec8495e99fb3e291a4eca0b6
UHC 獄멸퀎兩볠섕音⑤젶 111010001010101110111000111010101011001110000100111001011011101110010011111001101011110010101100111010111110010110101000111010111010000010101010 e8abb8eab384e5bb93e6bcacebe5a8eba0aa

SJIS-WIN, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f).
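
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the charset labels to Python codec names (for example "cp932" for SJIS-WIN and "cp949" for UHC) is an assumption, as are the helper names `encode_row` and `convert`. Unencodable characters are replaced with "?" (3f), which appears to match the table.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("音").items():
        print(label, bits, hexstr)
```

For the single character 音, this yields e99fb3 in UTF-8, 89b9 in SJIS-WIN, b2bb in EUC-JP, and 3f in ISO-8859-1, the same byte values that appear for that character in the table rows above.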