To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 歪??昻?ぇ節 1001100001100011001111110011111111111010110100000011111110000010101001011001000011011111 98633f3ffad03f82a590df
EUC-JP 歪????ぇ節 11001111110001000011111100111111001111110011111110100100101001111100000011100001 cfc43f3f3f3fa4a7c0e1
UTF-8 歪귝깄昻섋ぇ節 111001101010110110101010111010101011011110011101111010101011100110000100111001101001100010111011111011001000010010001011111000111000000110000111111001111010111110000000 e6adaaeab79deab984e698bbec848be38187e7af80
UHC 歪귝깄昻섋ぇ節 1110100011100000100000101110011010000011100001011110010011101001100110001110100010101010101001111110111110111101 e8e082e68385e4e998e8aaa7efbd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)