To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 五?????沃? 10001100110111000011111100111111001111110011111100111111100101111000000000111111 8cdc3f3f3f3f3f97803f
EUC-JP 五??濚??沃? 101110001101111000111111001111111000111111001001101000010011111100111111110011011110000000111111 b8de3f3f8fc9a13f3fcde03f
UTF-8 五욁꺈濚띹빆沃뚩 111001001011101010010100111011001001101010000001111010101011101010001000111001101011111110011010111010111001110110111001111010111011100110000110111001101011001010000011111010111001101010101001 e4ba94ec9a81eaba88e6bf9aeb9db9ebb986e6b283eb9aa9
UHC 五욁꺈濚띹빆沃뚩 11100111111010011001111011100011100000111010111111100111101110011000110111101000100101011010110111101000101010101000110011101000 e7e99ee383afe7b98de895ade8aa8ce8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)