To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 五?????沃?? 1000110011011100001111110011111100111111001111110011111110010111100000000011111100111111 8cdc3f3f3f3f3f97803f3f
EUC-JP 五??濚??沃?? 10111000110111100011111100111111100011111100100110100001001111110011111111001101111000000011111100111111 b8de3f3f8fc9a13f3fcde03f3f
UTF-8 五잓쉹濚띹빆沃뚩츖 111001001011101010010100111011001001111010010011111011001000100110111001111001101011111110011010111010111001110110111001111010111011100110000110111001101011001010000011111010111001101010101001111011001011100010010110 e4ba94ec9e93ec89b9e6bf9aeb9db9ebb986e6b283eb9aa9ecb896
UHC 五잓쉹濚띹빆沃뚩츖 111001111110100110011111111010011001101010001111111001111011100110001101111010001001010110101101111010001010101010001100111010001010111010010000 e7e99fe99a8fe7b98de895ade8aa8ce8ae90

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)