To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 螂幄ュッ鬲假スォ鬲 1110010110100101100110111110100010101101101011111110100110101101100110001110111110111101101010111110100110101101 e5a59be8adafe9ad98efbdabe9ad
EUC-JP 螂幄ュッ鬲假スォ鬲 111010101010011111010110111010101000111010101101100011101010111111110010101011111101000011110001100011101011110110001110101010111111001010101111 eaa7d6ea8ead8eaff2afd0f18ebd8eabf2af
UTF-8 螂幄ュッ鬲假スォ鬲 111010001001111010000010111001011011100110000100111011111011110110101101111011111011110110101111111010011010110010110010111001011000000110000111111011111011110110111101111011111011110110101011111010011010110010110010 e89e82e5b984efbdadefbdafe9acb2e58187efbdbdefbdabe9acb2
UHC 螂幄???假??? 110101011100110011100100110000010011111100111111001111111100101010100011001111110011111100111111 d5cce4c13f3f3fcaa33f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)