To which bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 螳溥∪辷オ螳滓万 111001011010111010011111111011101000000110111110111001111000100010110101111001011010111010011111111001101001011010011100 e5ae9fee81bee788b5e5ae9fe6969c
EUC-JP 螳溥∪辷オ螳滓万 11101010101100001101111011110000101000101100000011101101111010001000111010110101111010101011000011011110111010001100101111111100 eab0def0a2c0ede88eb5eab0dee8cbfc
UTF-8 螳溥∪辷オ螳滓万 111010001001111010110011111001101011101010100101111000101000100010101010111010001011111010110111111011111011110110110101111010001001111010110011111001101011101110010011111001001011100010000111 e89eb3e6baa5e288aae8beb7efbdb5e89eb3e6bb93e4b887
UHC 螳溥∪??螳滓万 1101001111011001110111011010101010100001111110100011111100111111110100111101100111101110101010111101100010110010 d3d9ddaaa1fa3f3fd3d9eeabd8b2

SJIS-WIN, EUC-JP: legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
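
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page: the helper name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the ISO-8859-1 row above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Hypothetical helper: unencodable characters become '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    # Print one row per charset for a sample character.
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("あ", codec)
        print(label, binary, hexadecimal)
```

Each byte is rendered as exactly eight bits, so the binary string is always four times as long as the hexadecimal one.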