To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 堤?咳賂??堤?? 10010010111001110011111110001010010100001001100001000111001111110011111110010010111001110011111100111111 92e73f8a5098473f3f92e73f3f
EUC-JP 堤?咳賂??堤?? 11000100111010010011111110110011101100011100111110101000001111110011111111000100111010010011111100111111 c4e93fb3b1cfa83f3fc4e93f3f
UTF-8 堤렕咳賂렰렲堤멩렭 111001011010000010100100111010111010000010010101111001011001001010110011111010001011001110000010111010111010000010110000111010111010000010110010111001011010000010100100111010111010100110101001111010111010000010101101 e5a0a4eba095e592b3e8b382eba0b0eba0b2e5a0a4eba9a9eba0ad
UHC 堤렕咳賂렰렲堤멩렭 111100001010011110001110101010101111101010100110110101101111000110001110101111011000111010111111111100001010011110111000111001101000111010111010 f0a78eaafaa6d6f18ebd8ebff0a7b8e68eba

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)