To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????U | 00111111001111110011111100111111001111110011111101010101 | 3f3f3f3f3f3f55
SJIS-WIN | 踝ъЁ讌れ乙U | 11100110111101001000010010001100100001000100011011100110101001011000001011101010100010011011001101010101 | e6f4848c8446e6a582ea89b355
EUC-JP | 踝ъЁ讌れ乙U | 11101100111101101010011111101100101001111010011111101100101001111010010011101100101100101011010101010101 | ecf6a7eca7a7eca7a4ecb2b555
UTF-8 | 踝ъЁ讌れ乙U | 1110100010111000100111011101000110001010110100001000000111101000101011101000110011100011100000101000110011100100101110011001100101010101 | e8b89dd18ad081e8ae8ce3828ce4b99955
UHC | ?ъЁ?れ乙U | 0011111110101100111011001010110010100111001111111010101011101100111010111110000001010101 | 3facecaca73faaecebe055

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
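
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the page's own implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the names CHARSETS and encode_report. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches what the ISO-8859-1 and UHC rows display.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, with no separators.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_report("れU"):
        print(label, bits, hex_string)
```

Calling encode_report with a different string gives one row per charset in the same column order as the table: label, binary, hexadecimal.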