To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN ?撤呈??逕糾 0011111110010011010100001001001011100110001111110011111111100111100101001000101110001010 3f935092e63f3fe7948b8a
EUC-JP ?撤呈??逕糾 0011111111000101101100011100010011101000001111110011111111101101111101001011010111101010 3fc5b1c4e83f3fedf4b5ea
UTF-8 뤋撤呈쨵샅逕糾 111010111010010010001011111001101001001010100100111001011001000110001000111011001010100010110101111011001000001110000101111010011000000010010101111001111011001110111110 eba48be692a4e59188eca8b5ec8385e98095e7b3be
UHC 뤋撤呈쨵샅逕糾 1000111110111011111101001100110011101111110100001010010010001111101110111111010011001100111011111101000010101100 8fbbf4ccefd0a48fbbf4ccefd0ac

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)