What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN ?瑟品粃??お 0011111111100000111011001001010101101001111000101110000100111111001111111000001010101000 3fe0ec9569e2e13f3f82a8
EUC-JP ?瑟品粃??お 0011111111100000111011101100100111001010111001001110001100111111001111111010010010101010 3fe0eec9cae4e33f3fa4aa
UTF-8 룵瑟品粃룴쵍お 111010111010001110110101111001111001000110011111111001011001001110000001111001111011001010000011111010111010001110110100111011001011010110001101111000111000000110001010 eba3b5e7919fe59381e7b283eba3b4ecb58de3818a
UHC 룵瑟品粃룴쵍お 1000111110101010111000111010001011111001101000011101110111111011100011111010100110101100100011111010101010101010 8faae3a2f9a1ddfb8fa9ac8faaaa
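The same kind of table can be approximated offline. The following is a minimal sketch in Python, not the converter's own implementation: the helper names `encode_row` and `convert` are hypothetical, and the mapping from the labels above to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), similar to the `?` entries shown above, though the tool's exact replacement behavior may differ.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (bit string, hex string) for text encoded with codec.

    Unencodable characters become '?' via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("お").items():
        print(label, bits, hexstr)
```

For the single character お, the UTF-8 row of this sketch gives e3818a, which matches the final three bytes of the UTF-8 row in the table.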

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).