To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN ?須市?須市^ 0011111110010000011110111000111001110011001111111001000001111011100011100111001101011110 3f907b8e733f907b8e735e
EUC-JP ?須市?須市^ 0011111110111111110111001011101111010100001111111011111111011100101110111101010001011110 3fbfdcbbd43fbfdcbbd45e
UTF-8 렒須市렒須市^ 11101011101000001001001011101001101000001000100011100101101110001000001011101011101000001001001011101001101000001000100011100101101110001000001001011110 eba092e9a088e5b882eba092e9a088e5b8825e
UHC 렒須市렒須市^ 10001110101001111110001011001110111000111011110010001110101001111110001011001110111000111011110001011110 8ea7e2cee3bc8ea7e2cee3bc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)