What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????B 00111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f42
SJIS-WIN ?葦絲?葦絲B 0011111110001000101011111110001101001110001111111000100010101111111000110100111001000010 3f88afe34e3f88afe34e42
EUC-JP ?葦絲?葦絲B 0011111110110000101100011110010110101111001111111011000010110001111001011010111101000010 3fb0b1e5af3fb0b1e5af42
UTF-8 썸葦絲썸葦絲B 11101100100011011011100011101000100100011010011011100111101101011011001011101100100011011011100011101000100100011010011011100111101101011011001001000010 ec8db8e891a6e7b5b2ec8db8e891a6e7b5b242
UHC 썸葦絲썸葦絲B 10111101111001101110101011011000110111101110101010111101111001101110101011011000110111101110101001000010 bde6ead8deeabde6ead8deea42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A "?" (3f) in the table marks a character that the charset cannot represent.
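
The conversion shown in the table can be sketched in a few lines of Python. This is only an illustration, not the code behind this page. The mapping from the charset labels to Python codec names (for example, SJIS-WIN to cp932 and UHC to cp949) and the helper name encode_row are assumptions. Characters that a charset cannot represent are replaced with "?", which matches the 3f bytes in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "썸葦絲썸葦絲B"
    for label, codec in CODECS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, bits, hex_string)
```

Each byte contributes two hexadecimal digits and eight binary digits, so the binary column is always four times as long as the hexadecimal one.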