To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??莎た?純ヒ?尋??莎た?純ヒ?尋B 0011111100111111111001001011001110000010101111010011111110001111100000111000001101110001001111111001000001110001001111110011111111100100101100111000001010111101001111111000111110000011100000110111000100111111100100000111000101000010 3f3fe4b382bd3f8f8383713f90713f3fe4b382bd3f8f8383713f907142
EUC-JP ??莎た?純ヒ?尋??莎た?純ヒ?尋B 0011111100111111111010001011010110100100101111110011111110111101111000111010010111010010001111111011111111010010001111110011111111101000101101011010010010111111001111111011110111100011101001011101001000111111101111111101001001000010 3f3fe8b5a4bf3fbde3a5d23fbfd23f3fe8b5a4bf3fbde3a5d23fbfd242
UTF-8 룴가莎た룶純ヒ룶尋룴가莎た룶純ヒ룶尋B 11101011101000111011010011101010101100001000000011101000100011101000111011100011100000011001111111101011101000111011011011100111101101001001010011100011100000111001001011101011101000111011011011100101101100001000101111101011101000111011010011101010101100001000000011101000100011101000111011100011100000011001111111101011101000111011011011100111101101001001010011100011100000111001001011101011101000111011011011100101101100001000101101000010 eba3b4eab080e88e8ee3819feba3b6e7b494e38392eba3b6e5b08beba3b4eab080e88e8ee3819feba3b6e7b494e38392eba3b6e5b08b42
UHC 룴가莎た룶純ヒ룶尋룴가莎た룶純ヒ룶尋B 10001111101010011011000010100001110111101110110110101010101111111000111110101011111000101110110110101011110100101000111110101011111000111111110010001111101010011011000010100001110111101110110110101010101111111000111110101011111000101110110110101011110100101000111110101011111000111111110001000010 8fa9b0a1deedaabf8fabe2edabd28fabe3fc8fa9b0a1deedaabf8fabe2edabd28fabe3fc42

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
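
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation. The mapping from the charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption, and the function names encode_row and convert are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the table. Results for some characters may differ from the tool's, because mapping tables vary slightly between implementations.

```python
# Assumed mapping from the tool's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("たB").items():
        print(label, bits, hex_string)
```

Each byte is formatted as eight binary digits, so the binary column is always exactly four times as long as the hexadecimal column.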