To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 松?6?ル+冗??松?6?ル+冗??B 1000111110111100001111111000001001010101001111111000001110001011100000010111101110001111111001110011111100111111100011111011110000111111100000100101010100111111100000111000101110000001011110111000111111100111001111110011111101000010 8fbc3f82553f838b817b8fe73f3f8fbc3f82553f838b817b8fe73f3f42
EUC-JP 松?6?ル+冗??松?6?ル+冗??B 1011111010111110001111111010001110110110001111111010010111101011101000011101110010111110111010010011111100111111101111101011111000111111101000111011011000111111101001011110101110100001110111001011111011101001001111110011111101000010 bebe3fa3b63fa5eba1dcbee93f3fbebe3fa3b63fa5eba1dcbee93f3f42
UTF-8 松듬6略ル+冗밤걗松듬6略ル+冗밤걗B 11100110100111011011111011101011100100111010110011101111101111001001011011101111101001011011011011100011100000111010101111101111101111001000101111100101100001101001011111101011101100001010010011101010101100011001011111100110100111011011111011101011100100111010110011101111101111001001011011101111101001011011011011100011100000111010101111101111101111001000101111100101100001101001011111101011101100001010010011101010101100011001011101000010 e69dbeeb93acefbc96efa5b6e383abefbc8be58697ebb0a4eab197e69dbeeb93acefbc96efa5b6e383abefbc8be58697ebb0a4eab19742
UHC 松듬6略ル+冗밤걗松듬6略ル+冗밤걗B 11100001111001101011010111101011101000111011011011100101101100101010101111101011101000111010101111101001101101111011100111100011100000011000001011100001111001101011010111101011101000111011011011100101101100101010101111101011101000111010101111101001101101111011100111100011100000011000001001000010 e1e6b5eba3b6e5b2abeba3abe9b7b9e38182e1e6b5eba3b6e5b2abeba3abe9b7b9e3818242

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)