To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???????????????姨????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110011011010010000011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f9b483f3f3f3f42
EUC-JP ????????ŀ??????姨????B 001111110011111100111111001111110011111100111111001111110011111110001111101010011100100100111111001111110011111100111111001111110011111111010101101010010011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f8fa9c93f3f3f3f3f3fd5a93f3f3f3f42
UTF-8 溜삠뀛梨붾졎栒붿ŀ栒붾젽溜삠뀛姨ⓦ뀛溜캪B 111011111010011110001011111011001000001010100000111010111000000010011011111011111010011110100010111010111011011010111110111011001010000110001110111001101010000010010010111010111011011010111111110001011000000011100110101000001001001011101011101101101011111011101100101000001011110111101111101001111000101111101100100000101010000011101011100000001001101111100101101001111010100011100010100100111010011011101011100000001001101111101111101001111000101111101100101110101010101001000010 efa78bec82a0eb809befa7a2ebb6beeca18ee6a092ebb6bfc580e6a092ebb6beeca0bdefa78bec82a0eb809be5a7a8e293a6eb809befa78becbaaa42
UHC 溜삠뀛梨붾졎栒붿ŀ栒붾젽溜삠뀛姨ⓦ뀛溜캪B 1110101011111110101110111110001110000101100101001110110010110001100101001110101110100000101110111110001011100011100101001110110010101001101010001110001011100011100101001110101110100000101011111110101011111110101110111110001110000101100101001110110010101001101010001110001110000101100101001110101011111110101100000100110001000010 eafebbe38594ecb194eba0bbe2e394eca9a8e2e394eba0afeafebbe38594eca9a8e38594eafeb04c42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)