To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???????????????姨????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110011011010010000011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f9b483f3f3f3f42
EUC-JP ????????ŀ??????姨????B 001111110011111100111111001111110011111100111111001111110011111110001111101010011100100100111111001111110011111100111111001111110011111111010101101010010011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f8fa9c93f3f3f3f3f3fd5a93f3f3f3f42
UTF-8 溜삠뀛梨낅졎栒붿ŀ栒붾젌溜삠뀛姨ⓦ뀛溜밽B 111011111010011110001011111011001000001010100000111010111000000010011011111011111010011110100010111010111000001010000101111011001010000110001110111001101010000010010010111010111011011010111111110001011000000011100110101000001001001011101011101101101011111011101100101000001000110011101111101001111000101111101100100000101010000011101011100000001001101111100101101001111010100011100010100100111010011011101011100000001001101111101111101001111000101111101011101100001011110101000010 efa78bec82a0eb809befa7a2eb8285eca18ee6a092ebb6bfc580e6a092ebb6beeca08cefa78bec82a0eb809be5a7a8e293a6eb809befa78bebb0bd42
UHC 溜삠뀛梨낅졎栒붿ŀ栒붾젌溜삠뀛姨ⓦ뀛溜밽B 1110101011111110101110111110001110000101100101001110110010110001100001011110101110100000101110111110001011100011100101001110110010101001101010001110001011100011100101001110101110100000100011011110101011111110101110111110001110000101100101001110110010101001101010001110001110000101100101001110101011111110100100110110011101000010 eafebbe38594ecb185eba0bbe2e394eca9a8e2e394eba08deafebbe38594eca9a8e38594eafe936742
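
A conversion like the one tabulated above can be sketched in a few lines of Python. This is only a minimal illustration, not the code behind this page: the function name encode_table is hypothetical, and the Python codec names (latin-1, cp932, euc_jp, utf-8, cp949) are assumed stand-ins for the ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8 and UHC rows. Characters a charset cannot represent are replaced with "?" (hex 3f), which matches the runs of 3f in the table.

```python
def encode_table(text, charsets=("latin-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary string, hex string) for text in each charset.

    The codec names are Python's; they are assumed to correspond to the
    charsets listed on this page.
    """
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(name, errors="replace")
        # Render every byte as eight binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for charset, bits, hex_string in encode_table("B"):
        print(charset, bits, hex_string)
```

For the input "B", every row shows 01000010 and 42, the same trailing byte as in each row of the table.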

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
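
As a rough cross-check of the note above, the byte sequences the table lists for the character 姨 can be decoded back in Python. This is a sketch under assumptions: Python's cp932 and euc_jp codecs are taken to match the SJIS-Win and EUC-JP rows, and the helper name decode_hex is hypothetical.

```python
# Hex values copied from the table rows for the character 姨.
samples = {"cp932": "9b48", "euc_jp": "d5a9", "utf-8": "e5a7a8"}


def decode_hex(hex_string, codec):
    """Turn a hex string into bytes and decode it with the given codec."""
    return bytes.fromhex(hex_string).decode(codec)


# Each charset uses different bytes, but all decode to the same character.
decoded = {codec: decode_hex(value, codec) for codec, value in samples.items()}

if __name__ == "__main__":
    for codec, character in decoded.items():
        print(codec, character)
```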