To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?????????異??????????異?B 00111111001111110011111100111111001111110011111100111111001111110011111110001000110110010011111100111111001111110011111100111111001111110011111100111111001111110011111110001000110110010011111101000010 3f3f3f3f3f3f3f3f3f88d93f3f3f3f3f3f3f3f3f3f88d93f42
EUC-JP ?????????異??????????異?B 00111111001111110011111100111111001111110011111100111111001111110011111110110000110110110011111100111111001111110011111100111111001111110011111100111111001111110011111110110000110110110011111101000010 3f3f3f3f3f3f3f3f3fb0db3f3f3f3f3f3f3f3f3f3fb0db3f42
UTF-8 溜삳젩溜븍젪溜삥솹異늲溜삳젩溜븍젪溜삥솹異늲B 11101111101001111000101111101100100000101011001111101100101000001010100111101111101001111000101111101011101110001000110111101100101000001010101011101111101001111000101111101100100000101010010111101100100001101011100111100111100101011011000011101011100010101011001011101111101001111000101111101100100000101011001111101100101000001010100111101111101001111000101111101011101110001000110111101100101000001010101011101111101001111000101111101100100000101010010111101100100001101011100111100111100101011011000011101011100010101011001001000010 efa78bec82b3eca0a9efa78bebb88deca0aaefa78bec82a5ec86b9e795b0eb8ab2efa78bec82b3eca0a9efa78bebb88deca0aaefa78bec82a5ec86b9e795b0eb8ab242
UHC 溜삳젩溜븍젪溜삥솹異늲溜삳젩溜븍젪溜삥솹異늲B 111010101111111010111011111010111010000010100001111010101111111010111010111010111010000010100010111010101111111010111011111001101001100110101110111011001011011010001000011101101110101011111110101110111110101110100000101000011110101011111110101110101110101110100000101000101110101011111110101110111110011010011001101011101110110010110110100010000111011001000010 eafebbeba0a1eafebaeba0a2eafebbe699aeecb68876eafebbeba0a1eafebaeba0a2eafebbe699aeecb6887642

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
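
A conversion like the one in the table above can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The helper names (`CHARSETS`, `encode_row`, `convert`) are hypothetical, and the mapping of the page's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the 3f bytes shown in the table but may not match the page's behavior in every case.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    # Unrepresentable characters become b"?" (0x3f) instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survived the round trip.
    display = raw.decode(codec, errors="replace")
    # Eight bits per byte, zero-padded, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return display, bits, raw.hex()


def convert(text):
    """Encode the text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (display, bits, hexstr) in convert("異B").items():
        print(name, display, bits, hexstr)
```

For the input "異B", the ISO-8859-1 row comes out as "?B" with hex 3f42, because 異 has no Latin-1 code point, while the SJIS-WIN, EUC-JP and UTF-8 rows contain the byte sequences 88d9, b0db and e795b0 for 異, the same sequences that appear in the table.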