To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????????????????????佚 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111001100011000011 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f98c3
EUC-JP ?????????????????????佚 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111101000011000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fd0c5
UTF-8 溜삳젒溜븍젘溜블뙇溜뽯졋溜딅졋溜붾졎溜뽯졋佚 111011111010011110001011111011001000001010110011111011001010000010010010111011111010011110001011111010111011100010001101111011001010000010011000111011111010011110001011111010111011100010010100111010111001100110000111111011111010011110001011111010111011110110101111111011001010000110001011111011111010011110001011111010111001010010000101111011001010000110001011111011111010011110001011111010111011011010111110111011001010000110001110111011111010011110001011111010111011110110101111111011001010000110001011111001001011110110011010 efa78bec82b3eca092efa78bebb88deca098efa78bebb894eb9987efa78bebbdafeca18befa78beb9485eca18befa78bebb6beeca18eefa78bebbdafeca18be4bd9a
UHC 溜삳젒溜븍젘溜블뙇溜뽯졋溜딅졋溜붾졎溜뽯졋佚 1110101011111110101110111110101110100000100100011110101011111110101110101110101110100000100101001110101011111110101110101110110110001100100011011110101011111110100101101110101110100000101110101110101011111110100010101110101110100000101110101110101011111110100101001110101110100000101110111110101011111110100101101110101110100000101110101110110011101010 eafebbeba091eafebaeba094eafebaed8c8deafe96eba0baeafe8aeba0baeafe94eba0bbeafe96eba0baecea

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)