What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????淹??????????????? 0011111100111111001111110011111100111111001111111001111110111001001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f9fb93f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ??????淹??????????????? 0011111100111111001111110011111100111111001111111101111010111011001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3fdebb3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 溜삳젛溜븀츧淹ㅻ젚溜븍젍溜븍ℓ溜뤿졋溜쒕졋溜 111011111010011110001011111011001000001010110011111011001010000010011011111011111010011110001011111010111011100010000000111011001011100010100111111001101011011110111001111000111000010110111011111011001010000010011010111011111010011110001011111010111011100010001101111011001010000010001101111011111010011110001011111010111011100010001101111000101000010010010011111011111010011110001011111010111010010010111111111011001010000110001011111011111010011110001011111011001001001010010101111011001010000110001011111011111010011110001011 efa78bec82b3eca09befa78bebb880ecb8a7e6b7b9e385bbeca09aefa78bebb88deca08defa78bebb88de28493efa78beba4bfeca18befa78bec9295eca18befa78b
UHC 溜삳젛溜븀츧淹ㅻ젚溜븍젍溜븍ℓ溜뤿졋溜쒕졋溜 1110101011111110101110111110101110100000100101111110101011111110101110101110011110101110100111011110010111110100101001001110101110100000100101101110101011111110101110101110101110100000100011101110101011111110101110101110101110100111101001001110101011111110100011111110101110100000101110101110101011111110100111001110101110100000101110101110101011111110 eafebbeba097eafebae7ae9de5f4a4eba096eafebaeba08eeafebaeba7a4eafe8feba0baeafe9ceba0baeafe

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
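
The conversion in the table above can be approximated outside this page. The following is a minimal Python sketch, not the code this page runs. The function name `encode_to_bits` and the `CHARSETS` mapping are hypothetical names chosen for illustration. It assumes that Python's `cp932` codec corresponds to SJIS-Win and that `cp949` corresponds to UHC. It also assumes that unencodable characters should become "?" (byte 3f), which is what the ISO-8859-1 row appears to show.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
# Assumption: cp932 stands in for SJIS-Win and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?",
    so they show up as the byte 0x3f.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "淹"  # a character that also appears in the table above
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bits(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Each output line follows the same four-column order as the table: charset, character, binary bit string, hexadecimal bit string.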