To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????蜈???伊???蜈???蔚 001111110011111100111111001111111110010110000101001111110011111100111111100010001100100100111111001111110011111111100101100001010011111100111111001111111000100101010101 3f3f3f3fe5853f3f3f88c93f3f3fe5853f3f3f8955
EUC-JP ????蜈???伊???蜈???蔚 001111110011111100111111001111111110100111100101001111110011111100111111101100001100101100111111001111110011111111101001111001010011111100111111001111111011000110110110 3f3f3f3fe9e53f3f3fb0cb3f3f3fe9e53f3f3fb1b6
UTF-8 샬선렱렚蜈선렱렚伊선렱렚蜈선렱렚蔚 111011001000001110101100111011001000010010100000111010111010000010110001111010111010000010011010111010001001110010001000111011001000010010100000111010111010000010110001111010111010000010011010111001001011110010001010111011001000010010100000111010111010000010110001111010111010000010011010111010001001110010001000111011001000010010100000111010111010000010110001111010111010000010011010111010001001010010011010 ec83acec84a0eba0b1eba09ae89c88ec84a0eba0b1eba09ae4bc8aec84a0eba0b1eba09ae89c88ec84a0eba0b1eba09ae8949a
UHC 샬선렱렚蜈선렱렚伊선렱렚蜈선렱렚蔚 10111100101000111011110010110001100011101011111010001110101011011110100010100101101111001011000110001110101111101000111010101101111011001010010110111100101100011000111010111110100011101010110111101000101001011011110010110001100011101011111010001110101011011110101010100101 bca3bcb18ebe8eade8a5bcb18ebe8eadeca5bcb18ebe8eade8a5bcb18ebe8eadeaa5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)