To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣??懿?????飮??釉??嚥??泣 00111111001111110011111110001011100000110011111100111111100111001111001000111111001111110011111100111111001111111001111101011010001111110011111111100111110101100011111100111111100110101000101100111111001111111000101110000011 3f3f3f8b833f3f9cf23f3f3f3f3f9f5a3f3fe7d63f3f9a8b3f3f8b83
EUC-JP ???泣??懿?????飮??釉??嚥??泣 00111111001111110011111110110101111000110011111100111111110110001111010000111111001111110011111100111111001111111101110110111011001111110011111111101110110110000011111100111111110100111110101100111111001111111011010111100011 3f3f3fb5e33f3fd8f43f3f3f3f3fddbb3f3feed83f3fd3eb3f3fb5e3
UTF-8 念잙툦泣섉퓴懿얠뒳黎싳쥜飮긺삜釉앹뒛嚥싳빢泣 111011111010011010100011111011001001111010011001111011011000100010100110111001101011001110100011111011001000010010001001111011011001001110110100111001101000011110111111111011001001011010100000111010111001001010110011111011111010011010001001111011001000101110110011111011001010010110011100111010011010001110101110111010101011100010111010111011001000001010011100111010011000011110001001111011001001010110111001111010111001001010011011111001011001101010100101111011001000101110110011111010111011100110100010111001101011001110100011 efa6a3ec9e99ed88a6e6b3a3ec8489ed93b4e687bfec96a0eb92b3efa689ec8bb3eca59ce9a3aeeab8baec829ce98789ec95b9eb929be59aa5ec8bb3ebb9a2e6b3a3
UHC 念잙툦泣섉퓴懿얠뒳黎싳쥜飮긺삜釉앹뒛嚥싳빢泣 1110011011110110100111111110101110111000100111011110101111101000100110001110011010111111100110101110101111110011101111101110110010001010101011001110011010110001100110101110110010100010100100011110101111100110101100011110011110011000100111111110101110111000100111011110110010001010100110001110011010111111100110101110110010010101101111101110101111101000 e6f69febb89debe898e6bf9aebf3beec8aace6b19aeca291ebe6b1e7989febb89dec8a98e6bf9aec95beebe8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)