To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 永??逾??違??永??釉э─淫??永??逾 1000100101101001001111110011111111100111101001010011111100111111100010001110000100111111001111111000100101101001001111110011111111100111110101101000010010001111100001001001111110001000111110100011111100111111100010010110100100111111001111111110011110100101 89693f3fe7a53f3f88e13f3f89693f3fe7d6848f849f88fa3f3f89693f3fe7a5
EUC-JP 永??逾??違??永??釉э─淫??永??逾 1011000111001010001111110011111111101110101001110011111100111111101100001110001100111111001111111011000111001010001111110011111111101110110110001010011111101111101010001010000110110000111111000011111100111111101100011100101000111111001111111110111010100111 b1ca3f3feea73f3fb0e33f3fb1ca3f3feed8a7efa8a1b0fc3f3fb1ca3f3feea7
UTF-8 永띔퍏逾뽫쨼違먯돩永띔퍎釉э─淫볦돟永띔퍏逾 1110011010110000101110001110101110011101100101001110110110001101100011111110100110000000101111101110101110111101101010111110110010101000101111001110100110000001100101011110101110101000101011111110101110001111101010011110011010110000101110001110101110011101100101001110110110001101100011101110100110000111100010011101000110001101111000101001010010000000111001101011011110101011111010111011001110100110111010111000111110011111111001101011000010111000111010111001110110010100111011011000110110001111111010011000000010111110 e6b0b8eb9d94ed8d8fe980beebbdabeca8bce98195eba8afeb8fa9e6b0b8eb9d94ed8d8ee98789d18de29480e6b7abebb3a6eb8f9fe6b0b8eb9d94ed8d8fe980be
UHC 永띔퍏逾뽫쨼違먯돩永띔퍎釉э─淫볦돟永띔퍏逾 1110011110110101101101101110101010111011100001101110101110110101100101101110011110100100100101101110101011011110100100001110110010001001101011001110011110110101101101101110101010111011100001011110101110111000101011001110111110100110101000011110101111100010100100111110110010001001101001011110011110110101101101101110101010111011100001101110101110110101 e7b5b6eabb86ebb596e7a496eade90ec89ace7b5b6eabb85ebb8acefa6a1ebe293ec89a5e7b5b6eabb86ebb5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)