What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 訝???????????????????? 11100110011000100011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 e6623f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP 訝???????????????????? 11101011110000110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 ebc33f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 訝방겘呂묇퍟呂묌걶呂묇퍟呂묈쩀呂묇퍟呂묈뎴 111010001010100010011101111010111011000010101001111010101011001010011000111011111010011010000000111010111010110010000111111011011000110110011111111011111010011010000000111010111010110010001100111010101011000110110110111011111010011010000000111010111010110010000111111011011000110110011111111011111010011010000000111010111010110010001000111011001010100110000000111011111010011010000000111010111010110010000111111011011000110110011111111011111010011010000000111010111010110010001000111010111000111010110100 e8a89debb0a9eab298efa680ebac87ed8d9fefa680ebac8ceab1b6efa680ebac87ed8d9fefa680ebac88eca980efa680ebac87ed8d9fefa680ebac88eb8eb4
UHC 訝방겘呂묇퍟呂묌걶呂묇퍟呂묈쩀呂묇퍟呂묈뎴 111001001011100010111001111001101000000110101111111001011111101110010001111001001011101110010110111001011111101110010001111010011000000110011100111001011111101110010001111001001011101110010110111001011111101110010001111001011010010010011010111001011111101110010001111001001011101110010110111001011111101110010001111001011000100110000111 e4b8b9e681afe5fb91e4bb96e5fb91e9819ce5fb91e4bb96e5fb91e5a49ae5fb91e4bb96e5fb91e58987

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
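
The conversion in the table can be approximated with a short Python sketch. This is not the page's own implementation, which is not shown here. The helper names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-Win to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` seen in the table.

```python
# Map the charset labels used in the table to Python codec names.
# This mapping is an assumption; the page may use different converters.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one table-like row per charset for a sample character.
    for name, (bits, hex_string) in convert("あ").items():
        print(name, bits, hex_string)
```

Using `errors="replace"` rather than the default strict mode keeps the sketch from raising `UnicodeEncodeError` when a charset such as ISO-8859-1 has no mapping for the input.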