To what bit string is a character (or a string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????O 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101001111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4f
SJIS-WIN 艾?????隱??乙?????肉η?違??O 11100100100010000011111100111111001111110011111100111111111010001010101000111111001111111000100110110011001111110011111100111111001111110011111110010011111101111000001111000101001111111000100011100001001111110011111101001111 e4883f3f3f3f3fe8aa3f3f89b33f3f3f3f3f93f783c53f88e13f3f4f
EUC-JP 艾?????隱??乙??孼??肉η?違??O 111001111110100000111111001111110011111100111111001111111111000010101100001111110011111110110010101101010011111100111111100011111011101011000011001111110011111111000110111110011010011011000111001111111011000011100011001111110011111101001111 e7e83f3f3f3f3ff0ac3f3fb2b53f3f8fbac33f3fc6f9a6c73fb0e33f3f4f
UTF-8 艾싳궠梨욘룚隱껃뿗乙녹굻孼뽯떻肉η솒違곷렰O 111010001000100110111110111011001000101110110011111010101011011010100000111011111010011110100010111011001001101010011000111010111010001110011010111010011001101010110001111010101011101110000011111010111011111110010111111001001011100110011001111010111000010110111001111010101011010110111011111001011010110110111100111010111011110110101111111010111001011010111011111010001000001010001001110011101011011111101100100001101001001011101001100000011001010111101010101100111011011111101011101000001011000001001111 e889beec8bb3eab6a0efa7a2ec9a98eba39ae99ab1eabb83ebbf97e4b999eb85b9eab5bbe5adbcebbdafeb96bbe88289ceb7ec8692e98195eab3b7eba0b04f
UHC 艾싳궠梨욘룚隱껃뿗乙녹굻孼뽯떻肉η솒違곷렰O 11100100111101011001101011101100100000101011001111101100101100011011111111100110100011111001011011101011110111111000001111100101100101111001101011101011111000001011001111101100101100011011111111100101111011011001011011101011101101101011101111101011101111111010010111100111100110011001001011101010110111101000000111101011100011101011110101001111 e4f59aec82b3ecb1bfe68f96ebdf83e5979aebe0b3ecb1bfe5ed96ebb6bbebbfa5e79992eade81eb8ebd4f

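SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively. Characters that a charset cannot represent appear as "?" (0x3f) in the table above.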
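A conversion like the one in the table above can be sketched in Python. This is only an illustration and not the code behind this page. The helper name `encode_to_bits` and the mapping from the table's charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) are assumptions, and Python's codecs may treat a few characters differently from the converter used here.

```python
def encode_to_bits(text, charset):
    """Return (binary string, hex string) for text encoded in charset."""
    # errors="replace" turns unencodable characters into "?" (0x3f),
    # which matches the "?" entries seen in the table above.
    raw = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bits("艾O", codec)
        print(label, binary, hexadecimal)
```

The binary column is simply each byte of the encoded form written as eight bits, so its length is always eight times the number of bytes shown in the hexadecimal column.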