To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 毓?????訝??[毓?????訝??[^ 10011111011110010011111100111111001111110011111100111111111001100110001000111111001111110101101110011111011110010011111100111111001111110011111100111111111001100110001000111111001111110101101101011110 9f793f3f3f3f3fe6623f3f5b9f793f3f3f3f3fe6623f3f5b5e
EUC-JP 毓?????訝??[毓?????訝??[^ 11011101110110100011111100111111001111110011111100111111111010111100001100111111001111110101101111011101110110100011111100111111001111110011111100111111111010111100001100111111001111110101101101011110 ddda3f3f3f3f3febc33f3f5bddda3f3f3f3f3febc33f3f5b5e
UTF-8 毓띴툟殮덈풅訝뽫썕[毓띴툟殮덈풅訝뽫썕[^ 111001101010111110010011111010111001110110110100111011011000100010011111111011111010011010100101111010111000110110001000111011011001001010000101111010001010100010011101111010111011110110101011111011001000110110010101010110111110011010101111100100111110101110011101101101001110110110001000100111111110111110100110101001011110101110001101100010001110110110010010100001011110100010101000100111011110101110111101101010111110110010001101100101010101101101011110 e6af93eb9db4ed889fefa6a5eb8d88ed9285e8a89debbdabec8d955be6af93eb9db4ed889fefa6a5eb8d88ed9285e8a89debbdabec8d955b5e
UHC 毓띴툟殮덈풅訝뽫썕[毓띴툟殮덈풅訝뽫썕[^ 111010111011111010001101111001001011100010010110111001101111100110001000111010111011111010001101111001001011100010010110111001111001101110001000010110111110101110111110100011011110010010111000100101101110011011111001100010001110101110111110100011011110010010111000100101101110011110011011100010000101101101011110 ebbe8de4b896e6f988ebbe8de4b896e79b885bebbe8de4b896e6f988ebbe8de4b896e79b885b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)