To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 毓?????秧??[毓?????秧??[^ 10011111011110010011111100111111001111110011111100111111111000100101111000111111001111110101101110011111011110010011111100111111001111110011111100111111111000100101111000111111001111110101101101011110 9f793f3f3f3f3fe25e3f3f5b9f793f3f3f3f3fe25e3f3f5b5e
EUC-JP 毓?????秧??[毓?????秧??[^ 11011101110110100011111100111111001111110011111100111111111000111011111100111111001111110101101111011101110110100011111100111111001111110011111100111111111000111011111100111111001111110101101101011110 ddda3f3f3f3f3fe3bf3f3f5bddda3f3f3f3f3fe3bf3f3f5b5e
UTF-8 毓멨뜥力꾨씀秧ⓩ퀎[毓멨뜥力꾨씀秧ⓩ퀎[^ 111001101010111110010011111010111010100110101000111010111001110010100101111011111010011010001010111010101011111010101000111011001001010010000000111001111010011110100111111000101001001110101001111011011000000010001110010110111110011010101111100100111110101110101001101010001110101110011100101001011110111110100110100010101110101010111110101010001110110010010100100000001110011110100111101001111110001010010011101010011110110110000000100011100101101101011110 e6af93eba9a8eb9ca5efa68aeabea8ec9480e7a7a7e293a9ed808e5be6af93eba9a8eb9ca5efa68aeabea8ec9480e7a7a7e293a9ed808e5b5e
UHC 毓멨뜥力꾨씀秧ⓩ퀎[毓멨뜥力꾨씀秧ⓩ퀎[^ 111010111011111010111000111001011000110110101000111001101011001110000100111010111011111010111000111001001110101110101000111001101011001110000100010110111110101110111110101110001110010110001101101010001110011010110011100001001110101110111110101110001110010011101011101010001110011010110011100001000101101101011110 ebbeb8e58da8e6b384ebbeb8e4eba8e6b3845bebbeb8e58da8e6b384ebbeb8e4eba8e6b3845b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)