To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 諺??毅??異????????B 10001100101111110011111100111111100010110100001000111111001111111000100011011001001111110011111100111111001111110011111100111111001111110011111101000010 8cbf3f3f8b423f3f88d93f3f3f3f3f3f3f3f42
EUC-JP 諺??毅??異?????孼??B 101110001100000100111111001111111011010110100011001111110011111110110000110110110011111100111111001111110011111100111111100011111011101011000011001111110011111101000010 b8c13f3fb5a33f3fb0db3f3f3f3f3f8fbac33f3f42
UTF-8 諺⑸쉼毅싨룚異잍렔戮곗삖孼뽰텫B 11101000101010111011101011100010100100011011100011101100100010011011110011100110101011111000010111101100100010111010100011101011101000111001101011100111100101011011000011101100100111101000110111101011101000001001010011101111101001111001001011101010101100111001011111101100100000101001011011100101101011011011110011101011101111011011000011101101100001011010101101000010 e8abbae291b8ec89bce6af85ec8ba8eba39ae795b0ec9e8deba094efa792eab397ec8296e5adbcebbdb0ed85ab42
UHC 諺⑸쉼毅싨룚異잍렔戮곗삖孼뽰텫B 11100101111011001010100111101011101111011011000011101011111101101001101011100110100011111001011011101100101101101001111111100110100011101010100111101011101111011011000011101100100110001001101011100101111011011001011011101100101101101001111101000010 e5eca9ebbdb0ebf69ae68f96ecb69fe68ea9ebbdb0ec989ae5ed96ecb69f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)