To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????az??????azB 0011111100111111001111110011111100111111001111110110000101111010001111110011111100111111001111110011111100111111011000010111101001000010 3f3f3f3f3f3f617a3f3f3f3f3f3f617a42
SJIS-WIN 鐔醐醜鐔種州az鐔醐醜鐔種州azB 1110100001011100100011001110110110001111010110001110100001011100100011101110110110001111010000100110000101111010111010000101110010001100111011011000111101011000111010000101110010001110111011011000111101000010011000010111101001000010 e85c8ced8f58e85c8eed8f42617ae85c8ced8f58e85c8eed8f42617a42
EUC-JP 鐔醐醜鐔種州az鐔醐醜鐔種州azB 1110111110111101101110001110111110111101101110011110111110111101101111001110111110111101101000110110000101111010111011111011110110111000111011111011110110111001111011111011110110111100111011111011110110100011011000010111101001000010 efbdb8efbdb9efbdbcefbda3617aefbdb8efbdb9efbdbcefbda3617a42
UTF-8 鐔醐醜鐔種州az鐔醐醜鐔種州azB 1110100110010000100101001110100110000110100100001110100110000110100111001110100110010000100101001110011110101000101011101110010110110111100111100110000101111010111010011001000010010100111010011000011010010000111010011000011010011100111010011001000010010100111001111010100010101110111001011011011110011110011000010111101001000010 e99094e98690e9869ce99094e7a8aee5b79e617ae99094e98690e9869ce99094e7a8aee5b79e617a42
UHC ??醜?種州az??醜?種州azB 0011111100111111111101011101110100111111111100001111101011110001101101100110000101111010001111110011111111110101110111010011111111110000111110101111000110110110011000010111101001000010 3f3ff5dd3ff0faf1b6617a3f3ff5dd3ff0faf1b6617a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)