To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????}v??????}vB 0011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f7d763f3f3f3f3f3f7d7642
SJIS-WIN 淨???窪ⅶ}v淨???窪ⅶ}vB 1001111111000100001111110011111100111111100011000100010111111010010001100111110101110110100111111100010000111111001111110011111110001100010001011111101001000110011111010111011001000010 9fc43f3f3f8c45fa467d769fc43f3f3f8c45fa467d7642
EUC-JP 淨???窪?}v淨???窪?}vB 110111101100011000111111001111110011111110110111101001100011111101111101011101101101111011000110001111110011111100111111101101111010011000111111011111010111011001000010 dec63f3f3fb7a63f7d76dec63f3f3fb7a63f7d7642
UTF-8 淨렠履렰窪ⅶ}v淨렠履렰窪ⅶ}vB 1110011010110111101010001110101110100000101000001110111110100111100111111110101110100000101100001110011110101010101010101110001010000101101101100111110101110110111001101011011110101000111010111010000010100000111011111010011110011111111010111010000010110000111001111010101010101010111000101000010110110110011111010111011001000010 e6b7a8eba0a0efa79feba0b0e7aaaae285b67d76e6b7a8eba0a0efa79feba0b0e7aaaae285b67d7642
UHC 淨렠履렰窪ⅶ}v淨렠履렰窪ⅶ}vB 1110111111100100100011101011000111101100101010101000111010111101111010001100000110100101101001110111110101110110111011111110010010001110101100011110110010101010100011101011110111101000110000011010010110100111011111010111011001000010 efe48eb1ecaa8ebde8c1a5a77d76efe48eb1ecaa8ebde8c1a5a77d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)