To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 率??辱わ?浴? 100101111010011000111111001111111001000001001010100000101110110100111111100101111000000100111111 97a63f3f904a82ed3f97813f
EUC-JP 率??辱わ?浴? 110011101010100000111111001111111011111110101011101001001110111100111111110011011110000100111111 cea83f3fbfaba4ef3fcde13f
UTF-8 率녘나辱わ풗浴싪 111001111000111010000111111010111000010110011000111010111000001010011000111010001011111010110001111000111000001010001111111011011001001010010111111001101011010110110100111011001000101110101010 e78e87eb8598eb8298e8beb1e3828fed9297e6b5b4ec8baa
UHC 率녘나辱わ풗浴싪 11100001111000111011001111101000101100111010101011101001101101001010101011101111101111101001101011101001101100011001101011101000 e1e3b3e8b3aae9b4aaefbe9ae9b19ae8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)