What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蜈??????ι4瀛 1110010110000101001111110011111100111111001111110011111100111111100000111100011110000010010100111110000001101001 e5853f3f3f3f3f3f83c78253e069
EUC-JP 蜈??繇???ι4瀛 11101001111001010011111100111111100011111101010011010001001111110011111100111111101001101100100110100011101101001101111111001010 e9e53f3f8fd4d13f3f3fa6c9a3b4dfca
UTF-8 蜈욃뮧繇닻뙞銳ι4瀛 1110100010011100100010001110110010011010100000111110101110101110101001111110011110111001100001111110101110001011101110111110101110011001100111101110100110001010101100111100111010111001111011111011110010010100111001111000000010011011 e89c88ec9a83ebaea7e7b987eb8bbbeb999ee98ab3ceb9efbc94e7809b
UHC 蜈욃뮧繇닻뙞銳ι4瀛 1110100010100101100111101110010110010010101100101110100110100011101101001110100110001100101000111110011111100101101001011110100110100011101101001110011110111010 e8a59ee592b2e9a3b4e98ca3e7e5a5e9a3b4e7ba
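
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation. The helper name `encode_bits` and the `CODECS` mapping are hypothetical. Mapping SJIS-WIN to Python's `cp932` codec follows the note on this page, while mapping UHC to `cp949` and substituting `?` (0x3f) for characters a charset cannot represent are assumptions inferred from the `3f` bytes in the table.

```python
# Hypothetical mapping from the charset labels on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",   # the page states SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}

def encode_bits(text: str, charset: str) -> tuple[str, str]:
    """Return (binary string, hex string) for text encoded in the given charset.

    Characters the charset cannot represent are replaced with '?' (0x3f),
    which is an assumption about how the tool behaves.
    """
    data = text.encode(CODECS[charset], errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()

if __name__ == "__main__":
    for name in CODECS:
        bits, hexa = encode_bits("あ", name)
        print(name, bits, hexa)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.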

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).