To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 蜈??閻??閻?? 111001011000010100111111001111111110100010000101001111110011111111101000100001010011111100111111 e5853f3fe8853f3fe8853f3f
EUC-JP 蜈??閻??閻?? 111010011110010100111111001111111110111111100101001111110011111111101111111001010011111100111111 e9e53f3fefe53f3fefe53f3f
UTF-8 蜈욅쥢閻볢쪥閻뺟깗 111010001001110010001000111011001001101010000101111011001010010110100010111010011001011010111011111010111011001110100010111011001010101010100101111010011001011010111011111010111011101010011111111010101011100110010111 e89c88ec9a85eca5a2e996bbebb3a2ecaaa5e996bbebba9feab997
UHC 蜈욅쥢閻볢쪥閻뺟깗 111010001010010110011110111001111010001010010101111001111010001010010011111010001010010110011110111001111010001010010101111001111000001110001111 e8a59ee7a295e7a293e8a59ee7a295e7838f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)