To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 檍?????鎖?? 1001111011111000001111110011111100111111001111110011111110001101101111010011111100111111 9ef83f3f3f3f3f8dbd3f3f
EUC-JP 檍?????鎖?? 1101110011111010001111110011111100111111001111110011111110111010101111110011111100111111 dcfa3f3f3f3f3fbabf3f3f
UTF-8 檍용맧栒덌쬉鎖뀀젗 111001101010101010001101111011001001101010101001111010111010011110100111111001101010000010010010111010111000110110001100111011001010110010001001111010011000111010010110111010111000000010000000111011001010000010010111 e6aa8dec9aa9eba7a7e6a092eb8d8cecac89e98e96eb8080eca097
UHC 檍용맧栒덌쬉鎖뀀젗 111001011110010110111111111010111001000010110000111000101110001110001000111011111010011010011111111000011111000010110010111010111010000010010011 e5e5bfeb90b0e2e388efa69fe1f0b2eba093

SJIS-WIN, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP) respectively.
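
The rows in the table above can be approximated offline with a short Python sketch. It is not the converter this page runs. The helper name `encode_row` and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions made for illustration, and Python's codecs may not agree byte for byte with the page for every character. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` runs seen in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # errors="replace" turns characters the charset cannot represent into "?".
    raw = text.encode(codec, errors="replace")
    # Decode again to see what survives the round trip through this charset.
    shown = raw.decode(codec, errors="replace")
    # Each byte is written as exactly eight binary digits.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        shown, bits, hexed = encode_row("檍", codec)
        # Only ASCII fields are printed, so this also works on limited consoles.
        print(label, bits, hexed)
```

For the first input character, `encode_row("檍", "utf-8")` yields the hex string `e6aa8d`, the same three bytes that open the UTF-8 row above.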