To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????裔??竊 001111110011111100111111001111110011111100111111111001011110000100111111001111111110001010000110 3f3f3f3f3f3fe5e13f3fe286
EUC-JP ??????裔??竊 001111110011111100111111001111110011111100111111111010101110001100111111001111111110001111100110 3f3f3f3f3f3feae33f3fe3e6
UTF-8 曆꿜폒兩좄쾾裔뀐숲竊 111011111010011010001011111010101011111110011100111011011000111110010010111011111010010110111000111011001010001010000100111011001011111010111110111010001010001110010100111010111000000010010000111011001000100010110010111001111010101110001010 efa68beabf9ced8f92efa5b8eca284ecbebee8a394eb8090ec88b2e7ab8a
UHC 曆꿜폒兩좄쾾裔뀐숲竊 1110011010110111101100101110010010111100100111001110010110111011101000001110100010110010100101001110011111100000101100101110111110111101101000111110111110111100 e6b7b2e4bc9ce5bba0e8b294e7e0b2efbda3efbc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)