To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????轅??筌 001111110011111100111111001111110011111100111111111001110111011000111111001111111110001010100011 3f3f3f3f3f3fe7763f3fe2a3
EUC-JP ??????轅??筌 001111110011111100111111001111110011111100111111111011011101011100111111001111111110010010100101 3f3f3f3f3f3fedd73f3fe4a5
UTF-8 閱뤿툖栒뗤틠轅대뙑筌 111010011001011010110001111010111010010010111111111011011000100010010110111001101010000010010010111010111001011110100100111011011000101110100000111010001011110110000101111010111000110010000000111010111001100110010001111001111010110110001100 e996b1eba4bfed8896e6a092eb97a4ed8ba0e8bd85eb8c80eb9991e7ad8c
UHC 閱뤿툖栒뗤틠轅대뙑筌 1110011011110011100011111110101110111000100011011110001011100011100010111110010010111010100011001110101010111111101101001110101110001100100101101110111110100111 e6f38febb88de2e38be4ba8ceabfb4eb8c96efa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)