To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ???油??筍?? 0011111100111111001111111001011011111011001111110011111111100010101000010011111100111111 3f3f3f96fb3f3fe2a13f3f
EUC-JP ???油??筍?? 0011111100111111001111111100110011111101001111110011111111100100101000110011111100111111 3f3f3fccfd3f3fe4a33f3f
UTF-8 閱뤿툙油닷ㅇ筍⑹쐺 111010011001011010110001111010111010010010111111111011011000100010011001111001101011001010111001111010111000101110110111111000111000010110000111111001111010110110001101111000101001000110111001111011001001000010111010 e996b1eba4bfed8899e6b2b9eb8bb7e38587e7ad8de291b9ec90ba
UHC 閱뤿툙油닷ㅇ筍⑹쐺 111001101111001110001111111010111011100010010000111010101111101010110100111001011010010010110111111000101110110010101001111011001001110010011100 e6f38febb890eafab4e5a4b7e2eca9ec9c9c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)