Which bit string is a character (or several characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ????????? | 001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f
SJIS-WIN | ????θ?幽?? | 0011111100111111001111110011111110000011110001100011111110010111010010000011111100111111 | 3f3f3f3f83c63f97483f3f
EUC-JP | ????θ?幽?? | 0011111100111111001111110011111110100110110010000011111111001101101010010011111100111111 | 3f3f3f3fa6c83fcda93f3f
UTF-8 | 捻꿔꺂杻θ짆幽녿뙑 | 1110111110100110101001001110101010111111100101001110101010111010100000101110111110100111100010001100111010111000111011001010011110000110111001011011100110111101111010111000010110111111111010111001100110010001 | efa6a4eabf94eaba82efa788ceb8eca786e5b9bdeb85bfeb9991
UHC | 捻꿔꺂杻θ짆幽녿뙑 | 111001101111011110110010111000111000001110101011111010101111010010100101111010001010001110010101111010101110101110000110111010111000110010010110 | e6f7b2e383abeaf4a5e8a395eaeb86eb8c96

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
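
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names are Python's own and are assumptions: `cp932` stands in for SJIS-WIN and `cp949` for UHC. The helper names `encode_to_bits` and `conversion_table` are hypothetical. Characters a charset cannot represent are replaced with `?` (byte 3f), which is the fallback the table appears to show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 as a stand-in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 as a stand-in for UHC
}


def encode_to_bits(text, codec):
    """Return (displayed text, binary bit string, hex string) for text in codec."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to see what survives the round trip.
    shown = raw.decode(codec, errors="replace")
    # Eight bits per byte, zero-padded, concatenated without separators.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return shown, binary, raw.hex()


def conversion_table(text):
    """Build one (label, shown, binary, hex) row per charset."""
    return [(label, *encode_to_bits(text, codec)) for label, codec in CODECS.items()]


# Print only the ASCII columns so the demo works on any terminal.
for label, _shown, binary, hexa in conversion_table("\u03b8"):  # Greek theta
    print(label, binary, hexa)
```

For θ alone, the UTF-8, SJIS-WIN and EUC-JP rows yield the same bytes that appear for θ inside the longer table entries (ceb8, 83c6 and a6c8).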