To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 闢埼亂縺帝Ζ邱定註 111010001001001110001101111010011001100010101010111000111000000110010010111010011000001110100100111001111011011110010010111010001001001010010000 e8938de998aae38192e983a4e7b792e89290
EUC-JP 闢埼亂縺帝Ζ邱定註 111011111111001110111010111010111101000010101100111001011110000111000100111010111010011010100110111011101011100111000100111010101100001111110000 eff3baebd0ace5e1c4eba6a6eeb9c4eac3f0
UTF-8 闢埼亂縺帝Ζ邱定註 1110100110010111101000101110010110011111101111001110010010111010100000101110011110111000101110101110010110111000100111011100111010010110111010011000001010110001111001011010111010011010111010001010100010111011 e997a2e59fbce4ba82e7b8bae5b89dce96e982b1e5ae9ae8a8bb
UHC 闢埼亂?帝Ζ邱定註 1101110010100011110100001111001011010101101011110011111111110000101010001010010111000110110011111100100011101111110100101111000111001001 dca3d0f2d5af3ff0a8a5c6cfc8efd2f1c9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)