What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 盖鶺鈔眩瘁韲^ 11100001101100111110101001010100111001111110001011100001101111111110000110000001111010001110110001011110 e1b3ea54e7e2e1bfe181e8ec5e
EUC-JP 盖鶺鈔眩瘁韲^ 11100010101101011111001110110101111011101110010011100010110000011110000111100001111100001110111001011110 e2b5f3b5eee4e2c1e1e1f0ee5e
UTF-8 盖鶺鈔眩瘁韲^ 11100111100110111001011011101001101101101011101011101001100010001001010011100111100111001010100111100111100110001000000111101001100111111011001001011110 e79b96e9b6bae98894e79ca9e79881e99fb25e
UHC 盖??眩??^ 110010111100110000111111001111111111101011011111001111110011111101011110 cbcc3f3ffadf3f3f5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
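
The rows of the table above can be approximated offline with a short script. This is a minimal sketch, not the converter's actual implementation: the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, as are the helper names `encode_row` and `encode_table`. Characters a charset cannot represent are replaced with "?" (byte 3f), which matches what the ISO-8859-1 and UHC rows show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one codec."""
    # Unencodable characters become "?" (0x3f) instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what actually survived the round trip.
    shown = raw.decode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def encode_table(text):
    """Build one (label, shown, bits, hex) row per charset in CODECS."""
    return [(label, *encode_row(text, codec)) for label, codec in CODECS.items()]


if __name__ == "__main__":
    for label, shown, bits, hexed in encode_table("盖^"):
        print(label, shown, bits, hexed)
```

Python's codec tables may differ from the converter's for some vendor-specific characters, so the output is a cross-check rather than a guaranteed match for every row.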