To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 驕擾ス」驕丞髪 111010011000000110001111111011111011110110100011111010011000000110001111111001011001010010101111 e9818fefbda3e9818fe594af
EUC-JP 驕擾ス」驕丞髪 1111000111100001101111101111000110001110101111011000111010100011111100011110000110111110111001111100100010110001 f1e1bef18ebd8ea3f1e1bee7c8b1
UTF-8 驕擾ス」驕丞髪 111010011010100110010101111001101001001110111110111011111011110110111101111011111011110110100011111010011010100110010101111001001011100010011110111010011010101110101010 e9a995e693beefbdbdefbda3e9a995e4b89ee9abaa
UHC 驕擾??驕丞? 1100111011110110111010001111011000111111001111111100111011110110111000111010101000111111 cef6e8f63f3fcef6e3aa3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)