To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 苙肢―苙肢―^ 11100100100111101000111010001000100000010101110011100100100111101000111010001000100000010101110001011110 e49e8e88815ce49e8e88815c5e
EUC-JP 苙肢―苙肢―^ 11100111111111101011101111101000101000011011110111100111111111101011101111101000101000011011110101011110 e7febbe8a1bde7febbe8a1bd5e
UTF-8 苙肢―苙肢―^ 11101000100010111001100111101000100000101010001011100010100000001001010111101000100010111001100111101000100000101010001011100010100000001001010101011110 e88b99e882a2e28095e88b99e882a2e280955e
UHC ?肢―?肢―^ 0011111111110010101101101010000110101010001111111111001010110110101000011010101001011110 3ff2b6a1aa3ff2b6a1aa5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)