To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 驩、蛟ャ蝗り」 1110100110001001101001001110010110000000101011001110010110011011100000101110100010100011 e989a4e580ace59b82e8a3
EUC-JP 驩、蛟ャ蝗り」 1111000111101001100011101010010011101001111000001000111010101100111010011111101110100100111010101000111010100011 f1e98ea4e9e08eace9fba4ea8ea3
UTF-8 驩、蛟ャ蝗り」 111010011010100110101001111011111011110110100100111010001001101110011111111011111011110110101100111010001001110110010111111000111000001010001010111011111011110110100011 e9a9a9efbda4e89b9fefbdace89d97e3828aefbda3
UHC 驩?蛟?蝗り? 1111110010111110001111111100111011110001001111111111110011011001101010101110101000111111 fcbe3fcef13ffcd9aaea3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)