To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???}i???}iB 0011111100111111001111110111110101101001001111110011111100111111011111010110100101000010 3f3f3f7d693f3f3f7d6942
SJIS-WIN 臣??}i臣??}iB 10010000011000100011111100111111011111010110100110010000011000100011111100111111011111010110100101000010 90623f3f7d6990623f3f7d6942
EUC-JP 臣??}i臣??}iB 10111111110000110011111100111111011111010110100110111111110000110011111100111111011111010110100101000010 bfc33f3f7d69bfc33f3f7d6942
UTF-8 臣뗥깷}i臣뗥깷}iB 1110100010000111101000111110101110010111101001011110101010111001101101110111110101101001111010001000011110100011111010111001011110100101111010101011100110110111011111010110100101000010 e887a3eb97a5eab9b77d69e887a3eb97a5eab9b77d6942
UHC 臣뗥깷}i臣뗥깷}iB 1110001111101101100010111110010110000011101001010111110101101001111000111110110110001011111001011000001110100101011111010110100101000010 e3ed8be583a57d69e3ed8be583a57d6942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)