To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????i?????????iB 001111110011111100111111001111110011111100111111001111110011111100111111011010010011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f6942
SJIS-WIN 鶯????????i鶯????????iB 1110100111110010001111110011111100111111001111110011111100111111001111110011111101101001111010011111001000111111001111110011111100111111001111110011111100111111001111110110100101000010 e9f23f3f3f3f3f3f3f3f69e9f23f3f3f3f3f3f3f3f6942
EUC-JP 鶯??佾?????i鶯??佾?????iB 111100101111010000111111001111111000111110110000111110110011111100111111001111110011111100111111011010011111001011110100001111110011111110001111101100001111101100111111001111110011111100111111001111110110100101000010 f2f43f3f8fb0fb3f3f3f3f3f69f2f43f3f8fb0fb3f3f3f3f3f6942
UTF-8 鶯ㅼ렲佾믥툞流껋댅i鶯ㅼ렲佾믥툞流껋댅iB 111010011011011010101111111000111000010110111100111010111010000010110010111001001011110110111110111010111010111110100101111011011000100010011110111011111010011110001010111010101011101110001011111010111000110010000101011010011110100110110110101011111110001110000101101111001110101110100000101100101110010010111101101111101110101110101111101001011110110110001000100111101110111110100111100010101110101010111011100010111110101110001100100001010110100101000010 e9b6afe385bceba0b2e4bdbeebafa5ed889eefa78aeabb8beb8c8569e9b6afe385bceba0b2e4bdbeebafa5ed889eefa78aeabb8beb8c856942
UHC 鶯ㅼ렲佾믥툞流껋댅i鶯ㅼ렲佾믥툞流껋댅iB 111001011010001110100100111011001000111010111111111011001110101110010010111001111011100010010101111010101111110010000011111011001000100010101111011010011110010110100011101001001110110010001110101111111110110011101011100100101110011110111000100101011110101011111100100000111110110010001000101011110110100101000010 e5a3a4ec8ebfeceb92e7b895eafc83ec88af69e5a3a4ec8ebfeceb92e7b895eafc83ec88af6942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)