To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????æ??????????? 00111111001111110011111100111111001111110011111100111111111001100011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3fe63f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??健イ’?????健??釐ゴ?碇ピ? 001111110011111110001100100100101000001101000011100000010110011000111111001111110011111100111111001111111000110010010010001111110011111111100111110110001000001101010011001111111001001011110100100000110111001100111111 3f3f8c92834381663f3f3f3f3f8c923f3fe7d883533f92f483733f
EUC-JP ??健イ’??æ??健??釐ゴ?碇ピ? 0011111100111111101101111111001010100101101001001010000111000111001111110011111110001111101010011100000100111111001111111011011111110010001111110011111111101110110110101010010110110100001111111100010011110110101001011101010000111111 3f3fb7f2a5a4a1c73f3f8fa9c13f3fb7f23f3feedaa5b43fc4f6a5d43f
UTF-8 룵엑健イ’룶햶æ룵엑健⒝룶釐ゴ룫碇ピ룵 1110101110100011101101011110110010010111100100011110010110000001101001011110001110000010101001001110001010000000100110011110101110100011101101101110110110010110101101101100001110100110111010111010001110110101111011001001011110010001111001011000000110100101111000101001001010011101111010111010001110110110111010011000011110010000111000111000001010110100111010111010001110101011111001111010001010000111111000111000001110010100111010111010001110110101 eba3b5ec9791e581a5e382a4e28099eba3b6ed96b6c3a6eba3b5ec9791e581a5e2929deba3b6e98790e382b4eba3abe7a287e38394eba3b5
UHC 룵엑健イ’룶햶æ룵엑健⒝룶釐ゴ룫碇ピ룵 1000111110101010101111111010001011001011111011011010101110100100101000011010111110001111101010111100000110001111101010011010000110001111101010101011111110100010110010111110110110101001110011101000111110101011110101111110110110101011101101001000111110100010111011111110110110101011110101001000111110101010 8faabfa2cbedaba4a1af8fabc18fa9a18faabfa2cbeda9ce8fabd7edabb48fa2efedabd48faa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)