To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????BF 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100001001000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4246
SJIS-WIN ?????????????????????BF 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100001001000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4246
EUC-JP ?????????????????????BF 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100001001000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4246
UTF-8 챦쩍쩐챦쩍짭챕혴짬챦쩍쩐챦쩍짚챔혪혢챕혩째BF 1110110010110001101001101110110010101001100011011110110010101001100100001110110010110001101001101110110010101001100011011110110010100111101011011110110010110001100101011110110110011000101101001110110010100111101011001110110010110001101001101110110010101001100011011110110010101001100100001110110010110001101001101110110010101001100011011110110010100111100110101110110010110001100101001110110110011000101010101110110110011000101000101110110010110001100101011110110110011000101010011110110010100111101110000100001001000110 ecb1a6eca98deca990ecb1a6eca98deca7adecb195ed98b4eca7acecb1a6eca98deca990ecb1a6eca98deca79aecb194ed98aaed98a2ecb195ed98a9eca7b84246
UHC 챦쩍쩐챦쩍짭챕혴짬챦쩍쩐챦쩍짚챔혪혢챕혩째BF 1100001110101111110000101011110111000010101111101100001110101111110000101011110111000010101011001100001110101001110000101001101111000010101010111100001110101111110000101011110111000010101111101100001110101111110000101011110111000010101001001100001110101000110000101001001011000010100010111100001110101001110000101001000111000010101100000100001001000110 c3afc2bdc2bec3afc2bdc2acc3a9c29bc2abc3afc2bdc2bec3afc2bdc2a4c3a8c292c28bc3a9c291c2b04246

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)