What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????gB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110011101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6742
SJIS-WIN ?????????????????????gB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110011101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6742
EUC-JP ?????????????????????gB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110011101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6742
UTF-8 챌쨉쨋챗쨌혴챗짼혴챌혖혣책쩍짹챙짖혩챘챌혞gB 1110110010110001100011001110110010101000100010011110110010101000100010111110110010110001100101111110110010101000100011001110110110011000101101001110110010110001100101111110110010100111101111001110110110011000101101001110110010110001100011001110110110011000100101101110110110011000101000111110110010110001100001011110110010101001100011011110110010100111101110011110110010110001100110011110110010100111100101101110110110011000101010011110110010110001100110001110110010110001100011001110110110011000100111100110011101000010 ecb18ceca889eca88becb197eca88ced98b4ecb197eca7bced98b4ecb18ced9896ed98a3ecb185eca98deca7b9ecb199eca796ed98a9ecb198ecb18ced989e6742
UHC 챌쨉쨋챗쨌혴챗짼혴챌혖혣책쩍짹챙짖혩챘챌혞gB 1100001110100111110000101011010111000010101101101100001110101010110000101011011111000010100110111100001110101010110000101011001011000010100110111100001110100111110000101000000111000010100011001100001110100101110000101011110111000010101100011100001110101100110000101010001011000010100100011100001110101011110000111010011111000010100010000110011101000010 c3a7c2b5c2b6c3aac2b7c29bc3aac2b2c29bc3a7c281c28cc3a5c2bdc2b1c3acc2a2c291c3abc3a7c2886742

SJIS-Win, EUC-JP: Classic character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
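
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation. The function name to_bit_strings and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which resembles the 3f runs in the rows above.

```python
# Hypothetical mapping from the table's labels to Python codec names (an assumption).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "gB"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For the ASCII input "gB", every charset listed yields the same two bytes (67 42), which matches the last four hexadecimal digits of each row in the table.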