To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????gB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110011101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6742
SJIS-WIN ?????????????????????gB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110011101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6742
EUC-JP ?????????????????????gB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110011101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6742
UTF-8 챌쩐짤챘혻짠챘짚째찾혚짰챕혧혘챠혨짜챙째짬gB 1110110010110001100011001110110010101001100100001110110010100111101001001110110010110001100110001110110110011000101110111110110010100111101000001110110010110001100110001110110010100111100110101110110010100111101110001110110010110000101111101110110110011000100110101110110010100111101100001110110010110001100101011110110110011000101001111110110110011000100110001110110010110001101000001110110110011000101010001110110010100111100111001110110010110001100110011110110010100111101110001110110010100111101011000110011101000010 ecb18ceca990eca7a4ecb198ed98bbeca7a0ecb198eca79aeca7b8ecb0beed989aeca7b0ecb195ed98a7ed9898ecb1a0ed98a8eca79cecb199eca7b8eca7ac6742
UHC 챌쩐짤챘혻짠챘짚째찾혚짰챕혧혘챠혨짜챙째짬gB 1100001110100111110000101011111011000010101010011100001110101011110000101010000011000010101001111100001110101011110000101010010011000010101100001100001110100011110000101000010111000010101011101100001110101001110000101000111111000010100000111100001110101101110000101001000011000010101001011100001110101100110000101011000011000010101010110110011101000010 c3a7c2bec2a9c3abc2a0c2a7c3abc2a4c2b0c3a3c285c2aec3a9c28fc283c3adc290c2a5c3acc2b0c2ab6742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)