To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????J??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010010100011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f4a3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???????????J??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010010100011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f4a3f3f3f3f3f3f3f3f3f3f42
EUC-JP ???????????J??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010010100011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f4a3f3f3f3f3f3f3f3f3f3f42
UTF-8 챨짼혦쨌챨쨋째챈챈쨍청J혦쨍챨쨋째챈혦쩔짜횧B 1110110010110001101010001110110010100111101111001110110110011000101001101110110010101000100011001110110010110001101010001110110010101000100010111110110010100111101110001110110010110001100010001110110010110001100010001110110010101000100011011110110010110010101011010100101011101101100110001010011011101100101010001000110111101100101100011010100011101100101010001000101111101100101001111011100011101100101100011000100011101101100110001010011011101100101010011001010011101100101001111001110011101101100110101010011101000010 ecb1a8eca7bced98a6eca88cecb1a8eca88beca7b8ecb188ecb188eca88decb2ad4aed98a6eca88decb1a8eca88beca7b8ecb188ed98a6eca994eca79ced9aa742
UHC 챨짼혦쨌챨쨋째챈챈쨍청J혦쨍챨쨋째챈혦쩔짜횧B 1100001110110000110000101011001011000010100011101100001010110111110000111011000011000010101101101100001010110000110000111010011011000011101001101100001010111000110000111011101101001010110000101000111011000010101110001100001110110000110000101011011011000010101100001100001110100110110000101000111011000010101111111100001010100101110000111001111001000010 c3b0c2b2c28ec2b7c3b0c2b6c2b0c3a6c3a6c2b8c3bb4ac28ec2b8c3b0c2b6c2b0c3a6c28ec2bfc2a5c39e42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)