To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????i??????????iB 0011111100111111001111110011111100111111001111110011111100111111001111110011111101101001001111110011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f3f6942
SJIS-WIN ??????????i??????????iB 0011111100111111001111110011111100111111001111110011111100111111001111110011111101101001001111110011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f3f6942
EUC-JP ??????????i??????????iB 0011111100111111001111110011111100111111001111110011111100111111001111110011111101101001001111110011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f3f6942
UTF-8 챕챤횈채횆쨌책챕혦짭i챕챤횈채횆쨌책챕혦짭iB 111011001011000110010101111011001011000110100100111011011001101010001000111011001011000110000100111011011001101010000110111011001010100010001100111011001011000110000101111011001011000110010101111011011001100010100110111011001010011110101101011010011110110010110001100101011110110010110001101001001110110110011010100010001110110010110001100001001110110110011010100001101110110010101000100011001110110010110001100001011110110010110001100101011110110110011000101001101110110010100111101011010110100101000010 ecb195ecb1a4ed9a88ecb184ed9a86eca88cecb185ecb195ed98a6eca7ad69ecb195ecb1a4ed9a88ecb184ed9a86eca88cecb185ecb195ed98a6eca7ad6942
UHC 챕챤횈채횆쨌책챕혦짭i챕챤횈채횆쨌책챕혦짭iB 11000011101010011100001110101110110000111000011011000011101001001100001110000100110000101011011111000011101001011100001110101001110000101000111011000010101011000110100111000011101010011100001110101110110000111000011011000011101001001100001110000100110000101011011111000011101001011100001110101001110000101000111011000010101011000110100101000010 c3a9c3aec386c3a4c384c2b7c3a5c3a9c28ec2ac69c3a9c3aec386c3a4c384c2b7c3a5c3a9c28ec2ac6942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)