What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????CM 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100001101001101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f434d
SJIS-WIN ?????????????????????CM 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100001101001101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f434d
EUC-JP ?????????????????????CM 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100001101001101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f434d
UTF-8 챘짙쨉챠혫짝찾혗혚챘짙쨋채쨍쨔챙쨋쨀찾혘짚CM 1110110010110001100110001110110010100111100110011110110010101000100010011110110010110001101000001110110110011000101010111110110010100111100111011110110010110000101111101110110110011000100101111110110110011000100110101110110010110001100110001110110010100111100110011110110010101000100010111110110010110001100001001110110010101000100011011110110010101000100101001110110010110001100110011110110010101000100010111110110010101000100000001110110010110000101111101110110110011000100110001110110010100111100110100100001101001101 ecb198eca799eca889ecb1a0ed98abeca79decb0beed9897ed989aecb198eca799eca88becb184eca88deca894ecb199eca88beca880ecb0beed9898eca79a434d
UHC 챘짙쨉챠혫짝찾혗혚챘짙쨋채쨍쨔챙쨋쨀찾혘짚CM 1100001110101011110000101010001111000010101101011100001110101101110000101001001111000010101001101100001110100011110000101000001011000010100001011100001110101011110000101010001111000010101101101100001110100100110000101011100011000010101110011100001110101100110000101011011011000010101100111100001110100011110000101000001111000010101001000100001101001101 c3abc2a3c2b5c3adc293c2a6c3a3c282c285c3abc2a3c2b6c3a4c2b8c2b9c3acc2b6c2b3c3a3c283c2a4434d

SJIS-Win, EUC-JP: Classic character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
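
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The helper names (encode_row, encode_table) are hypothetical, and the mapping of the table's charset labels to Python codec names (SJIS-WIN to cp932, UHC to cp949) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the ISO-8859-1, SJIS-WIN and EUC-JP rows above show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns every unencodable character into b"?" (0x3f).
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {name: encode_row(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("CM").items():
        print(name, bits, hex_string)
```

For the ASCII input "CM", every charset listed yields the same bytes (hex 434d), which matches the tail of each row in the table. A Japanese character such as U+3042 encodes to different bytes under cp932 and euc_jp.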