To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 챘짧쨘챘짬혡챘짧쨘챘짬혢챘짬혗챘짧쩌챘짯혘챘짬 111011001011000110011000111011001010011110100111111011001010100010011000111011001011000110011000111011001010011110101100111011011001100010100001111011001011000110011000111011001010011110100111111011001010100010011000111011001011000110011000111011001010011110101100111011011001100010100010111011001011000110011000111011001010011110101100111011011001100010010111111011001011000110011000111011001010011110100111111011001010100110001100111011001011000110011000111011001010011110101111111011011001100010011000111011001011000110011000111011001010011110101100 ecb198eca7a7eca898ecb198eca7aced98a1ecb198eca7a7eca898ecb198eca7aced98a2ecb198eca7aced9897ecb198eca7a7eca98cecb198eca7afed9898ecb198eca7ac
UHC 챘짧쨘챘짬혡챘짧쨘챘짬혢챘짬혗챘짧쩌챘짯혘챘짬 11000011101010111100001010101010110000101011101011000011101010111100001010101011110000101000101011000011101010111100001010101010110000101011101011000011101010111100001010101011110000101000101111000011101010111100001010101011110000101000001011000011101010111100001010101010110000101011110011000011101010111100001010101101110000101000001111000011101010111100001010101011 c3abc2aac2bac3abc2abc28ac3abc2aac2bac3abc2abc28bc3abc2abc282c3abc2aac2bcc3abc2adc283c3abc2ab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)