What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????v 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101110110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f76
SJIS-WIN ?????????????????????v 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101110110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f76
EUC-JP ?????????????????????v 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101110110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f76
UTF-8 책짬혙채쩌혦챌혘혞챌혛혡챘혻짯챔혗짖챘혻혯v 11101100101100011000010111101100101001111010110011101101100110001001100111101100101100011000010011101100101010011000110011101101100110001010011011101100101100011000110011101101100110001001100011101101100110001001111011101100101100011000110011101101100110001001101111101101100110001010000111101100101100011001100011101101100110001011101111101100101001111010111111101100101100011001010011101101100110001001011111101100101001111001011011101100101100011001100011101101100110001011101111101101100110001010111101110110 ecb185eca7aced9899ecb184eca98ced98a6ecb18ced9898ed989eecb18ced989bed98a1ecb198ed98bbeca7afecb194ed9897eca796ecb198ed98bbed98af76
UHC 책짬혙채쩌혦챌혘혞챌혛혡챘혻짯챔혗짖챘혻혯v 11000011101001011100001010101011110000101000010011000011101001001100001010111100110000101000111011000011101001111100001010000011110000101000100011000011101001111100001010000110110000101000101011000011101010111100001010100000110000101010110111000011101010001100001010000010110000101010001011000011101010111100001010100000110000101001011001110110 c3a5c2abc284c3a4c2bcc28ec3a7c283c288c3a7c286c28ac3abc2a0c2adc3a8c282c2a2c3abc2a0c29676

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
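
The conversion shown in the table can be approximated with a minimal Python sketch. It is not the tool's actual implementation, which is not given here. The helper name `encode_to_bits` is hypothetical, and the Python codec names `cp932` and `cp949` are assumed to correspond to SJIS-WIN and UHC. Passing `errors="replace"` substitutes `?` (0x3f) for any character the target charset cannot represent, which is consistent with the runs of `3f` in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
from typing import Tuple


def encode_to_bits(text: str, charset: str) -> Tuple[str, str]:
    """Encode text in the given charset; return (binary string, hex string).

    Hypothetical helper for illustration only.
    """
    # Characters that cannot be represented in the charset become "?" (0x3f).
    raw = text.encode(charset, errors="replace")
    # Render each byte as 8 binary digits, concatenated without separators.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


# Assumed mapping from the tool's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "v"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bits(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For the ASCII letter `v`, every charset listed yields the single byte `76` (binary `01110110`), matching the last byte of each row in the table.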