To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 챌혮짱챘혻혳챈짧혥챘혻짧책혢혯챘쨍혘챘혡혚 111011001011000110001100111011011001100010101110111011001010011110110001111011001011000110011000111011011001100010111011111011011001100010110011111011001011000110001000111011001010011110100111111011011001100010100101111011001011000110011000111011011001100010111011111011001010011110100111111011001011000110000101111011011001100010100010111011011001100010101111111011001011000110011000111011001010100010001101111011011001100010011000111011001011000110011000111011011001100010100001111011011001100010011010 ecb18ced98aeeca7b1ecb198ed98bbed98b3ecb188eca7a7ed98a5ecb198ed98bbeca7a7ecb185ed98a2ed98afecb198eca88ded9898ecb198ed98a1ed989a
UHC 챌혮짱챘혻혳챈짧혥챘혻짧책혢혯챘쨍혘챘혡혚 110000111010011111000010100101011100001010101111110000111010101111000010101000001100001010011010110000111010011011000010101010101100001010001101110000111010101111000010101000001100001010101010110000111010010111000010100010111100001010010110110000111010101111000010101110001100001010000011110000111010101111000010100010101100001010000101 c3a7c295c2afc3abc2a0c29ac3a6c2aac28dc3abc2a0c2aac3a5c28bc296c3abc2b8c283c3abc28ac285

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)