To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 汁???泣????遵????泣????遵? 1000111101100000001111110011111100111111100010111000001100111111001111110011111100111111100011111000010100111111001111110011111100111111100010111000001100111111001111110011111100111111100011111000010100111111 8f603f3f3f8b833f3f3f3f8f853f3f3f3f8b833f3f3f3f8f853f
EUC-JP 汁???泣????遵????泣????遵? 1011110111000001001111110011111100111111101101011110001100111111001111110011111100111111101111011110010100111111001111110011111100111111101101011110001100111111001111110011111100111111101111011110010100111111 bdc13f3f3fb5e33f3f3f3fbde53f3f3f3fb5e33f3f3f3fbde53f
UTF-8 汁흗렓렜泣쇰웃渽렜遵띕웃渽렜泣쇰웃渽렜遵동 111001101011000110000001111011011001110110010111111010111010000010010011111010111010000010011100111001101011001110100011111011001000011110110000111011001001101110000011111001101011100010111101111010111010000010011100111010011000000110110101111010111001110110010101111011001001101110000011111001101011100010111101111010111010000010011100111001101011001110100011111011001000011110110000111011001001101110000011111001101011100010111101111010111010000010011100111010011000000110110101111010111000111110011001 e6b181ed9d97eba093eba09ce6b3a3ec87b0ec9b83e6b8bdeba09ce981b5eb9d95ec9b83e6b8bdeba09ce6b3a3ec87b0ec9b83e6b8bdeba09ce981b5eb8f99
UHC 汁흗렓렜泣쇰웃渽렜遵띕웃渽렜泣쇰웃渽렜遵동 111100011111000011001000111010011000111010101000100011101010111011101011111010001011110011101011101111111111010011101110101010101000111010101110111100011110010110110110111010111011111111110100111011101010101010001110101011101110101111101000101111001110101110111111111101001110111010101010100011101010111011110001111001011011010110111111 f1f0c8e98ea88eaeebe8bcebbff4eeaa8eaef1e5b6ebbff4eeaa8eaeebe8bcebbff4eeaa8eaef1e5b5bf

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
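
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as is the choice to replace unencodable characters with "?" (0x3f), which matches the 3f bytes seen in the ISO-8859-1 row. The function name encode_text is hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_text(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        bits, hex_string = encode_text("汁", codec)
        print(label, bits, hex_string)
```

For the single character 汁, this yields e6b181 in UTF-8, 8f60 in CP932, bdc1 in EUC-JP, and 3f in ISO-8859-1, which agrees with the leading bytes of the corresponding rows above.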