To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 罌??┓???逸??矣??遙κ????喩??^ 111000111010000000111111001111111000010010101101001111110011111100111111100010001110110100111111001111111110000111100001001111110011111111101010101000011000001111001000001111110011111100111111001111111001101001100111001111110011111101011110 e3a03f3f84ad3f3f3f88ed3f3fe1e13f3feaa183c83f3f3f3f9a673f3f5e
EUC-JP 罌??┓???逸??矣??遙κ????喩??^ 111001101010001000111111001111111010100010101111001111110011111100111111101100001110111100111111001111111110001011100011001111110011111111110100101000111010011011001010001111110011111100111111001111111101001111001000001111110011111101011110 e6a23f3fa8af3f3f3fb0ef3f3fe2e33f3ff4a3a6ca3f3f3f3fd3c83f3f5e
UTF-8 罌뾔룹┓劣믨쒀逸사독矣먭쾲遙κ낟利꿰독喩쏇뮋^ 111001111011110110001100111010111011111010010100111010111010001110111001111000101001010010010011111011111010011010011101111010111010111110101000111011001001001010000000111010011000000010111000111011001000001010101100111010111000111110000101111001111001111110100011111010111010100010101101111011001011111010110010111010011000000110011001110011101011101011101011100000101001111111101111101001111001110111101010101111111011000011101011100011111000010111100101100101101010100111101100100011111000011111101011101011101000101101011110 e7bd8cebbe94eba3b9e29493efa69debafa8ec9280e980b8ec82aceb8f85e79fa3eba8adecbeb2e98199cebaeb829fefa79deabfb0eb8f85e596a9ec8f87ebae8b5e
UHC 罌뾔룹┓劣믨쒀逸사독矣먭쾲遙κ낟利꿰독喩쏇뮋^ 111001011010001010111011110011101011011111101100101001101010111111100110111010111001001011101010101111101010110011101100111011111011101111100111101101011011011011101011111110001001000011101010101100101000100011101001101010111010010111101010101100111010111011101100101001101011001011100111101101011011011011101010111001111001101111101101100100101001100101011110 e5a2bbceb7eca6afe6eb92eabeacecefbbe7b5b6ebf890eab288e9aba5eab3aeeca6b2e7b5b6eae79bed92995e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)