To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8宜??音??馭???繞???塋??苑 1110000110011111001111111000001001010111100010110101100000111111001111111000100110111001001111110011111111101001011001100011111100111111001111111110001110000101001111110011111100111111100110101100100000111111001111111000100110010001 e19f3f82578b583f3f89b93f3fe9663f3f3fe3853f3f3f9ac83f3f8991
EUC-JP 癲?8宜??音??馭???繞???塋??苑 1110001010100001001111111010001110111000101101011011100100111111001111111011001010111011001111110011111111110001110001110011111100111111001111111110010111100101001111110011111100111111110101001100101000111111001111111011000111110001 e2a13fa3b8b5b93f3fb2bb3f3ff1c73f3f3fe5e53f3f3fd4ca3f3fb1f1
UTF-8 癲쒕8宜룩눧音섎쇀馭귂뗫쭦繞섎맧큔塋딅뿦苑 111001111001100110110010111011001001001010010101111011111011110010011000111001011010111010011100111010111010001110101001111010111000100010100111111010011001111110110011111011001000010010001110111011001000011110000000111010011010011010101101111010101011011110000010111010111001011110101011111011001010110110100110111001111011100110011110111011001000010010001110111010111010011110100111111011011000000110010100111001011010000110001011111010111001010010000101111010111011111110100110111010001000101110010001 e799b2ec9295efbc98e5ae9ceba3a9eb88a7e99fb3ec848eec8780e9a6adeab782eb97abecada6e7b99eec848eeba7a7ed8194e5a18beb9485ebbfa6e88b91
UHC 癲쒕8宜룩눧音섎쇀馭귂뗫쭦繞섎맧큔塋딅뿦苑 111011111010011010011100111010111010001110111000111010111111000110110111111010001000011110111110111010111110010110011000111010111001100110110100111001011101111110000010110100011000101111101011101001111001101011101001101001001001100011101011100100001011000011000101101001101110011110101011100010101110101110010111101001101110101010111101 efa69ceba3b8ebf1b7e887beebe598eb99b4e5df82d18beba79ae9a498eb90b0c5a6e7ab8aeb97a6eabd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)