What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????F???????????? 00111111001111110011111100111111001111110011111101000110001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f463f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癌先昻善洩?F癌先昻善洩腺癌先昻善洩屑 100010101110000010010000111001101111101011010000100100010101000010001001011010110011111101000110100010101110000010010000111001101111101011010000100100010101000010001001011010111001000101000010100010101110000010010000111001101111101011010000100100010101000010001001011010111000101111111011 8ae090e6fad09150896b3f468ae090e6fad09150896b91428ae090e6fad09150896b8bfb
EUC-JP 癌先?善洩啞F癌先?善洩腺癌先?善洩屑 1011010011100010110000001110100000111111110000011011000110110001110011001000111110110101111000000100011010110100111000101100000011101000001111111100000110110001101100011100110011000001101000111011010011100010110000001110100000111111110000011011000110110001110011001011011011111101 b4e2c0e83fc1b1b1cc8fb5e046b4e2c0e83fc1b1b1ccc1a3b4e2c0e83fc1b1b1ccb6fd
UTF-8 癌先昻善洩啞F癌先昻善洩腺癌先昻善洩屑 11100111100110011000110011100101100001011000100011100110100110001011101111100101100101101000010011100110101101001010100111100101100101011001111001000110111001111001100110001100111001011000010110001000111001101001100010111011111001011001011010000100111001101011010010101001111010001000010110111010111001111001100110001100111001011000010110001000111001101001100010111011111001011001011010000100111001101011010010101001111001011011000110010001 e7998ce58588e698bbe59684e6b4a9e5959e46e7998ce58588e698bbe59684e6b4a9e885bae7998ce58588e698bbe59684e6b4a9e5b191
UHC 癌先昻善洩啞F癌先昻善洩腺癌先昻善洩屑 11100100110111111110000010111011111001001110100111100000101111001110000011011101111001001010111101000110111001001101111111100000101110111110010011101001111000001011110011100000110111011110000011001101111001001101111111100000101110111110010011101001111000001011110011100000110111011110000011011010 e4dfe0bbe4e9e0bce0dde4af46e4dfe0bbe4e9e0bce0dde0cde4dfe0bbe4e9e0bce0dde0da

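SJIS-Win, EUC-JP: legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (hexadecimal 3f) in the table.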
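
A conversion like the one in the table can be approximated in a few lines of Python. This is a minimal sketch, not the code behind this page: the names CHARSETS, to_bit_strings, and convert are hypothetical, and the mapping from the table's labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) is an assumption. Characters a charset cannot represent are replaced with "?" (3f), as in the table, but Python's codecs may differ from this page for characters outside the core sets.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("F").items():
        print(label, binary, hexadecimal)
```

For the single letter "F", every charset listed yields the one byte 46 (binary 01000110), which is why "F" looks the same in every row of the table while the surrounding characters do not.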