To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????U}?????????U{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f557b5e
SJIS-WIN 猷??松⑨?而??U}猷??松⑨?而??U{^ 10010111010100010011111100111111100011111011110010000111010010000011111110001110101001110011111100111111010101010111110110010111010100010011111100111111100011111011110010000111010010000011111110001110101001110011111100111111010101010111101101011110 97513f3f8fbc87483f8ea73f3f557d97513f3f8fbc87483f8ea73f3f557b5e
EUC-JP 猷??松??而??U}猷??松??而??U{^ 1100110110110010001111110011111110111110101111100011111100111111101111001010100100111111001111110101010101111101110011011011001000111111001111111011111010111110001111110011111110111100101010010011111100111111010101010111101101011110 cdb23f3fbebe3f3fbca93f3f557dcdb23f3fbebe3f3fbca93f3f557b5e
UTF-8 猷뜻궠松⑨쫭而╉긽U}猷뜻궠松⑨쫭而╉긽U{^ 1110011110001100101101111110101110011100101110111110101010110110101000001110011010011101101111101110001010010001101010001110110010101011101011011110100010000000100011001110001010010101100010011110101010111000101111010101010101111101111001111000110010110111111010111001110010111011111010101011011010100000111001101001110110111110111000101001000110101000111011001010101110101101111010001000000010001100111000101001010110001001111010101011100010111101010101010111101101011110 e78cb7eb9cbbeab6a0e69dbee291a8ecabade8808ce29589eab8bd557de78cb7eb9cbbeab6a0e69dbee291a8ecabade8808ce29589eab8bd557b5e
UHC 猷뜻궠松⑨쫭而╉긽U}猷뜻궠松⑨쫭而╉긽U{^ 1110101110100011101101101110011010000010101100111110000111100110101010001110111110100110100001011110110010111011101001101110001110000011100000010101010101111101111010111010001110110110111001101000001010110011111000011110011010101000111011111010011010000101111011001011101110100110111000111000001110000001010101010111101101011110 eba3b6e682b3e1e6a8efa685ecbba6e38381557deba3b6e682b3e1e6a8efa685ecbba6e38381557b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)