What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????R????^[????R????^[^ 0011111100111111001111110011111101010010001111110011111100111111001111110101111001011011001111110011111100111111001111110101001000111111001111110011111100111111010111100101101101011110 3f3f3f3f523f3f3f3f5e5b3f3f3f3f523f3f3f3f5e5b5e
SJIS-WIN 薰ッ迹殤R薰ッ迹殤^[薰ッ迹殤R薰ッ迹殤^[^ 1111101110011110101011111110011110010001100111110110111001010010111110111001111010101111111001111001000110011111011011100101111001011011111110111001111010101111111001111001000110011111011011100101001011111011100111101010111111100111100100011001111101101110010111100101101101011110 fb9eafe7919f6e52fb9eafe7919f6e5e5bfb9eafe7919f6e52fb9eafe7919f6e5e5b5e
EUC-JP ?ッ迹殤R?ッ迹殤^[?ッ迹殤R?ッ迹殤^[^ 0011111110001110101011111110110111110001110111011100111101010010001111111000111010101111111011011111000111011101110011110101111001011011001111111000111010101111111011011111000111011101110011110101001000111111100011101010111111101101111100011101110111001111010111100101101101011110 3f8eafedf1ddcf523f8eafedf1ddcf5e5b3f8eafedf1ddcf523f8eafedf1ddcf5e5b5e
UTF-8 薰ッ迹殤R薰ッ迹殤^[薰ッ迹殤R薰ッ迹殤^[^ 11101000100101101011000011101111101111011010111111101000101111111011100111100110101011101010010001010010111010001001011010110000111011111011110110101111111010001011111110111001111001101010111010100100010111100101101111101000100101101011000011101111101111011010111111101000101111111011100111100110101011101010010001010010111010001001011010110000111011111011110110101111111010001011111110111001111001101010111010100100010111100101101101011110 e896b0efbdafe8bfb9e6aea452e896b0efbdafe8bfb9e6aea45e5be896b0efbdafe8bfb9e6aea452e896b0efbdafe8bfb9e6aea45e5b5e
UHC 薰?迹?R薰?迹?^[薰?迹?R薰?迹?^[^ 11111101101110010011111111101110111010010011111101010010111111011011100100111111111011101110100100111111010111100101101111111101101110010011111111101110111010010011111101010010111111011011100100111111111011101110100100111111010111100101101101011110 fdb93feee93f52fdb93feee93f5e5bfdb93feee93f52fdb93feee93f5e5b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
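
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` bytes in the table. Individual byte values may differ from the table where a codec implementation maps a character differently.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(format(byte, "08b") for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("R^[").items():
        print(label, binary, hexadecimal)
```

For pure ASCII input such as `R^[`, every charset listed here yields the same bytes (`52 5e 5b`), which is why those bytes appear unchanged in every row of the table. Differences only show up for characters outside ASCII.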