To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}Y????????}bE 001111110011111100111111001111110011111100111111001111110011111101111101010110010011111100111111001111110011111100111111001111110011111100111111011111010110001001000101 3f3f3f3f3f3f3f3f7d593f3f3f3f3f3f3f3f7d6245
SJIS-WIN 薰サ迯庸薰サ迯与}Y薰サ迯庸薰サ迯与}bE 111110111001111010111011111001111000110110010111011001101111101110011110101110111110011110001101100101110101111001111101010110011111101110011110101110111110011110001101100101110110011011111011100111101011101111100111100011011001011101011110011111010110001001000101 fb9ebbe78d9766fb9ebbe78d975e7d59fb9ebbe78d9766fb9ebbe78d975e7d6245
EUC-JP ?サ迯庸?サ迯与}Y?サ迯庸?サ迯与}bE 001111111000111010111011111011011110110111001101110001110011111110001110101110111110110111101101110011011011111101111101010110010011111110001110101110111110110111101101110011011100011100111111100011101011101111101101111011011100110110111111011111010110001001000101 3f8ebbededcdc73f8ebbededcdbf7d593f8ebbededcdc73f8ebbededcdbf7d6245
UTF-8 薰サ迯庸薰サ迯与}Y薰サ迯庸薰サ迯与}bE 1110100010010110101100001110111110111101101110111110100010111111101011111110010110111010101110001110100010010110101100001110111110111101101110111110100010111111101011111110010010111000100011100111110101011001111010001001011010110000111011111011110110111011111010001011111110101111111001011011101010111000111010001001011010110000111011111011110110111011111010001011111110101111111001001011100010001110011111010110001001000101 e896b0efbdbbe8bfafe5bab8e896b0efbdbbe8bfafe4b88e7d59e896b0efbdbbe8bfafe5bab8e896b0efbdbbe8bfafe4b88e7d6245
UHC 薰??庸薰???}Y薰??庸薰???}bE 111111011011100100111111001111111110100110111100111111011011100100111111001111110011111101111101010110011111110110111001001111110011111111101001101111001111110110111001001111110011111100111111011111010110001001000101 fdb93f3fe9bcfdb93f3f3f7d59fdb93f3fe9bcfdb93f3f3f7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)