To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 障蕎?障????長魄障蕎?障????長白^ 10001111111000011000101110111100001111111000111111100001001111110011111100111111001111111001001010110111111010011010111010001111111000011000101110111100001111111000111111100001001111110011111100111111001111111001001010110111100101001001001001011110 8fe18bbc3f8fe13f3f3f3f92b7e9ae8fe18bbc3f8fe13f3f3f3f92b794925e
EUC-JP 障蕎?障????長魄障蕎?障????長白^ 10111110111000111011011010111110001111111011111011100011001111110011111100111111001111111100010010111001111100101011000010111110111000111011011010111110001111111011111011100011001111110011111100111111001111111100010010111001110001111111001001011110 bee3b6be3fbee33f3f3f3fc4b9f2b0bee3b6be3fbee33f3f3f3fc4b9c7f25e
UTF-8 障蕎ㄺ障븅렟흩윌長魄障蕎ㄺ障븅렟흩윌長白^ 11101001100110101001110011101000100101011000111011100011100001001011101011101001100110101001110011101011101110001000010111101011101000001001111111101101100111011010100111101100100111001000110011101001100101011011011111101001101011011000010011101001100110101001110011101000100101011000111011100011100001001011101011101001100110101001110011101011101110001000010111101011101000001001111111101101100111011010100111101100100111001000110011101001100101011011011111100111100110011011110101011110 e99a9ce8958ee384bae99a9cebb885eba09fed9da9ec9c8ce995b7e9ad84e99a9ce8958ee384bae99a9cebb885eba09fed9da9ec9c8ce995b7e799bd5e
UHC 障蕎ㄺ障븅렟흩윌長魄障蕎ㄺ障븅렟흩윌長白^ 1110111010100001110011101111000010100100101010101110111010100001101110101110100110001110101100001100100011110000110000001010101011101101111111101101101111011110111011101010000111001110111100001010010010101010111011101010000110111010111010011000111010110000110010001111000011000000101010101110110111111110110110111101110001011110 eea1cef0a4aaeea1bae98eb0c8f0c0aaedfedbdeeea1cef0a4aaeea1bae98eb0c8f0c0aaedfedbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)