To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 貉ソ螻。貉ソ蜿ア貉ソ蜿ア貉ソ豎先ケソ霆頑ケ 1110011010111001101111111110010110110001101000011110011010111001101111111110010110001111101100011110011010111001101111111110010110001111101100011110011010111001101111111110011010110001100100001110011010111001101111111110100010111011100010101110011010111001 e6b9bfe5b1a1e6b9bfe58fb1e6b9bfe58fb1e6b9bfe6b190e6b9bfe8bb8ae6b9
EUC-JP 貉ソ螻。貉ソ蜿ア貉ソ蜿ア貉ソ豎先ケソ霆頑ケ 111011001011101110001110101111111110101010110011100011101010000111101100101110111000111010111111111010011110111110001110101100011110110010111011100011101011111111101001111011111000111010110001111011001011101110001110101111111110110010110011110000001110100010001110101110011000111010111111111100001011110110110100111010001000111010111001 ecbb8ebfeab38ea1ecbb8ebfe9ef8eb1ecbb8ebfe9ef8eb1ecbb8ebfecb3c0e88eb98ebff0bdb4e88eb9
UTF-8 貉ソ螻。貉ソ蜿ア貉ソ蜿ア貉ソ豎先ケソ霆頑ケ 111010001011001010001001111011111011110110111111111010001001111010111011111011111011110110100001111010001011001010001001111011111011110110111111111010001001110010111111111011111011110110110001111010001011001010001001111011111011110110111111111010001001110010111111111011111011110110110001111010001011001010001001111011111011110110111111111010001011000110001110111001011000010110001000111011111011110110111001111011111011110110111111111010011001110010000110111010011010000010010001111011111011110110111001 e8b289efbdbfe89ebbefbda1e8b289efbdbfe89cbfefbdb1e8b289efbdbfe89cbfefbdb1e8b289efbdbfe8b18ee58588efbdb9efbdbfe99c86e9a091efbdb9
UHC ???????????????先??霆頑? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111111000001011101100111111001111111110111111111101111010001101011100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fe0bb3f3feffde8d73f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)