To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 甯嫉甯治甯嫉サシ甯嫉甯治甯嫉サシ^ 111110101010100010001110101110011111101010101000100011101010000111111010101010001000111010111001111100001011001110111011101111001111101010101000100011101011100111111010101010001000111010100001111110101010100010001110101110011111000010110011101110111011110001011110 faa88eb9faa88ea1faa88eb9f0b3bbbcfaa88eb9faa88ea1faa88eb9f0b3bbbc5e
EUC-JP 甯嫉甯治甯嫉?サシ甯嫉甯治甯嫉?サシ^ 1000111111001101101010101011110010111011100011111100110110101010101111001010001110001111110011011010101010111100101110110011111110001110101110111000111010111100100011111100110110101010101111001011101110001111110011011010101010111100101000111000111111001101101010101011110010111011001111111000111010111011100011101011110001011110 8fcdaabcbb8fcdaabca38fcdaabcbb3f8ebb8ebc8fcdaabcbb8fcdaabca38fcdaabcbb3f8ebb8ebc5e
UTF-8 甯嫉甯治甯嫉サシ甯嫉甯治甯嫉サシ^ 11100111100101001010111111100101101010111000100111100111100101001010111111100110101100101011101111100111100101001010111111100101101010111000100111101110100000011011001011101111101111011011101111101111101111011011110011100111100101001010111111100101101010111000100111100111100101001010111111100110101100101011101111100111100101001010111111100101101010111000100111101110100000011011001011101111101111011011101111101111101111011011110001011110 e794afe5ab89e794afe6b2bbe794afe5ab89ee81b2efbdbbefbdbce794afe5ab89e794afe6b2bbe794afe5ab89ee81b2efbdbbefbdbc5e
UHC ?嫉?治?嫉????嫉?治?嫉???^ 00111111111100101110110000111111111101101011110100111111111100101110110000111111001111110011111100111111111100101110110000111111111101101011110100111111111100101110110000111111001111110011111101011110 3ff2ec3ff6bd3ff2ec3f3f3f3ff2ec3ff6bd3ff2ec3f3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)