What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 毅貊??珥????祭猫毅貊??珥????祭杳^ 100010110100001011100110101110110011111100111111111000001110000000111111001111110011111100111111100011011101010110010100010011001000101101000010111001101011101100111111001111111110000011100000001111110011111100111111001111111000110111010101100111011110000001011110 8b42e6bb3f3fe0e03f3f3f3f8dd5944c8b42e6bb3f3fe0e03f3f3f3f8dd59de05e
EUC-JP 毅貊??珥????祭猫毅貊??珥????祭杳^ 101101011010001111101100101111010011111100111111111000001110001000111111001111110011111100111111101110101101011111000111101011011011010110100011111011001011110100111111001111111110000011100010001111110011111100111111001111111011101011010111110110101110001001011110 b5a3ecbd3f3fe0e23f3f3f3fbad7c7adb5a3ecbd3f3fe0e23f3f3f3fbad7dae25e
UTF-8 毅貊렎렠珥렮諪대ㄼ祭猫毅貊렎렠珥렮諪대ㄼ祭杳^ 11100110101011111000010111101000101100101000101011101011101000001000111011101011101000001010000011100111100011111010010111101011101000001010111011101000101010111010101011101011100011001000000011100011100001001011110011100111101001011010110111100111100011001010101111100110101011111000010111101000101100101000101011101011101000001000111011101011101000001010000011100111100011111010010111101011101000001010111011101000101010111010101011101011100011001000000011100011100001001011110011100111101001011010110111100110100111011011001101011110 e6af85e8b28aeba08eeba0a0e78fa5eba0aee8abaaeb8c80e384bce7a5ade78cabe6af85e8b28aeba08eeba0a0e78fa5eba0aee8abaaeb8c80e384bce7a5ade69db35e
UHC 毅貊렎렠珥렮諪대ㄼ祭猫毅貊렎렠珥렮諪대ㄼ祭杳^ 111010111111011011011000111001111000111010100100100011101011000111101100101101001000111010111011111011111111010110110100111010111010010010101100111100001010111011011001110111101110101111110110110110001110011110001110101001001000111010110001111011001011010010001110101110111110111111110101101101001110101110100100101011001111000010101110110110011101110001011110 ebf6d8e78ea48eb1ecb48ebbeff5b4eba4acf0aed9deebf6d8e78ea48eb1ecb48ebbeff5b4eba4acf0aed9dc5e

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f).
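
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function names `to_bit_strings` and `encode_table` are hypothetical. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (3f), which matches what the table shows.

```python
# Charset labels follow the table above. The Python codec chosen for each
# label is an assumption, not something this page documents.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset of CODECS, keyed by charset label."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("猫^").items():
        print(label, binary, hexadecimal)
```

For example, `encode_table("祭")["EUC-JP"]` gives the hexadecimal string `bad7`, the same byte pair that appears for 祭 in the EUC-JP row above.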