To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 永??寃??柔κ?午???永??泣????? 1000100101101001001111110011111110011011100000110011111100111111100011110101111110000011110010000011111110001100110111110011111100111111001111111000100101101001001111110011111110001011100000110011111100111111001111110011111100111111 89693f3f9b833f3f8f5f83c83f8cdf3f3f3f89693f3f8b833f3f3f3f3f
EUC-JP 永??寃??柔κ?午???永??泣??洹?? 10110001110010100011111100111111110101011110001100111111001111111011110111000000101001101100101000111111101110001110000100111111001111110011111110110001110010100011111100111111101101011110001100111111001111111000111111000111101110100011111100111111 b1ca3f3fd5e33f3fbdc0a6ca3fb8e13f3f3fb1ca3f3fb5e33f3f8fc7ba3f3f
UTF-8 永띔퇊寃쏃죲柔κ쿆午닌띿돁永띔랬泣먪독洹쏄쿆 1110011010110000101110001110101110011101100101001110110110000111100010101110010110101111100000111110110010001111100000111110110010100011101100101110011010011111100101001100111010111010111011001011111110000110111001011000110110001000111010111000101110001100111010111001110110111111111010111000111110000001111001101011000010111000111010111001110110010100111010111001111010101100111001101011001110100011111010111010100010101010111010111000111110000101111001101011010010111001111011001000111110000100111011001011111110000110 e6b0b8eb9d94ed878ae5af83ec8f83eca3b2e69f94cebaecbf86e58d88eb8b8ceb9dbfeb8f81e6b0b8eb9d94eb9eace6b3a3eba8aaeb8f85e6b4b9ec8f84ecbf86
UHC 永띔퇊寃쏃죲柔κ쿆午닌띿돁永띔랬泣먪독洹쏄쿆 1110011110110101101101101110101010110111100110111110101010110010100110111110100110100001100011011110101011110101101001011110101010110010100110111110011111101101101101001101000110001101111011001000100110010100111001111011010110110110111010101011011110101000111010111110100010010000111001111011010110110110111010101011011110011011111010101011001010011011 e7b5b6eab79beab29be9a18deaf5a5eab29be7edb4d18dec8994e7b5b6eab7a8ebe890e7b5b6eab79beab29b

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)