What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 潁??泣??醫??????蘖??揖?????厄 1001111111110001001111110011111110001011100000110011111100111111111001111100111000111111001111110011111100111111001111110011111110011111010100000011111100111111100101110100101100111111001111110011111100111111001111111001011011101111 9ff13f3f8b833f3fe7ce3f3f3f3f3f3f9f503f3f974b3f3f3f3f3f96ef
EUC-JP 潁??泣??醫??????蘖??揖?????厄 1101111011110011001111110011111110110101111000110011111100111111111011101101000000111111001111110011111100111111001111110011111111011101101100010011111100111111110011011010110000111111001111110011111100111111001111111100110011110001 def33f3fb5e33f3feed03f3f3f3f3f3fddb13f3fcdac3f3f3f3f3fccf1
UTF-8 潁뺣톪泣됪릸醫묒뒃裂┧우돺蘖뽰뼲揖좄쫨類앸쭇厄 111001101011110110000001111010111011101010100011111011011000011010101010111001101011001110100011111010111001000010101010111010111010011010111000111010011000011010101011111010111010110010010010111010111001001010000011111011111010011010100000111000101001010010100111111011001001101010110000111010111000111110111010111010001001100010010110111010111011110110110000111010111011110010110010111001101000111110010110111011001010001010000100111011001010101110101000111011111010011110010000111011001001010110111000111011001010110110000111111001011000111010000100 e6bd81ebbaa3ed86aae6b3a3eb90aaeba6b8e986abebac92eb9283efa6a0e294a7ec9ab0eb8fbae89896ebbdb0ebbcb2e68f96eca284ecaba8efa790ec95b8ecad87e58e84
UHC 潁뺣톪泣됪릸醫묒뒃裂┧우돺蘖뽰뼲揖좄쫨類앸쭇厄 11100111101110001001010111101011101101111000001011101011111010001000100111100110100100001001011011101100101000101001000111101100100010101000000111100110111100011010011011001110101111111110110010001001101111011110010111101110100101101110110010010110101101011110101111100111101000001110100010100110100000011110101110111010100111011110101110100111100000111110010011111000 e7b895ebb782ebe889e69096eca291ec8a81e6f1a6cebfec89bde5ee96ec96b5ebe7a0e8a681ebba9deba783e4f8

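SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (hexadecimal 3f) in the table above.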
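
The same kind of conversion can be sketched in Python. This is only an approximation of what the page does, not its actual implementation: the codec names below (cp932 for SJIS-Win, euc_jp for EUC-JP, cp949 for UHC) are assumptions about the closest Python equivalents, and the helper name encode_table is hypothetical. Characters a charset cannot represent are replaced with "?" (3f), which matches the results shown in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: closest Python codec to SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's codec for UHC
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset.

    Hypothetical helper for illustration only.
    """
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

Edge cases may differ between this sketch and the page, since vendor extensions to Shift_JIS are not mapped identically by every library.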