To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 永??泣η??⑥?永??揖??猿??永??陰?┠ 10001001011010010011111100111111100010111000001110000011110001010011111100111111100001110100010100111111100010010110100100111111001111111001011101001011001111110011111110001001100011100011111100111111100010010110100100111111001111111000100101000001001111111000010010110101 89693f3f8b8383c53f3f87453f89693f3f974b3f3f898e3f3f89693f3f89413f84b5
EUC-JP 永??泣η?洹??永??揖??猿??永??陰?┠ 1011000111001010001111110011111110110101111000111010011011000111001111111000111111000111101110100011111100111111101100011100101000111111001111111100110110101100001111110011111110110001111011100011111100111111101100011100101000111111001111111011000110100010001111111010100010110111 b1ca3f3fb5e3a6c73f8fc7ba3f3fb1ca3f3fcdac3f3fb1ee3f3fb1ca3f3fb1a23fa8b7
UTF-8 永띔퍜泣η독洹⑥댅永띔랬揖먪독猿볦돁永띔래陰울┠ 1110011010110000101110001110101110011101100101001110110110001101100111001110011010110011101000111100111010110111111010111000111110000101111001101011010010111001111000101001000110100101111010111000110010000101111001101011000010111000111010111001110110010100111010111001111010101100111001101000111110010110111010111010100010101010111010111000111110000101111001111000110010111111111010111011001110100110111010111000111110000001111001101011000010111000111010111001110110010100111010111001111010011000111010011001100110110000111011001001101010111000111000101001010010100000 e6b0b8eb9d94ed8d9ce6b3a3ceb7eb8f85e6b4b9e291a5eb8c85e6b0b8eb9d94eb9eace68f96eba8aaeb8f85e78cbfebb3a6eb8f81e6b0b8eb9d94eb9e98e999b0ec9ab8e294a0
UHC 永띔퍜泣η독洹⑥댅永띔랬揖먪독猿볦돁永띔래陰울┠ 111001111011010110110110111010101011101110010011111010111110100010100101111001111011010110110110111010101011011110101000111011001000100010101111111001111011010110110110111010101011011110101000111010111110011110010000111001111011010110110110111010101011101110010011111011001000100110010100111001111011010110110110111010101011011110100001111010111110010010111111111011111010011010110111 e7b5b6eabb93ebe8a5e7b5b6eab7a8ec88afe7b5b6eab7a8ebe790e7b5b6eabb93ec8994e7b5b6eab7a1ebe4bfefa6b7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)