To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????v}B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011101100111110101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f767d42
SJIS-WIN 永??飮??猿??永??宜??惟ъ?v}B 10001001011010010011111100111111100111110101101000111111001111111000100110001110001111110011111110001001011010010011111100111111100010110101100000111111001111111000100011010010100001001000110000111111011101100111110101000010 89693f3f9f5a3f3f898e3f3f89693f3f8b583f3f88d2848c3f767d42
EUC-JP 永??飮??猿??永??宜??惟ъ?v}B 10110001110010100011111100111111110111011011101100111111001111111011000111101110001111110011111110110001110010100011111100111111101101011011100100111111001111111011000011010100101001111110110000111111011101100111110101000010 b1ca3f3fddbb3f3fb1ee3f3fb1ca3f3fb5b93f3fb0d4a7ec3f767d42
UTF-8 永띔퍜飮꿰독猿볦뒟永띔래宜듸쫵惟ъ춷v}B 1110011010110000101110001110101110011101100101001110110110001101100111001110100110100011101011101110101010111111101100001110101110001111100001011110011110001100101111111110101110110011101001101110101110010010100111111110011010110000101110001110101110011101100101001110101110011110100110001110010110101110100111001110101110010011101110001110110010101011101101011110011010000011100111111101000110001010111011001011011010110111011101100111110101000010 e6b0b8eb9d94ed8d9ce9a3aeeabfb0eb8f85e78cbfebb3a6eb929fe6b0b8eb9d94eb9e98e5ae9ceb93b8ecabb5e6839fd18aecb6b7767d42
UHC 永띔퍜飮꿰독猿볦뒟永띔래宜듸쫵惟ъ춷v}B 111001111011010110110110111010101011101110010011111010111110011010110010111001111011010110110110111010101011101110010011111011001000101010011011111001111011010110110110111010101011011110100001111010111111000110110101111011111010011010001100111010101110111010101100111011001010110110010011011101100111110101000010 e7b5b6eabb93ebe6b2e7b5b6eabb93ec8a9be7b5b6eab7a1ebf1b5efa68ceaeeacecad93767d42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)