To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 永??飮??????η?猿??永??宥??^ 10001001011010010011111100111111100111110101101000111111001111110011111100111111001111110011111110000011110001010011111110001001100011100011111100111111100010010110100100111111001111111001011101000111001111110011111101011110 89693f3f9f5a3f3f3f3f3f3f83c53f898e3f3f89693f3f97473f3f5e
EUC-JP 永??飮??沅??馹η?猿??永??宥??^ 1011000111001010001111110011111111011101101110110011111100111111100011111100011011101001001111110011111110001111111010011010000110100110110001110011111110110001111011100011111100111111101100011100101000111111001111111100110110101000001111110011111101011110 b1ca3f3fddbb3f3f8fc6e93f3f8fe9a1a6c73fb1ee3f3fb1ca3f3fcda83f3f5e
UTF-8 永띔퍜飮꿰독沅쎈떧馹η독猿볦댉永띔떽宥삵뀟^ 111001101011000010111000111010111001110110010100111011011000110110011100111010011010001110101110111010101011111110110000111010111000111110000101111001101011001010000101111011001000111010001000111010111001011010100111111010011010011010111001110011101011011111101011100011111000010111100111100011001011111111101011101100111010011011101011100011001000100111100110101100001011100011101011100111011001010011101011100101101011110111100101101011101010010111101100100000101011010111101011100000001001111101011110 e6b0b8eb9d94ed8d9ce9a3aeeabfb0eb8f85e6b285ec8e88eb96a7e9a6b9ceb7eb8f85e78cbfebb3a6eb8c89e6b0b8eb9d94eb96bde5aea5ec82b5eb809f5e
UHC 永띔퍜飮꿰독沅쎈떧馹η독猿볦댉永띔떽宥삵뀟^ 11100111101101011011011011101010101110111001001111101011111001101011001011100111101101011011011011101010101101101011110111101011100010111011101011101100111100011010010111100111101101011011011011101010101110111001001111101100100010001011001011100111101101011011011011101010101101101011110111101010111010011011101111101101100001011001011001011110 e7b5b6eabb93ebe6b2e7b5b6eab6bdeb8bbaecf1a5e7b5b6eabb93ec88b2e7b5b6eab6bdeae9bbed85965e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)