To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???乙??癲ル?靭??遺??悠?┼???^ 0011111100111111001111111000100110110011001111110011111111100001100111111000001110001011001111111001000001111000001111110011111110001000111000100011111100111111100101110100100100111111100001001010100100111111001111110011111101011110 3f3f3f89b33f3fe19f838b3f90783f3f88e23f3f97493f84a93f3f3f5e
EUC-JP ???乙??癲ル?靭??遺??悠?┼洧??^ 00111111001111110011111110110010101101010011111100111111111000101010000110100101111010110011111110111111110110010011111100111111101100001110010000111111001111111100110110101010001111111010100010101011100011111100011110110100001111110011111101011110 3f3f3fb2b53f3fe2a1a5eb3fbfd93f3fb0e43f3fcdaa3fa8ab8fc7b43f3f5e
UTF-8 樂낅텪乙대뤉癲ル씭靭녽썒遺욎궡悠뱄┼洧얩뇛^ 11101111101001101011111111101011100000101000010111101101100001011010101011100100101110011001100111101011100011001000000011101011101001001000100111100111100110011011001011100011100000111010101111101100100101001010110111101001100111011010110111101011100001011011110111101100100011011001001011101001100000011011101011101100100110101000111011101010101101101010000111100110100000101010000011101011101100011000010011100010100101001011110011100110101101001010011111101100100101101010100111101011100001111001101101011110 efa6bfeb8285ed85aae4b999eb8c80eba489e799b2e383abec94ade99dadeb85bdec8d92e981baec9a8eeab6a1e682a0ebb184e294bce6b4a7ec96a9eb879b5e
UHC 樂낅텪乙대뤉癲ル씭靭녽썒遺욎궡悠뱄┼洧얩뇛^ 11101000111110011000010111101011101101101001111011101011111000001011010011101011100011111011100111101111101001101010101111101011100111011011111011101100111001011000011011101001100110111000010111101011101101101001111011101100100000101011010011101010111011011011100111101111101001101010101111101010111110111011111011101101100001111000011001011110 e8f985ebb69eebe0b4eb8fb9efa6abeb9dbeece586e99b85ebb69eec82b4eaedb9efa6abeafbbeed87865e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
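
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation. The codec names below are assumptions that map the page's labels onto Python's standard codecs (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC), and their mapping tables may differ from the page's in edge cases. The function names `encode_row` and `convert` are hypothetical. Characters a charset cannot represent are replaced with `?` (hex 3f), which matches the runs of `3f` in the rows above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes '?' for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Two characters that also appear in the table above.
    for label, (bits, hexstr) in convert("乙^").items():
        print(label, bits, hexstr)
```

For the input `乙^`, the SJIS-WIN, EUC-JP, and UTF-8 rows should contain the same byte sequences for `乙` that appear in the table (89b3, b2b5, and e4b999 respectively), each followed by 5e for `^`.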