To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????×? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111101011100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fd73f
SJIS-WIN ???猿???с?域?????怨k??レ×? 0011111100111111001111111000100110001110001111110011111100111111100001001000001100111111100010001110011000111111001111110011111100111111001111111000100110000101100000101000101100111111001111111000001110001100100000010111111000111111 3f3f3f898e3f3f3f84833f88e63f3f3f3f3f8985828b3f3f838c817e3f
EUC-JP ???猿???с?域??彛??怨k??レ×? 00111111001111110011111110110001111011100011111100111111001111111010011111100011001111111011000011101000001111110011111110001111101111001111101000111111001111111011000111100101101000111110101100111111001111111010010111101100101000011101111100111111 3f3f3fb1ee3f3f3fa7e33fb0e83f3f8fbcfa3f3fb1e5a3eb3f3fa5eca1df3f
UTF-8 捻뀁빓猿당땟戮с걶域㏓씈彛뉒넭怨k쳳曆レ×琉 11101111101001101010010011101011100000001000000111101011101110011001001111100111100011001011111111101011100010111011100111101011100101011001111111101111101001111001001011010001100000011110101010110001101101101110010110011111100111111110001110001111100100111110110010010100100010001110010110111101100110111110101110001001100100101110101110000100101011011110011010000000101010001110111110111101100010111110110010110011101100111110111110100110100010111110001110000011101011001100001110010111111011111010011110001100 efa6a4eb8081ebb993e78cbfeb8bb9eb959fefa792d181eab1b6e59f9fe38f93ec9488e5bd9beb8992eb84ade680a8efbd8becb3b3efa68be383acc397efa78c
UHC 捻뀁빓猿당땟戮с걶域㏓씈彛뉒넭怨k쳳曆レ×琉 1110011011110111101100101110110010010101101101111110101010111011101101001110011110110110101011011110101110111101101011001110001110000001100111001110011010110100101001111110101110011101101000001110110010101101100001111110011110000110101011001110101010110011101000111110101110101011100101101110011010110111101010111110110010100001101111111110101110100100 e6f7b2ec95b7eabbb4e7b6adebbdace3819ce6b4a7eb9da0ecad87e786aceab3a3ebab96e6b7abeca1bfeba4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)