To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????x?????????x^ 001111110011111100111111001111110011111100111111001111110011111100111111011110000011111100111111001111110011111100111111001111110011111100111111001111110111100001011110 3f3f3f3f3f3f3f3f3f783f3f3f3f3f3f3f3f3f785e
SJIS-WIN 劾??劾?????x劾??劾?????x^ 10001010010011100011111100111111100010100100111000111111001111110011111100111111001111110111100010001010010011100011111100111111100010100100111000111111001111110011111100111111001111110111100001011110 8a4e3f3f8a4e3f3f3f3f3f788a4e3f3f8a4e3f3f3f3f3f785e
EUC-JP 劾??劾?????x劾??劾?????x^ 10110011101011110011111100111111101100111010111100111111001111110011111100111111001111110111100010110011101011110011111100111111101100111010111100111111001111110011111100111111001111110111100001011110 b3af3f3fb3af3f3f3f3f3f78b3af3f3fb3af3f3f3f3f3f785e
UTF-8 劾귦삸劾귥쮯淋괒麗x劾귦삸劾귥쮯淋괒麗x^ 111001011000101010111110111010101011011110100110111011001000001010111000111001011000101010111110111010101011011110100101111011001010111010101111111011111010011110110101111010101011010010010010111011111010011010001000011110001110010110001010101111101110101010110111101001101110110010000010101110001110010110001010101111101110101010110111101001011110110010101110101011111110111110100111101101011110101010110100100100101110111110100110100010000111100001011110 e58abeeab7a6ec82b8e58abeeab7a5ecaeafefa7b5eab492efa68878e58abeeab7a6ec82b8e58abeeab7a5ecaeafefa7b5eab492efa688785e
UHC 劾귦삸劾귥쮯淋괒麗x劾귦삸劾귥쮯淋괒麗x^ 111110101011011010000010111011011001100010101111111110101011011010000010111011001010100010001100111011001111100010000001111111011110011010110000011110001111101010110110100000101110110110011000101011111111101010110110100000101110110010101000100011001110110011111000100000011111110111100110101100000111100001011110 fab682ed98affab682eca88cecf881fde6b078fab682ed98affab682eca88cecf881fde6b0785e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)