Which bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 艾??誼?├濡レ?[艾??誼?├濡レ?[^ 11100100100010000011111100111111100010110110001000111111100001001010010110010100010001111000001110001100001111110101101111100100100010000011111100111111100010110110001000111111100001001010010110010100010001111000001110001100001111110101101101011110 e4883f3f8b623f84a59447838c3f5be4883f3f8b623f84a59447838c3f5b5e
EUC-JP 艾??誼?├濡レ?[艾??誼?├濡レ?[^ 11100111111010000011111100111111101101011100001100111111101010001010011111000111101010001010010111101100001111110101101111100111111010000011111100111111101101011100001100111111101010001010011111000111101010001010010111101100001111110101101101011110 e7e83f3fb5c33fa8a7c7a8a5ec3f5be7e83f3fb5c33fa8a7c7a8a5ec3f5b5e
UTF-8 艾싲챶誼뤄├濡レ젆[艾싲챶誼뤄├濡レ젆[^ 111010001000100110111110111011001000101110110010111011001011000110110110111010001010101010111100111010111010010010000100111000101001010010011100111001101011111110100001111000111000001110101100111011001010000010000110010110111110100010001001101111101110110010001011101100101110110010110001101101101110100010101010101111001110101110100100100001001110001010010100100111001110011010111111101000011110001110000011101011001110110010100000100001100101101101011110 e889beec8bb2ecb1b6e8aabceba484e2949ce6bfa1e383aceca0865be889beec8bb2ecb1b6e8aabceba484e2949ce6bfa1e383aceca0865b5e
UHC 艾싲챶誼뤄├濡レ젆[艾싲챶誼뤄├濡レ젆[^ 111001001111010110011010111010111010101010000011111010111111111010110111111011111010011010100111111010111010000110101011111011001010000010001001010110111110010011110101100110101110101110101010100000111110101111111110101101111110111110100110101001111110101110100001101010111110110010100000100010010101101101011110 e4f59aebaa83ebfeb7efa6a7eba1abeca0895be4f59aebaa83ebfeb7efa6a7eba1abeca0895b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
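
The conversion shown in the table can be approximated with a short Python sketch. It is not this tool's actual implementation. The function name `encode_report` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the ISO-8859-1 row above.

```python
# Sketch: encode a string in several charsets and show the resulting
# bytes as a binary bit string and as a hexadecimal string.
# The label -> codec mapping is an assumption (Python codec names).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return a list of (label, binary_string, hex_string) tuples."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_report("あ"):
        print(label, bits, hex_string)
```

For a single hiragana character such as "あ", ISO-8859-1 has no mapping and yields the one-byte placeholder 3f, while UTF-8 yields three bytes and the Japanese charsets yield two.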