Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????U}?????????U{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f557b5e
SJIS-WIN 鸚?????耀?┴U}鸚?????耀?┴U{^ 1110101001011111001111110011111100111111001111110011111110010111011100110011111110000100101010000101010101111101111010100101111100111111001111110011111100111111001111111001011101110011001111111000010010101000010101010111101101011110 ea5f3f3f3f3f3f97733f84a8557dea5f3f3f3f3f3f97733f84a8557b5e
EUC-JP 鸚?????耀?┴U}鸚?????耀?┴U{^ 1111001111000000001111110011111100111111001111110011111111001101110101000011111110101000101010100101010101111101111100111100000000111111001111110011111100111111001111111100110111010100001111111010100010101010010101010111101101011110 f3c03f3f3f3f3fcdd43fa8aa557df3c03f3f3f3f3fcdd43fa8aa557b5e
UTF-8 鸚뺟퉽隸뚩뱠耀띸┴U}鸚뺟퉽隸뚩뱠耀띸┴U{^ 1110100110111000100110101110101110111010100111111110110110001001101111011110111110100110101110001110101110011010101010011110101110110001101000001110100010000000100000001110101110011101101110001110001010010100101101000101010101111101111010011011100010011010111010111011101010011111111011011000100110111101111011111010011010111000111010111001101010101001111010111011000110100000111010001000000010000000111010111001110110111000111000101001010010110100010101010111101101011110 e9b89aebba9fed89bdefa6b8eb9aa9ebb1a0e88080eb9db8e294b4557de9b89aebba9fed89bdefa6b8eb9aa9ebb1a0e88080eb9db8e294b4557b5e
UHC 鸚뺟퉽隸뚩뱠耀띸┴U}鸚뺟퉽隸뚩뱠耀띸┴U{^ 1110010110100100100101011110011110111001100101011110011111100110100011001110100010010011100001101110100110100101100011011110011110100110101010100101010101111101111001011010010010010101111001111011100110010101111001111110011010001100111010001001001110000110111010011010010110001101111001111010011010101010010101010111101101011110 e5a495e7b995e7e68ce89386e9a58de7a6aa557de5a495e7b995e7e68ce89386e9a58de7a6aa557b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, SJIS-Win (= CP932) on Windows and EUC-JP on UNIX.
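
The conversion shown in the table can be reproduced offline with a minimal Python sketch. It is not the code behind this page. The codec names are assumptions that map the table's labels onto Python's built-in codecs (`cp932` for SJIS-WIN, `cp949` for UHC), and the function name `encode_table` is hypothetical. Characters that a charset cannot represent are replaced with `?` (hex 3f), which appears to be what the table does as well.

```python
# Assumed mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code = CP949
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes '?' for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("U}"):
        print(label, bits, hex_string)
```

For the ASCII input "U}", every charset yields the same bytes, 557d, which matches the "U}" portion of each row in the table. The charsets only diverge for non-ASCII characters.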