To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 爾遯健?毬爾遯健?臼[爾遯健?毬爾遯健?臼[^ 100011101010001011100111101010101000110010010010001111111001111101111011100011101010001011100111101010101000110010010010001111111000100101010000010110111000111010100010111001111010101010001100100100100011111110011111011110111000111010100010111001111010101010001100100100100011111110001001010100000101101101011110 8ea2e7aa8c923f9f7b8ea2e7aa8c923f89505b8ea2e7aa8c923f9f7b8ea2e7aa8c923f89505b5e
EUC-JP 爾遯健?毬爾遯健?臼[爾遯健?毬爾遯健?臼[^ 101111001010010011101110101011001011011111110010001111111101110111011100101111001010010011101110101011001011011111110010001111111011000110110001010110111011110010100100111011101010110010110111111100100011111111011101110111001011110010100100111011101010110010110111111100100011111110110001101100010101101101011110 bca4eeacb7f23fdddcbca4eeacb7f23fb1b15bbca4eeacb7f23fdddcbca4eeacb7f23fb1b15b5e
UTF-8 爾遯健롅毬爾遯健롅臼[爾遯健롅毬爾遯健롅臼[^ 111001111000100010111110111010011000000110101111111001011000000110100101111010111010000110000101111001101010111110101100111001111000100010111110111010011000000110101111111001011000000110100101111010111010000110000101111010001000011110111100010110111110011110001000101111101110100110000001101011111110010110000001101001011110101110100001100001011110011010101111101011001110011110001000101111101110100110000001101011111110010110000001101001011110101110100001100001011110100010000111101111000101101101011110 e788bee981afe581a5eba185e6aface788bee981afe581a5eba185e887bc5be788bee981afe581a5eba185e6aface788bee981afe581a5eba185e887bc5b5e
UHC 爾遯健롅毬爾遯健롅臼[爾遯健롅毬爾遯健롅臼[^ 11101100101100111101010011101110110010111110110110001110110010111100111110110011111011001011001111010100111011101100101111101101100011101100101111001111101111110101101111101100101100111101010011101110110010111110110110001110110010111100111110110011111011001011001111010100111011101100101111101101100011101100101111001111101111110101101101011110 ecb3d4eecbed8ecbcfb3ecb3d4eecbed8ecbcfbf5becb3d4eecbed8ecbcfb3ecb3d4eecbed8ecbcfbf5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)