What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 僥??厭?????}僥??厭?????{^ 10011001010001100011111100111111100010010111110100111111001111110011111100111111001111110111110110011001010001100011111100111111100010010111110100111111001111110011111100111111001111110111101101011110 99463f3f897d3f3f3f3f3f7d99463f3f897d3f3f3f3f3f7b5e
EUC-JP 僥??厭?????}僥??厭?????{^ 11010001101001110011111100111111101100011101111000111111001111110011111100111111001111110111110111010001101001110011111100111111101100011101111000111111001111110011111100111111001111110111101101011110 d1a73f3fb1de3f3f3f3f3f7dd1a73f3fb1de3f3f3f3f3f7b5e
UTF-8 僥뚮젿厭묉젩溜뽳섞}僥뚮젿厭묉젩溜뽳섞{^ 111001011000001110100101111010111001101010101110111011001010000010111111111001011000111010101101111010111010110010001001111011001010000010101001111011111010011110001011111010111011110110110011111011001000010010011110011111011110010110000011101001011110101110011010101011101110110010100000101111111110010110001110101011011110101110101100100010011110110010100000101010011110111110100111100010111110101110111101101100111110110010000100100111100111101101011110 e583a5eb9aaeeca0bfe58eadebac89eca0a9efa78bebbdb3ec849e7de583a5eb9aaeeca0bfe58eadebac89eca0a9efa78bebbdb3ec849e7b5e
UHC 僥뚮젿厭묉젩溜뽳섞}僥뚮젿厭묉젩溜뽳섞{^ 111010001110100110001100111010111010000010110001111001101111010010010001111001101010000010100001111010101111111010010110111011111011110010101111011111011110100011101001100011001110101110100000101100011110011011110100100100011110011010100000101000011110101011111110100101101110111110111100101011110111101101011110 e8e98ceba0b1e6f491e6a0a1eafe96efbcaf7de8e98ceba0b1e6f491e6a0a1eafe96efbcaf7b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
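
The rows in the table above can be approximated with a short Python sketch. The page's own implementation is not shown, so this is only an illustration under stated assumptions: the codec names in `CHARSETS` are Python's nearest equivalents (`cp932` for SJIS-WIN, `cp949` for UHC), `encode_row` is a hypothetical helper, and `errors="replace"` is assumed because it substitutes `?` (0x3f) for unencodable characters, which is consistent with the runs of `3f` in the table.

```python
# Map the labels used in the table to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in the given codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


# Print one row per charset for a short ASCII sample.
for label, codec in CHARSETS.items():
    bits, hex_string = encode_row("{^", codec)
    print(label, bits, hex_string)
```

For the ASCII sample `{^`, every charset listed yields the same bytes (`7b5e`), matching the tail of each row in the table; the rows differ only for characters outside ASCII.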