Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ??????占θ?[??????占θ?[^ 00111111001111110011111100111111001111110011111110010000111010001000001111000110001111110101101100111111001111110011111100111111001111110011111110010000111010001000001111000110001111110101101101011110 3f3f3f3f3f3f90e883c63f5b3f3f3f3f3f3f90e883c63f5b5e
EUC-JP ??Ŧ???占θ?[??Ŧ???占θ?[^ 0011111100111111100011111010100110101111001111110011111100111111110000001110101010100110110010000011111101011011001111110011111110001111101010011010111100111111001111110011111111000000111010101010011011001000001111110101101101011110 3f3f8fa9af3f3f3fc0eaa6c83f5b3f3f8fa9af3f3f3fc0eaa6c83f5b5e
UTF-8 嶺든Ŧ玲믧깶占θ뮁[嶺든Ŧ玲믧깶占θ뮁[^ 1110111110100110101010111110101110010011101000001100010110100110111011111010011010101101111010111010111110100111111010101011100110110110111001011000110110100000110011101011100011101011101011101000000101011011111011111010011010101011111010111001001110100000110001011010011011101111101001101010110111101011101011111010011111101010101110011011011011100101100011011010000011001110101110001110101110101110100000010101101101011110 efa6abeb93a0c5a6efa6adebafa7eab9b6e58da0ceb8ebae815befa6abeb93a0c5a6efa6adebafa7eab9b6e58da0ceb8ebae815b5e
UHC 嶺든Ŧ玲믧깶占θ뮁[嶺든Ŧ玲믧깶占θ뮁[^ 111001111010110110110101111001111010100010101110111001111011111110010010111010011000001110100100111011111011111110100101111010001001001010010000010110111110011110101101101101011110011110101000101011101110011110111111100100101110100110000011101001001110111110111111101001011110100010010010100100000101101101011110 e7adb5e7a8aee7bf92e983a4efbfa5e892905be7adb5e7a8aee7bf92e983a4efbfa5e892905b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
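
The conversion in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions about the closest equivalents in Python's standard library, and the helper names `encode_to_bit_strings` and `convert` are made up for illustration. Characters a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table does as well.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# These are assumed equivalents, not the page's actual implementation.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {
        label: encode_to_bit_strings(text, codec)
        for label, codec in CHARSETS.items()
    }


if __name__ == "__main__":
    # Print one row per charset: label, binary string, hex string.
    for label, (binary, hexadecimal) in convert("[^").items():
        print(label, binary, hexadecimal)
```

Because all five charsets are ASCII-compatible, the input `[^` encodes to the same two bytes (hex `5b5e`) in each of them, which matches the tail of every row in the table above.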