To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 疫??筌?ぉ??????疫??筌?ぉ??????^ 10001001011101010011111100111111111000101010001100111111100000101010011100111111001111110011111100111111001111110011111110001001011101010011111100111111111000101010001100111111100000101010011100111111001111110011111100111111001111110011111101011110 89753f3fe2a33f82a73f3f3f3f3f3f89753f3fe2a33f82a73f3f3f3f3f3f5e
EUC-JP 疫??筌?ぉ???艅??疫??筌?ぉ???艅??^ 1011000111010110001111110011111111100100101001010011111110100100101010010011111100111111001111111000111111010110111111010011111100111111101100011101011000111111001111111110010010100101001111111010010010101001001111110011111100111111100011111101011011111101001111110011111101011110 b1d63f3fe4a53fa4a93f3f3f8fd6fd3f3fb1d63f3fe4a53fa4a93f3f3f8fd6fd3f3f5e
UTF-8 疫뀀젶筌잙ぉ溜곕젔艅믩젩疫뀀젶筌잙ぉ溜곕젔艅믩젫^ 11100111100101101010101111101011100000001000000011101100101000001011011011100111101011011000110011101100100111101001100111100011100000011000100111101111101001111000101111101010101100111001010111101100101000001001010011101000100010011000010111101011101011111010100111101100101000001010100111100111100101101010101111101011100000001000000011101100101000001011011011100111101011011000110011101100100111101001100111100011100000011000100111101111101001111000101111101010101100111001010111101100101000001001010011101000100010011000010111101011101011111010100111101100101000001010101101011110 e796abeb8080eca0b6e7ad8cec9e99e38189efa78beab395eca094e88985ebafa9eca0a9e796abeb8080eca0b6e7ad8cec9e99e38189efa78beab395eca094e88985ebafa9eca0ab5e
UHC 疫뀀젶筌잙ぉ溜곕젔艅믩젩疫뀀젶筌잙ぉ溜곕젔艅믩젫^ 11100110101110011011001011101011101000001010101011101111101001111001111111101011101010101010100111101010111111101011000011101011101000001001001011100110101010011001001011101011101000001010000111100110101110011011001011101011101000001010101011101111101001111001111111101011101010101010100111101010111111101011000011101011101000001001001011100110101010011001001011101011101000001010001101011110 e6b9b2eba0aaefa79febaaa9eafeb0eba092e6a992eba0a1e6b9b2eba0aaefa79febaaa9eafeb0eba092e6a992eba0a35e

SJIS-Win, EUC-JP: legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
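
The conversion shown in the table above can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. It assumes that Python's standard codecs `cp932` and `cp949` correspond to SJIS-WIN and UHC, and that characters a charset cannot represent are replaced with `?` (0x3f), which is what the runs of `3f` in the table suggest. The names `CHARSETS` and `encode_table` are hypothetical.

```python
# Map the labels used in the table to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset
        # cannot represent, instead of raising UnicodeEncodeError.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


for label, bits, hex_string in encode_table("あ^"):
    print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.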