To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 永????ぉ???疫??永????ぉ???疫??B 10001001011010010011111100111111001111110011111110000010101001110011111100111111001111111000100101110101001111110011111110001001011010010011111100111111001111110011111110000010101001110011111100111111001111111000100101110101001111110011111101000010 89693f3f3f3f82a73f3f3f89753f3f89693f3f3f3f82a73f3f3f89753f3f42
EUC-JP 永????ぉ???疫??永????ぉ???疫??B 10110001110010100011111100111111001111110011111110100100101010010011111100111111001111111011000111010110001111110011111110110001110010100011111100111111001111110011111110100100101010010011111100111111001111111011000111010110001111110011111101000010 b1ca3f3f3f3fa4a93f3f3fb1d63f3fb1ca3f3f3f3fa4a93f3f3fb1d63f3f42
UTF-8 永귣쓧溜볥ぉ溜곕젍疫뽯젌永귣쓧溜볥ぉ溜곕젍疫뽯젌B 11100110101100001011100011101010101101111010001111101100100100111010011111101111101001111000101111101011101100111010010111100011100000011000100111101111101001111000101111101010101100111001010111101100101000001000110111100111100101101010101111101011101111011010111111101100101000001000110011100110101100001011100011101010101101111010001111101100100100111010011111101111101001111000101111101011101100111010010111100011100000011000100111101111101001111000101111101010101100111001010111101100101000001000110111100111100101101010101111101011101111011010111111101100101000001000110001000010 e6b0b8eab7a3ec93a7efa78bebb3a5e38189efa78beab395eca08de796abebbdafeca08ce6b0b8eab7a3ec93a7efa78bebb3a5e38189efa78beab395eca08de796abebbdafeca08c42
UHC 永귣쓧溜볥ぉ溜곕젍疫뽯젌永귣쓧溜볥ぉ溜곕젍疫뽯젌B 11100111101101011000001011101011100111011000100011101010111111101001001111101011101010101010100111101010111111101011000011101011101000001000111011100110101110011001011011101011101000001000110111100111101101011000001011101011100111011000100011101010111111101001001111101011101010101010100111101010111111101011000011101011101000001000111011100110101110011001011011101011101000001000110101000010 e7b582eb9d88eafe93ebaaa9eafeb0eba08ee6b996eba08de7b582eb9d88eafe93ebaaa9eafeb0eba08ee6b996eba08d42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)