To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 筌????┘???筌??筌????┘???筌??^ 11100010101000110011111100111111001111110011111110000100101000110011111100111111001111111110001010100011001111110011111111100010101000110011111100111111001111110011111110000100101000110011111100111111001111111110001010100011001111110011111101011110 e2a33f3f3f3f84a33f3f3fe2a33f3fe2a33f3f3f3f84a33f3f3fe2a33f3f5e
EUC-JP 筌????┘洹??筌??筌????┘洹??筌??^ 1110010010100101001111110011111100111111001111111010100010100101100011111100011110111010001111110011111111100100101001010011111100111111111001001010010100111111001111110011111100111111101010001010010110001111110001111011101000111111001111111110010010100101001111110011111101011110 e4a53f3f3f3fa8a58fc7ba3f3fe4a53f3fe4a53f3f3f3fa8a58fc7ba3f3fe4a53f3f5e
UTF-8 筌뚭여罹믭┘洹잙펳筌뗫푺筌뚭여罹믭┘洹잙펳筌뗫푺^ 11100111101011011000110011101011100110101010110111101100100101111010110011101111101001111010011011101011101011111010110111100010100101001001100011100110101101001011100111101100100111101001100111101101100011101011001111100111101011011000110011101011100101111010101111101101100100011011101011100111101011011000110011101011100110101010110111101100100101111010110011101111101001111010011011101011101011111010110111100010100101001001100011100110101101001011100111101100100111101001100111101101100011101011001111100111101011011000110011101011100101111010101111101101100100011011101001011110 e7ad8ceb9aadec97acefa7a6ebafade29498e6b4b9ec9e99ed8eb3e7ad8ceb97abed91bae7ad8ceb9aadec97acefa7a6ebafade29498e6b4b9ec9e99ed8eb3e7ad8ceb97abed91ba5e
UHC 筌뚭여罹믭┘洹잙펳筌뗫푺筌뚭여罹믭┘洹잙펳筌뗫푺^ 11101111101001111000110011101010101111111010100111101100101110101001001011101111101001101010010111101010101101111001111111101011101111001000010111101111101001111000101111101011101111101000011011101111101001111000110011101010101111111010100111101100101110101001001011101111101001101010010111101010101101111001111111101011101111001000010111101111101001111000101111101011101111101000011001011110 efa78ceabfa9ecba92efa6a5eab79febbc85efa78bebbe86efa78ceabfa9ecba92efa6a5eab79febbc85efa78bebbe865e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)