What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??Wznf??Wzn^}Y??Wznf??Wzn^}bE 0011111100111111010101110111101001101110011001100011111100111111010101110111101001101110010111100111110101011001001111110011111101010111011110100110111001100110001111110011111101010111011110100110111001011110011111010110001001000101 3f3f577a6e663f3f577a6e5e7d593f3f577a6e663f3f577a6e5e7d6245
SJIS-WIN 癌先Wznf癌先Wzn^}Y癌先Wznf癌先Wzn^}bE 10001010111000001001000011100110010101110111101001101110011001101000101011100000100100001110011001010111011110100110111001011110011111010101100110001010111000001001000011100110010101110111101001101110011001101000101011100000100100001110011001010111011110100110111001011110011111010110001001000101 8ae090e6577a6e668ae090e6577a6e5e7d598ae090e6577a6e668ae090e6577a6e5e7d6245
EUC-JP 癌先Wznf癌先Wzn^}Y癌先Wznf癌先Wzn^}bE 10110100111000101100000011101000010101110111101001101110011001101011010011100010110000001110100001010111011110100110111001011110011111010101100110110100111000101100000011101000010101110111101001101110011001101011010011100010110000001110100001010111011110100110111001011110011111010110001001000101 b4e2c0e8577a6e66b4e2c0e8577a6e5e7d59b4e2c0e8577a6e66b4e2c0e8577a6e5e7d6245
UTF-8 癌先Wznf癌先Wzn^}Y癌先Wznf癌先Wzn^}bE 111001111001100110001100111001011000010110001000010101110111101001101110011001101110011110011001100011001110010110000101100010000101011101111010011011100101111001111101010110011110011110011001100011001110010110000101100010000101011101111010011011100110011011100111100110011000110011100101100001011000100001010111011110100110111001011110011111010110001001000101 e7998ce58588577a6e66e7998ce58588577a6e5e7d59e7998ce58588577a6e66e7998ce58588577a6e5e7d6245
UHC 癌先Wznf癌先Wzn^}Y癌先Wznf癌先Wzn^}bE 11100100110111111110000010111011010101110111101001101110011001101110010011011111111000001011101101010111011110100110111001011110011111010101100111100100110111111110000010111011010101110111101001101110011001101110010011011111111000001011101101010111011110100110111001011110011111010110001001000101 e4dfe0bb577a6e66e4dfe0bb577a6e5e7d59e4dfe0bb577a6e66e4dfe0bb577a6e5e7d6245
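
The rows above can be approximated offline with a short script. This is a minimal sketch, not the converter's actual implementation: the function name `encode_report` and the codec names are assumptions, chosen so that Python's `cp932` stands in for SJIS-WIN and `cp949` for UHC. Characters a charset cannot represent are replaced with `?`, which matches the ISO-8859-1 row.

```python
def encode_report(text, charsets):
    """Return (charset, binary string, hex string) for each charset.

    encode_report is a hypothetical helper name used for illustration.
    Unencodable characters are replaced with '?' (errors="replace").
    """
    rows = []
    for name in charsets:
        raw = text.encode(name, errors="replace")
        # Render each byte as 8 bits, keeping leading zeros.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((name, bits, raw.hex()))
    return rows


# Python codec names assumed to correspond to the table's charsets.
CHARSETS = ["iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949"]

for name, bits, hex_string in encode_report("癌先", CHARSETS):
    print(name, bits, hex_string)
```

Passing a longer input string yields longer bit strings in the same way, since each byte of the encoded result contributes exactly eight binary digits and two hexadecimal digits.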

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).