To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 張?咀峰蕪應賂??張?咀峰蕪應賂??B 10010010101000110011111110011001111100001001010111110100100101011001001110011100111001001001100001000111001111110011111110010010101000110011111110011001111100001001010111110100100101011001001110011100111001001001100001000111001111110011111101000010 92a33f99f095f495939ce498473f3f92a33f99f095f495939ce498473f3f42
EUC-JP 張?咀峰蕪應賂??張?咀峰蕪應賂??B 11000100101001010011111111010010111100101100101011110110110010011111001111011000111001101100111110101000001111110011111111000100101001010011111111010010111100101100101011110110110010011111001111011000111001101100111110101000001111110011111101000010 c4a53fd2f2caf6c9f3d8e6cfa83f3fc4a53fd2f2caf6c9f3d8e6cfa83f3f42
UTF-8 張렜咀峰蕪應賂렰렞張렜咀峰蕪應賂렰렞B 11100101101111001011010111101011101000001001110011100101100100101000000011100101101100111011000011101000100101011010101011100110100001111000100111101000101100111000001011101011101000001011000011101011101000001001111011100101101111001011010111101011101000001001110011100101100100101000000011100101101100111011000011101000100101011010101011100110100001111000100111101000101100111000001011101011101000001011000011101011101000001001111001000010 e5bcb5eba09ce59280e5b3b0e895aae68789e8b382eba0b0eba09ee5bcb5eba09ce59280e5b3b0e895aae68789e8b382eba0b0eba09e42
UHC 張렜咀峰蕪應賂렰렞張렜咀峰蕪應賂렰렞B 11101101111001011000111010101110111011101011101011011100111010001101100111110011111010111110101111010110111100011000111010111101100011101010111111101101111001011000111010101110111011101011101011011100111010001101100111110011111010111110101111010110111100011000111010111101100011101010111101000010 ede58eaeeebadce8d9f3ebebd6f18ebd8eafede58eaeeebadce8d9f3ebebd6f18ebd8eaf42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)