To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 厭ャ?魏←?應??厭ャ?魏←?應??^ 1000100101111101100000111000001100111111111010011011000010000001101010010011111110011100111001000011111100111111100010010111110110000011100000110011111111101001101100001000000110101001001111111001110011100100001111110011111101011110 897d83833fe9b081a93f9ce43f3f897d83833fe9b081a93f9ce43f3f5e
EUC-JP 厭ャ?魏←?應??厭ャ?魏←?應??^ 1011000111011110101001011110001100111111111100101011001010100010101010110011111111011000111001100011111100111111101100011101111010100101111000110011111111110010101100101010001010101011001111111101100011100110001111110011111101011110 b1dea5e33ff2b2a2ab3fd8e63f3fb1dea5e33ff2b2a2ab3fd8e63f3f5e
UTF-8 厭ャ꺀魏←뭐應쇳뜡厭ャ꺀魏←뭐應쇳뜡^ 11100101100011101010110111100011100000111010001111101010101110101000000011101001101011011000111111100010100001101001000011101011101011011001000011100110100001111000100111101100100001111011001111101011100111001010000111100101100011101010110111100011100000111010001111101010101110101000000011101001101011011000111111100010100001101001000011101011101011011001000011100110100001111000100111101100100001111011001111101011100111001010000101011110 e58eade383a3eaba80e9ad8fe28690ebad90e68789ec87b3eb9ca1e58eade383a3eaba80e9ad8fe28690ebad90e68789ec87b3eb9ca15e
UHC 厭ャ꺀魏←뭐應쇳뜡厭ャ꺀魏←뭐應쇳뜡^ 11100110111101001010101111100011100000111010100111101010111000001010000111100111101110011011100111101011111010111011110011101101100011011010010011100110111101001010101111100011100000111010100111101010111000001010000111100111101110011011100111101011111010111011110011101101100011011010010001011110 e6f4abe383a9eae0a1e7b9b9ebebbced8da4e6f4abe383a9eae0a1e7b9b9ebebbced8da45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)