Which bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蠢?迂?齬耕??義???迂?齬耕??懿 11100101101111110011111110001001010010010011111111101010100101111000110101101011001111110011111110001011011000000011111100111111001111111000100101001001001111111110101010010111100011010110101100111111001111111001110011110010 e5bf3f89493fea978d6b3f3f8b603f3f3f89493fea978d6b3f3f9cf2
EUC-JP 蠢?迂?齬耕??義???迂?齬耕??懿 11101010110000010011111110110001101010100011111111110011111101111011100111001100001111110011111110110101110000010011111100111111001111111011000110101010001111111111001111110111101110011100110000111111001111111101100011110100 eac13fb1aa3ff3f7b9cc3f3fb5c13f3f3fb1aa3ff3f7b9cc3f3fd8f4
UTF-8 蠢렎迂렧齬耕렫렒義뀜렰렭迂렧齬耕렫렒懿 111010001010000010100010111010111010000010001110111010001011111110000010111010111010000010100111111010011011110110101100111010001000000010010101111010111010000010101011111010111010000010010010111001111011111010101001111010111000000010011100111010111010000010110000111010111010000010101101111010001011111110000010111010111010000010100111111010011011110110101100111010001000000010010101111010111010000010101011111010111010000010010010111001101000011110111111 e8a0a2eba08ee8bf82eba0a7e9bdace88095eba0abeba092e7bea9eb809ceba0b0eba0ade8bf82eba0a7e9bdace88095eba0abeba092e687bf
UHC 蠢렎迂렧齬耕렫렒義뀜렰렭迂렧齬耕렫렒懿 1111000111100011100011101010010011101001111001101000111010110110111001011110000111001100111010011000111010111001100011101010011111101011111110011011001011110001100011101011110110001110101110101110100111100110100011101011011011100101111000011100110011101001100011101011100110001110101001111110101111110011 f1e38ea4e9e68eb6e5e1cce98eb98ea7ebf9b2f18ebd8ebae9e68eb6e5e1cce98eb98ea7ebf3

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
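
A conversion like the one in the table can be approximated with a short Python sketch. This is not the page's actual implementation. The mapping from the charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes visible in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset of CHARSETS (hypothetical helper)."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in convert("あ").items():
        print(name, bits, hex_string)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.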