To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 蹄??嶝?????麥???嶝??煜??B 10010010111110110011111100111111100110111101000100111111001111110011111100111111001111111110101001101101001111110011111100111111100110111101000100111111001111111111101101010101001111110011111101000010 92fb3f3f9bd13f3f3f3f3fea6d3f3f3f9bd13f3ffb553f3f42
EUC-JP 蹄??嶝??栯??麥???嶝??煜??B 11000100111111010011111100111111110101101101001100111111001111111000111111000011110100100011111100111111111100111100111000111111001111110011111111010110110100110011111100111111100011111100100111111100001111110011111101000010 c4fd3f3fd6d33f3f8fc3d23f3ff3ce3f3f3fd6d33f3f8fc9fc3f3f42
UTF-8 蹄ㅶ렯嶝렰렱栯숄렓麥렡ㅶ렯嶝렰렱煜얕썽B 11101000101110011000010011100011100001011011011011101011101000001010111111100101101101101001110111101011101000001011000011101011101000001011000111100110101000001010111111101100100010001000010011101011101000001001001111101001101110101010010111101011101000001010000111100011100001011011011011101011101000001010111111100101101101101001110111101011101000001011000011101011101000001011000111100111100001011001110011101100100101101001010111101100100011011011110101000010 e8b984e385b6eba0afe5b69deba0b0eba0b1e6a0afec8884eba093e9baa5eba0a1e385b6eba0afe5b69deba0b0eba0b1e7859cec9695ec8dbd42
UHC 蹄ㅶ렯嶝렰렱栯숄렓麥렡ㅶ렯嶝렰렱煜얕썽B 111100001011010010100100111001101000111010111100110101001111000110001110101111011000111010111110111010011111000110111100111100011000111010101000110110001110101010001110101100101010010011100110100011101011110011010100111100011000111010111101100011101011111011101001111100101011111011101000101111011110100101000010 f0b4a4e68ebcd4f18ebd8ebee9f1bcf18ea8d8ea8eb2a4e68ebcd4f18ebd8ebee9f2bee8bde942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)