Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 症?寃??嶝??云?窺寃??嶝??云肌 10001111110001110011111110011011100000110011111100111111100110111101000100111111001111111000100101011101001111111000100101001101100110111000001100111111001111111001101111010001001111110011111110001001010111011001010010100111 8fc73f9b833f3f9bd13f3f895d3f894d9b833f3f9bd13f3f895d94a7
EUC-JP 症?寃??嶝??云?窺寃??嶝??云肌 10111110110010010011111111010101111000110011111100111111110101101101001100111111001111111011000110111110001111111011000110101110110101011110001100111111001111111101011011010011001111110011111110110001101111101100100010101001 bec93fd5e33f3fd6d33f3fb1be3fb1aed5e33f3fd6d33f3fb1bec8a9
UTF-8 症렓寃꿩렯嶝렰렯云陋窺寃꿩렯嶝렰렯云肌 111001111001011110000111111010111010000010010011111001011010111110000011111010101011111110101001111010111010000010101111111001011011011010011101111010111010000010110000111010111010000010101111111001001011101010010001111011111010010110010001111001111010101010111010111001011010111110000011111010101011111110101001111010111010000010101111111001011011011010011101111010111010000010110000111010111010000010101111111001001011101010010001111010001000001010001100 e79787eba093e5af83eabfa9eba0afe5b69deba0b0eba0afe4ba91efa591e7aabae5af83eabfa9eba0afe5b69deba0b0eba0afe4ba91e8828c
UHC 症렓寃꿩렯嶝렰렯云陋窺寃꿩렯嶝렰렯云肌 1111000111111000100011101010100011101010101100101011001011100110100011101011110011010100111100011000111010111101100011101011110011101001111101101101001011101011110100001010101011101010101100101011001011100110100011101011110011010100111100011000111010111101100011101011110011101001111101101101000110111111 f1f88ea8eab2b2e68ebcd4f18ebd8ebce9f6d2ebd0aaeab2b2e68ebcd4f18ebd8ebce9f6d1bf

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
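
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name encode_table and the mapping from the page's charset labels to Python codec names (in particular cp932 for SJIS-WIN and cp949 for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which appears to be what the table does as well.

```python
# Assumed mapping from the page's charset labels to Python codec names.
# These pairings are an illustration, not taken from the page itself.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" turns unencodable characters into b"?" (0x3f).
        data = text.encode(codec, errors="replace")
        # Each byte is written as exactly eight binary digits.
        binary = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (binary, data.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("症").items():
        print(label, binary, hexadecimal)
```

For the single character 症, this sketch yields e79787 for UTF-8, 8fc7 for SJIS-WIN, bec9 for EUC-JP, and 3f for ISO-8859-1, which agrees with the leading bytes of the corresponding rows in the table.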