To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 哀??宜?ぜ受??語ι?醫??獄??爾ο? 10001000101000110011111100111111100010110101100000111111100000101011101010001110111100110011111100111111100011001110101010000011110001110011111111100111110011100011111100111111100011011001011000111111001111111000111010100010100000111100110100111111 88a33f3f8b583f82ba8ef33f3f8cea83c73fe7ce3f3f8d963f3f8ea283cd3f
EUC-JP 哀??宜?ぜ受??語ι?醫??獄??爾ο? 10110000101001010011111100111111101101011011100100111111101001001011110010111100111101010011111100111111101110001110110010100110110010010011111111101110110100000011111100111111101110011111011000111111001111111011110010100100101001101100111100111111 b0a53f3fb5b93fa4bcbcf53f3fb8eca6c93feed03f3fb9f63f3fbca4a6cf3f
UTF-8 哀노맧宜배ぜ受쇰㎙語ι춳醫묆렊獄쏄퀡爾ο쬇 11100101100100111000000011101011100001011011100011101011101001111010011111100101101011101001110011101011101100001011000011100011100000011001110011100101100011111001011111101100100001111011000011100011100011101001100111101000101010101001111011001110101110011110110010110110101100111110100110000110101010111110101110101100100001101110101110100000100010101110011110001101100001001110110010001111100001001110110110000000101000011110011110001000101111101100111010111111111011001010110010000111 e59380eb85b8eba7a7e5ae9cebb0b0e3819ce58f97ec87b0e38e99e8aa9eceb9ecb6b3e986abebac86eba08ae78d84ec8f84ed80a1e788becebfecac87
UHC 哀노맧宜배ぜ受쇰㎙語ι춳醫묆렊獄쏄퀡爾ο쬇 111001001110111010110011111010111001000010110000111010111111000110111001111010001010101010111100111000011111010010111100111010111010011110101011111001011101111010100101111010011010110110001111111011001010001010010001111000111000111010100001111010001010101110011011111010101011001110010101111011001011001110100101111011111010011010011110 e4eeb3eb90b0ebf1b9e8aabce1f4bceba7abe5dea5e9ad8feca291e38ea1e8ab9beab395ecb3a5efa69e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)