What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 塋ゅ????耶??塋ゅ?泳?B 100110101100100010000010111000110011111100111111001111110011111110010110111010110011111100111111100110101100100010000010111000110011111110001001011010100011111101000010 9ac882e33f3f3f3f96eb3f3f9ac882e33f896a3f42
EUC-JP 塋ゅ?孼??耶??塋ゅ?泳?B 1101010011001010101001001110010100111111100011111011101011000011001111110011111111001100111011010011111100111111110101001100101010100100111001010011111110110001110010110011111101000010 d4caa4e53f8fbac33f3fcced3f3fd4caa4e53fb1cb3f42
UTF-8 塋ゅ콪孼껇꽦耶섉릍塋ゅ끀泳숤B 11100101101000011000101111100011100000101000010111101100101111011010101011100101101011011011110011101010101110111000011111101010101111011010011011101000100000001011011011101100100001001000100111101011101001101000110111100101101000011000101111100011100000101000010111101011100000011000000011100110101100111011001111101100100010001010010001000010 e5a18be38285ecbdaae5adbceabb87eabda6e880b6ec8489eba68de5a18be38285eb8180e6b3b3ec88a442
UHC 塋ゅ콪孼껇꽦耶섉릍塋ゅ끀泳숤B 1110011110101011101010101110010110110001100111101110010111101101100000111110100010000100101100011110010110101101100110001110011010111000101011001110011110101011101010101110010110000101101101101110011110110110100110100100000101000010 e7abaae5b19ee5ed83e884b1e5ad98e6b8ace7abaae585b6e7b69a4142

SJIS-Win, EUC-JP: Classic character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
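
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the page's own implementation. It assumes that Python's `cp932` and `cp949` codecs are close enough stand-ins for SJIS-WIN and UHC, and that characters a charset cannot represent are replaced by "?" (byte 3f), as the ISO-8859-1 row suggests. The names `CHARSETS`, `to_bit_strings`, and `convert` are hypothetical.

```python
# Mapping from the charset labels in the table to Python codec names.
# Assumption: these codecs are the closest equivalents, not guaranteed identical.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(format(byte, "08b") for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexa) in convert("B").items():
        print(label, binary, hexa)
```

For the single character "B", every charset above yields the byte 42 (binary 01000010), which matches the final byte of each row in the table.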