What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 丈ヲ叱鉦オ竺宍酌丈ヲ叱鉦オ竺宍灼^ 1000111111100100101001101000111010110110100011111101111010110101100011101011000110001110101100111000111011011110100011111110010010100110100011101011011010001111110111101011010110001110101100011000111010110011100011101101110001011110 8fe4a68eb68fdeb58eb18eb38ede8fe4a68eb68fdeb58eb18eb38edc5e
EUC-JP 丈ヲ叱鉦オ竺宍酌丈ヲ叱鉦オ竺宍灼^ 101111101110011010001110101001101011110010111000101111101110000010001110101101011011110010110011101111001011010110111100111000001011111011100110100011101010011010111100101110001011111011100000100011101011010110111100101100111011110010110101101111001101111001011110 bee68ea6bcb8bee08eb5bcb3bcb5bce0bee68ea6bcb8bee08eb5bcb3bcb5bcde5e
UTF-8 丈ヲ叱鉦オ竺宍酌丈ヲ叱鉦オ竺宍灼^ 11100100101110001000100011101111101111011010011011100101100011111011000111101001100010011010011011101111101111011011010111100111101010111011101011100101101011101000110111101001100001011000110011100100101110001000100011101111101111011010011011100101100011111011000111101001100010011010011011101111101111011011010111100111101010111011101011100101101011101000110111100111100000011011110001011110 e4b888efbda6e58fb1e989a6efbdb5e7abbae5ae8de9858ce4b888efbda6e58fb1e989a6efbdb5e7abbae5ae8de781bc5e
UHC 丈?叱鉦?竺?酌丈?叱鉦?竺?灼^ 111011011101101100111111111100101110101011101111111110100011111111110101111001110011111111101101110011001110110111011011001111111111001011101010111011111111101000111111111101011110011100111111111011011100011101011110 eddb3ff2eaeffa3ff5e73fedcceddb3ff2eaeffa3ff5e73fedc75e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
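
The conversion shown in the table can be approximated with a short Python sketch. This is not the converter's own implementation. The mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption, as are the helper names encode_row and encode_table. Characters a charset cannot represent are replaced with "?" (3f), which matches the ISO-8859-1 and UHC rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("丈^").items():
        print(label, bits, hex_string)
```

Each byte is printed as eight bits, so the binary column is always exactly four times as long as the hexadecimal column.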