To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN 止?憶伯止?憶伯B 100011100111111000111111100010011010111110010100100011001000111001111110001111111000100110101111100101001000110001000010 8e7e3f89af948c8e7e3f89af948c42
EUC-JP 止?憶伯止?憶伯B 101110111101111100111111101100101011000111000111111011001011101111011111001111111011001010110001110001111110110001000010 bbdf3fb2b1c7ecbbdf3fb2b1c7ec42
UTF-8 止렮憶伯止렮憶伯B 11100110101011011010001011101011101000001010111011100110100001101011011011100100101111001010111111100110101011011010001011101011101000001010111011100110100001101011011011100100101111001010111101000010 e6ada2eba0aee686b6e4bcafe6ada2eba0aee686b6e4bcaf42
UHC 止렮憶伯止렮憶伯B 1111001010101101100011101011101111100101111000111101101111010111111100101010110110001110101110111110010111100011110110111101011101000010 f2ad8ebbe5e3dbd7f2ad8ebbe5e3dbd742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)