To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????ø? 00111111001111110011111100111111001111110011111100111111001111111111100000111111 3f3f3f3f3f3f3f3ff83f
SJIS-WIN 嚥??要ら?猥??訝 100110101000101100111111001111111001011101110110100000101110011100111111111000001100111000111111001111111110011001100010 9a8b3f3f977682e73fe0ce3f3fe662
EUC-JP 嚥??要ら?猥?ø訝 1101001111101011001111110011111111001101110101111010010011101001001111111110000011010000001111111000111110101001110011001110101111000011 d3eb3f3fcdd7a4e93fe0d03f8fa9ccebc3
UTF-8 嚥듸숲要ら틭猥ㅹø訝 1110010110011010101001011110101110010011101110001110110010001000101100101110100010100110100000011110001110000010100010011110110110001011101011011110011110001100101001011110001110000101101110011100001110111000111010001010100010011101 e59aa5eb93b8ec88b2e8a681e38289ed8bade78ca5e385b9c3b8e8a89d
UHC 嚥듸숲要ら틭猥ㅹø訝 1110011010111111101101011110111110111101101000111110100110101001101010101110100110111010100101111110100011100101101001001110100110101001101010101110010010111000 e6bfb5efbda3e9a9aae9ba97e8e5a4e9a9aae4b8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)