To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 曜??俉??急? 1001011101101010001111110011111111111010011000010011111100111111100010110111110100111111 976a3f3ffa613f3f8b7d3f
EUC-JP 曜??俉??急? 110011011100101100111111001111111000111110110001101110110011111100111111101101011101111000111111 cdcb3f3f8fb1bb3f3fb5de3f
UTF-8 曜섓쉠俉득셼急樂 111001101001101110011100111011001000010010010011111011001000100110100000111001001011111110001001111010111001001110011101111011001000010110111100111001101000000010100101111011111010011010111111 e69b9cec8493ec89a0e4bf89eb939dec85bce680a5efa6bf
UHC 曜섓쉠俉득셼急樂 11101000111110001001100011101111101111011010101011100111111010111011010111100110100110011000000111010000111000011110100011111001 e8f898efbdaae7ebb5e69981d0e1e8f9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)