To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 曜??耀????? 1001011101101010001111110011111110010111011100110011111100111111001111110011111100111111 976a3f3f97733f3f3f3f3f
EUC-JP 曜??耀????? 1100110111001011001111110011111111001101110101000011111100111111001111110011111100111111 cdcb3f3fcdd43f3f3f3f3f
UTF-8 曜쏅젳耀붾쓬溜믩젍 111001101001101110011100111011001000111110000101111011001010000010110011111010001000000010000000111010111011011010111110111011001001001110101100111011111010011110001011111010111010111110101001111011001010000010001101 e69b9cec8f85eca0b3e88080ebb6beec93acefa78bebafa9eca08d
UHC 曜쏅젳耀붾쓬溜믩젍 111010001111100010011011111010111010000010100111111010011010010110010100111010111001110110001100111010101111111010010010111010111010000010001110 e8f89beba0a7e9a594eb9d8ceafe92eba08e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)