To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???徇????徇?^ 00111111001111110011111110011100011011010011111100111111001111110011111110011100011011010011111101011110 3f3f3f9c6d3f3f3f3f9c6d3f5e
EUC-JP ???徇????徇?^ 00111111001111110011111111010111110011100011111100111111001111110011111111010111110011100011111101011110 3f3f3fd7ce3f3f3f3fd7ce3f5e
UTF-8 曆뤻숱徇륧曆뤻숱徇륧^ 11101111101001101000101111101011101001001011101111101100100010001011000111100101101111101000011111101011101001011010011111101111101001101000101111101011101001001011101111101100100010001011000111100101101111101000011111101011101001011010011101011110 efa68beba4bbec88b1e5be87eba5a7efa68beba4bbec88b1e5be87eba5a75e
UHC 曆뤻숱徇륧曆뤻숱徇륧^ 111001101011011110001111111010011011110110100010111000101101111110010000010011001110011010110111100011111110100110111101101000101110001011011111100100000100110001011110 e6b78fe9bda2e2df904ce6b78fe9bda2e2df904c5e

SJIS-WIN, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
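
A table like the one above can be approximated offline with a short script. The following is a minimal sketch, not the converter's actual implementation, which is not shown here. It assumes that Python's `cp932` and `cp949` codecs correspond to SJIS-WIN and UHC, and that characters a charset cannot represent are replaced with `?` (0x3f), as the ISO-8859-1 row suggests. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical.

```python
# Assumed mapping from the charset names in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is Python's equivalent of SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 is Python's equivalent of UHC
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Characters the charset cannot represent are replaced with '?'.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip in this charset.
    shown = raw.decode(codec, errors="replace")
    # Each byte is rendered as eight bits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def encode_table(text):
    """Encode the text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (shown, bits, hex_string) in encode_table("あ^").items():
        print(name, shown, bits, hex_string)
```

The round trip through `decode` is what makes the "Character" column show `?` for unsupported characters; the bit string and hex string are two renderings of the same encoded bytes.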