What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 瀋壤?瀋壤?^ 1110000001100011100110101101111100111111111000000110001110011010110111110011111101011110 e0639adf3fe0639adf3f5e
EUC-JP 瀋壤?瀋壤?^ 1101111111000100110101001110000100111111110111111100010011010100111000010011111101011110 dfc4d4e13fdfc4d4e13f5e
UTF-8 瀋壤걂瀋壤걂^ 11100111100000001000101111100101101000111010010011101010101100011000001011100111100000001000101111100101101000111010010011101010101100011000001001011110 e7808be5a3a4eab182e7808be5a3a4eab1825e
UHC 瀋壤걂瀋壤걂^ 11100100101000111110010110111101100000010110101011100100101000111110010110111101100000010110101001011110 e4a3e5bd816ae4a3e5bd816a5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent are shown as "?" (3f).
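
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind the page; it only illustrates the idea. The codec names (latin-1, cp932, euc_jp, utf-8, cp949) are Python's, and pairing them with the table's charset labels is an assumption, as are the helper names encode_row and convert. Unrepresentable characters are replaced with "?" to mirror the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset cannot represent.
    data = text.encode(codec, errors="replace")
    shown = data.decode(codec, errors="replace")
    binary = "".join(format(byte, "08b") for byte in data)  # 8 bits per byte
    return shown, binary, data.hex()


def convert(text):
    """Encode the text in every charset listed in CHARSETS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (_, binary, hexa) in convert("瀋壤걂^").items():
        print(label, binary, hexa)
```

For the input 瀋壤걂^ the UTF-8 row comes out as e7808be5a3a4eab1825e and the ISO-8859-1 row as 3f3f3f5e, which agrees with the table. The other rows depend on the mapping tables of the codec implementation in use, so they may differ slightly from the page's output.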