To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????E 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 臆??釗?臆??釗?E 100010011011000000111111001111111111101110111011001111111000100110110000001111110011111111111011101110110011111101000101 89b03f3ffbbb3f89b03f3ffbbb3f45
EUC-JP 臆??釗?臆??釗?E 1011001010110010001111110011111110001111111000111010011000111111101100101011001000111111001111111000111111100011101001100011111101000101 b2b23f3f8fe3a63fb2b23f3f8fe3a63f45
UTF-8 臆믭쫼釗셖臆믭쫼釗셚E 11101000100001111000011011101011101011111010110111101100101010111011110011101001100001111001011111101100100001011001011011101000100001111000011011101011101011111010110111101100101010111011110011101001100001111001011111101100100001011001101001000101 e88786ebafadecabbce98797ec8596e88786ebafadecabbce98797ec859a45
UHC 臆믭쫼釗셖臆믭쫼釗셚E 111001011110011010010010111011111010011010010011111000011111001010011001010110011110010111100110100100101110111110100110100100111110000111110010100110010110001001000101 e5e692efa693e1f29959e5e692efa693e1f2996245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)