What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN ??迂???迂?B 0011111100111111100010010100100100111111001111110011111110001001010010010011111101000010 3f3f89493f3f3f89493f42
EUC-JP 醬?迂?醬?迂?B 100011111110001011110011001111111011000110101010001111111000111111100010111100110011111110110001101010100011111101000010 8fe2f33fb1aa3f8fe2f33fb1aa3f42
UTF-8 醬렓迂렖醬렓迂렖B 11101001100001101010110011101011101000001001001111101000101111111000001011101011101000001001011011101001100001101010110011101011101000001001001111101000101111111000001011101011101000001001011001000010 e986aceba093e8bf82eba096e986aceba093e8bf82eba09642
UHC 醬렓迂렖醬렓迂렖B 1110110111111101100011101010100011101001111001101000111010101011111011011111110110001110101010001110100111100110100011101010101101000010 edfd8ea8e9e68eabedfd8ea8e9e68eab42

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
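
A conversion like the one in the table above can be approximated in Python. This is a minimal sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, as are the helper names `encode_row` and `convert`. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the table. Python's codecs may not agree with this page byte for byte for every character.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with codec and return (binary string, hex string).

    Characters the codec cannot represent are replaced with '?'.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("迂B").items():
        print(label, bits, hex_string)
```

Calling `convert` with the same input as the table should produce rows in the same layout, one per charset, with the binary and hexadecimal columns derived from the same encoded bytes.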