To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 甕??擁??輿??B 11100001010100000011111100111111100101110110100100111111001111111001011101100000001111110011111101000010 e1503f3f97693f3f97603f3f42
EUC-JP 甕??擁??輿??B 11100001101100010011111100111111110011011100101000111111001111111100110111000001001111110011111101000010 e1b13f3fcdca3f3fcdc13f3f42
UTF-8 甕됪쳛擁녘툒輿곮떨B 11100111100101001001010111101011100100001010101011101100101100111001101111100110100100111000000111101011100001011001100011101101100010001001001011101000101111001011111111101010101100111010111011101011100101101010100001000010 e79495eb90aaecb39be69381eb8598ed8892e8bcbfeab3aeeb96a842
UHC 甕됪쳛擁녘툒輿곮떨B 11101000101110001000100111100110101010111000000111101000101101101011001111101000101110001000100111100110101010111000000111101000101101101011001101000010 e8b889e6ab81e8b6b3e8b889e6ab81e8b6b342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)