To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN 賊兢??賊兢??B 10010001101011111001100101011101001111110011111110010001101011111001100101011101001111110011111101000010 91af995d3f3f91af995d3f3f42
EUC-JP 賊兢??賊兢??B 11000010101100011101000110111110001111110011111111000010101100011101000110111110001111110011111101000010 c2b1d1be3f3fc2b1d1be3f3f42
UTF-8 賊兢렍테賊兢렍테B 11101000101100111000101011100101100001011010001011101011101000001000110111101101100001011000110011101000101100111000101011100101100001011010001011101011101000001000110111101101100001011000110001000010 e8b38ae585a2eba08ded858ce8b38ae585a2eba08ded858c42
UHC 賊兢렍테賊兢렍테B 1110111011100100110100001110011110001110101000111100010111010111111011101110010011010000111001111000111010100011110001011101011101000010 eee4d0e78ea3c5d7eee4d0e78ea3c5d742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)