To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ’C’Z’C’Z[’C’Z’C’Z[^ 110000101001001001000011110000101001001001011010110000101001001001000011110000101001001001011010010110111100001010010010010000111100001010010010010110101100001010010010010000111100001010010010010110100101101101011110 c29243c2925ac29243c2925a5bc29243c2925ac29243c2925a5b5e
SJIS-WIN ??C??Z??C??Z[??C??Z??C??Z[^ 001111110011111101000011001111110011111101011010001111110011111101000011001111110011111101011010010110110011111100111111010000110011111100111111010110100011111100111111010000110011111100111111010110100101101101011110 3f3f433f3f5a3f3f433f3f5a5b3f3f433f3f5a3f3f433f3f5a5b5e
EUC-JP Â?CÂ?ZÂ?CÂ?Z[Â?CÂ?ZÂ?CÂ?Z[^ 10001111101010101010010000111111010000111000111110101010101001000011111101011010100011111010101010100100001111110100001110001111101010101010010000111111010110100101101110001111101010101010010000111111010000111000111110101010101001000011111101011010100011111010101010100100001111110100001110001111101010101010010000111111010110100101101101011110 8faaa43f438faaa43f5a8faaa43f438faaa43f5a5b8faaa43f438faaa43f5a8faaa43f438faaa43f5a5b5e
UTF-8 ’C’Z’C’Z[’C’Z’C’Z[^ 11000011100000101100001010010010010000111100001110000010110000101001001001011010110000111000001011000010100100100100001111000011100000101100001010010010010110100101101111000011100000101100001010010010010000111100001110000010110000101001001001011010110000111000001011000010100100100100001111000011100000101100001010010010010110100101101101011110 c382c29243c382c2925ac382c29243c382c2925a5bc382c29243c382c2925ac382c29243c382c2925a5b5e
UHC ??C??Z??C??Z[??C??Z??C??Z[^ 001111110011111101000011001111110011111101011010001111110011111101000011001111110011111101011010010110110011111100111111010000110011111100111111010110100011111100111111010000110011111100111111010110100101101101011110 3f3f433f3f5a3f3f433f3f5a5b3f3f433f3f5a3f3f433f3f5a5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)