To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??e????e??B 0011111100111111011001010011111100111111001111110011111101100101001111110011111101000010 3f3f653f3f3f3f653f3f42
SJIS-WIN テ。eツ テ。eツ B 11000011101000010110010111000010100000010100000011000011101000010110010111000010100000010100000001000010 c3a165c28140c3a165c2814042
EUC-JP テ。eツ テ。eツ B 10001110110000111000111010100001011001011000111011000010101000011010000110001110110000111000111010100001011001011000111011000010101000011010000101000010 8ec38ea1658ec2a1a18ec38ea1658ec2a1a142
UTF-8 テ。eツ テ。eツ B 111011111011111010000011111011111011110110100001011001011110111110111110100000101110001110000000100000001110111110111110100000111110111110111101101000010110010111101111101111101000001011100011100000001000000001000010 efbe83efbda165efbe82e38080efbe83efbda165efbe82e3808042
UHC ??e? ??e? B 00111111001111110110010100111111101000011010000100111111001111110110010100111111101000011010000101000010 3f3f653fa1a13f3f653fa1a142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)