To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??~?????DB 00111111001111110111111000111111001111110011111100111111001111110100010001000010 3f3f7e3f3f3f3f3f4442
SJIS-WIN テァ~ツ青可環ゥDB 11000011101001110111111011000010100100001100001010001001110000101000101011000010101010010100010001000010 c3a77ec290c289c28ac2a94442
EUC-JP テァ~ツ青可環ゥDB 1000111011000011100011101010011101111110100011101100001011000000110001001011001011000100101101001100010010001110101010010100010001000010 8ec38ea77e8ec2c0c4b2c4b4c48ea94442
UTF-8 テァ~ツ青可環ゥDB 111011111011111010000011111011111011110110100111011111101110111110111110100000101110100110011101100100101110010110001111101011111110011110010010101100001110111110111101101010010100010001000010 efbe83efbda77eefbe82e99d92e58fafe792b0efbda94442
UHC ??~??可環?DB 001111110011111101111110001111110011111111001010101001101111110010111011001111110100010001000010 3f3f7e3f3fcaa6fcbb3f4442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)