What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????GB 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100011101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4742
SJIS-WIN テッツァツオテォツ・ツづャツ可サGB 1100001110101111110000101010011111000010101101011100001110101011110000101010010111000010100000101100001110101100110000101000100111000010101110110100011101000010 c3afc2a7c2b5c3abc2a5c282c3acc289c2bb4742
EUC-JP テッツァツオテォツ・ツづャツ可サGB 10001110110000111000111010101111100011101100001010001110101001111000111011000010100011101011010110001110110000111000111010101011100011101100001010001110101001011000111011000010101001001100010110001110101011001000111011000010101100101100010010001110101110110100011101000010 8ec38eaf8ec28ea78ec28eb58ec38eab8ec28ea58ec2a4c58eac8ec2b2c48ebb4742
UTF-8 テッツァツオテォツ・ツづャツ可サGB 1110111110111110100000111110111110111101101011111110111110111110100000101110111110111101101001111110111110111110100000101110111110111101101101011110111110111110100000111110111110111101101010111110111110111110100000101110111110111101101001011110111110111110100000101110001110000001101001011110111110111101101011001110111110111110100000101110010110001111101011111110111110111101101110110100011101000010 efbe83efbdafefbe82efbda7efbe82efbdb5efbe83efbdabefbe82efbda5efbe82e381a5efbdacefbe82e58fafefbdbb4742
UHC ???????????づ??可?GB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111101010101100010100111111001111111100101010100110001111110100011101000010 3f3f3f3f3f3f3f3f3f3f3faac53f3fcaa63f4742

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
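
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) is an assumption, and the names CODECS, encode_row and convert are made up for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f runs in the ISO-8859-1 and UHC rows.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption; the page's own converter may differ in edge cases.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary, hexadecimal) bit strings for text encoded with codec."""
    # errors="replace" substitutes "?" (byte 0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(format(byte, "08b") for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("可GB").items():
        print(label, binary, hexadecimal)
```

For example, encode_row("GB", "utf-8") returns ("0100011101000010", "4742"), the same trailing bytes that every row of the table ends with.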