To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN テァツコツ湘ュツ什テ」ツョツづ、ツ慎テ「ツィ 1100001110100111110000101011101011000010100011111100001110101101110000101000111101011001110000111010001111000010101011101100001010000010110000111010010011000010100100000101010011000011101000101100001010101000 c3a7c2bac28fc3adc28f59c3a3c2aec282c3a4c29054c3a2c2a8
EUC-JP テァツコツ湘ュツ什テ」ツョツづ、ツ慎テ「ツィ 1000111011000011100011101010011110001110110000101000111010111010100011101100001010111110110001011000111010101101100011101100001010111101101110101000111011000011100011101010001110001110110000101000111010101110100011101100001010100100110001011000111010100100100011101100001010111111101101011000111011000011100011101010001010001110110000101000111010101000 8ec38ea78ec28eba8ec2bec58ead8ec2bdba8ec38ea38ec28eae8ec2a4c58ea48ec2bfb58ec38ea28ec28ea8
UTF-8 テァツコツ湘ュツ什テ」ツョツづ、ツ慎テ「ツィ 111011111011111010000011111011111011110110100111111011111011111010000010111011111011110110111010111011111011111010000010111001101011100110011000111011111011110110101101111011111011111010000010111001001011101110000000111011111011111010000011111011111011110110100011111011111011111010000010111011111011110110101110111011111011111010000010111000111000000110100101111011111011110110100100111011111011111010000010111001101000010110001110111011111011111010000011111011111011110110100010111011111011111010000010111011111011110110101000 efbe83efbda7efbe82efbdbaefbe82e6b998efbdadefbe82e4bb80efbe83efbda3efbe82efbdaeefbe82e381a5efbda4efbe82e6858eefbe83efbda2efbe82efbda8
UHC ?????湘??什?????づ??????? 00111111001111110011111100111111001111111101111111001111001111110011111111100100101001110011111100111111001111110011111100111111101010101100010100111111001111110011111100111111001111110011111100111111 3f3f3f3f3fdfcf3f3fe4a73f3f3f3f3faac53f3f3f3f3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)