What bit string is a character (or string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 紙?新?ぎ頃?拔ぎ拒紙?新?ぎ頃?拔ぎ居^ 1000111010000110001111111001000001010110001111111000001010101100100011011010000000111111100111010101010110000010101011001000101110010001100011101000011000111111100100000101011000111111100000101010110010001101101000000011111110011101010101011000001010101100100010111000111101011110 8e863f90563f82ac8da03f9d5582ac8b918e863f90563f82ac8da03f9d5582ac8b8f5e
EUC-JP 紙?新?ぎ頃?拔ぎ拒紙?新?ぎ頃?拔ぎ居^ 1011101111100110001111111011111110110111001111111010010010101110101110101010001000111111110110011011011010100100101011101011010111110001101110111110011000111111101111111011011100111111101001001010111010111010101000100011111111011001101101101010010010101110101101011110111101011110 bbe63fbfb73fa4aebaa23fd9b6a4aeb5f1bbe63fbfb73fa4aebaa23fd9b6a4aeb5ef5e
UTF-8 紙렧新저ぎ頃렑拔ぎ拒紙렧新저ぎ頃렑拔ぎ居^ 11100111101101001001100111101011101000001010011111100110100101101011000011101100101000001000000011100011100000011000111011101001101000001000001111101011101000001001000111100110100010111001010011100011100000011000111011100110100010111001001011100111101101001001100111101011101000001010011111100110100101101011000011101100101000001000000011100011100000011000111011101001101000001000001111101011101000001001000111100110100010111001010011100011100000011000111011100101101100011000010101011110 e7b499eba0a7e696b0eca080e3818ee9a083eba091e68b94e3818ee68b92e7b499eba0a7e696b0eca080e3818ee9a083eba091e68b94e3818ee5b1855e
UHC 紙렧新저ぎ頃렑拔ぎ拒紙렧新저ぎ頃렑拔ぎ居^ 1111001010110101100011101011011011100011111001101100000011111010101010101010111011001100111100011000111010100110110110101111101110101010101011101100101111011110111100101011010110001110101101101110001111100110110000001111101010101010101011101100110011110001100011101010011011011010111110111010101010101110110010111101110001011110 f2b58eb6e3e6c0faaaaeccf18ea6dafbaaaecbdef2b58eb6e3e6c0faaaaeccf18ea6dafbaaaecbdc5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
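
The conversion shown in the table can be approximated offline with a minimal Python sketch. It is not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table does as well.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = CP949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed above."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in convert("紙^").items():
        print(name, bits, hex_string)
```

For the input `紙^`, the UTF-8 row comes out as `e7b4995e`, which matches the first three bytes and the final byte of the UTF-8 row in the table above.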