To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 額?????永??淫??鍮?????譽??^ 100010100111101000111111001111110011111100111111001111111000100101101001001111110011111110001000111110100011111100111111111010000100101000111111001111110011111100111111001111111110011010100011001111110011111101011110 8a7a3f3f3f3f3f89693f3f88fa3f3fe84a3f3f3f3f3fe6a33f3f5e
EUC-JP 額?????永??淫??鍮?????譽??^ 101100111101101100111111001111110011111100111111001111111011000111001010001111110011111110110000111111000011111100111111111011111010101100111111001111110011111100111111001111111110110010100101001111110011111101011110 b3db3f3f3f3f3fb1ca3f3fb0fc3f3fefab3f3f3f3f3feca53f3f5e
UTF-8 額ㅻ퉬溜곕젒永귞탢淫욃젮鍮꾨젾溜묊탞譽쏁봽^ 11101001101000011000110111100011100001011011101111101101100010011010110011101111101001111000101111101010101100111001010111101100101000001001001011100110101100001011100011101010101101111001111011101101100000111010001011100110101101111010101111101100100110101000001111101100101000001010111011101001100011011010111011101010101111101010100011101100101000001011111011101111101001111000101111101011101011001000101011101101100000111001111011101000101011011011110111101100100011111000000111101011101101001011110101011110 e9a18de385bbed89acefa78beab395eca092e6b0b8eab79eed83a2e6b7abec9a83eca0aee98daeeabea8eca0beefa78bebac8aed839ee8adbdec8f81ebb4bd5e
UHC 額ㅻ퉬溜곕젒永귞탢淫욃젮鍮꾨젾溜묊탞譽쏁봽^ 11100100111111101010010011101011101110011000010011101010111111101011000011101011101000001001000111100111101101011000001011100111101101011000010111101011111000101001111011100101101000001010010011101011101110011000010011101011101000001011000011101010111111101001000111100111101101011000001011100111111000101001101111100111100101001000010001011110 e4fea4ebb984eafeb0eba091e7b582e7b585ebe29ee5a0a4ebb984eba0b0eafe91e7b582e7e29be794845e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)