To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 永??宥??淫??n}永??宥??淫??n{^ 1000100101101001001111110011111110010111010001110011111100111111100010001111101000111111001111110110111001111101100010010110100100111111001111111001011101000111001111110011111110001000111110100011111100111111011011100111101101011110 89693f3f97473f3f88fa3f3f6e7d89693f3f97473f3f88fa3f3f6e7b5e
EUC-JP 永??宥??淫??n}永??宥??淫??n{^ 1011000111001010001111110011111111001101101010000011111100111111101100001111110000111111001111110110111001111101101100011100101000111111001111111100110110101000001111110011111110110000111111000011111100111111011011100111101101011110 b1ca3f3fcda83f3fb0fc3f3f6e7db1ca3f3fcda83f3fb0fc3f3f6e7b5e
UTF-8 永띕슞宥곻쭇淫볧뫝n}永띕슞宥곻쭇淫볧뫝n{^ 1110011010110000101110001110101110011101100101011110110010001010100111101110010110101110101001011110101010110011101110111110110010101101100001111110011010110111101010111110101110110011101001111110101110101011100111010110111001111101111001101011000010111000111010111001110110010101111011001000101010011110111001011010111010100101111010101011001110111011111011001010110110000111111001101011011110101011111010111011001110100111111010111010101110011101011011100111101101011110 e6b0b8eb9d95ec8a9ee5aea5eab3bbecad87e6b7abebb3a7ebab9d6e7de6b0b8eb9d95ec8a9ee5aea5eab3bbecad87e6b7abebb3a7ebab9d6e7b5e
UHC 永띕슞宥곻쭇淫볧뫝n}永띕슞宥곻쭇淫볧뫝n{^ 1110011110110101101101101110101110011010101010101110101011101001100000011110111110100111100000111110101111100010100100111110110110010001101111010110111001111101111001111011010110110110111010111001101010101010111010101110100110000001111011111010011110000011111010111110001010010011111011011001000110111101011011100111101101011110 e7b5b6eb9aaaeae981efa783ebe293ed91bd6e7de7b5b6eb9aaaeae981efa783ebe293ed91bd6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)