To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 堯??堯??堯??n}堯??堯??堯??n{^ 1110101010011111001111110011111111101010100111110011111100111111111010101001111100111111001111110110111001111101111010101001111100111111001111111110101010011111001111110011111111101010100111110011111100111111011011100111101101011110 ea9f3f3fea9f3f3fea9f3f3f6e7dea9f3f3fea9f3f3fea9f3f3f6e7b5e
EUC-JP 堯??堯??堯??n}堯??堯??堯??n{^ 1111010010100001001111110011111111110100101000010011111100111111111101001010000100111111001111110110111001111101111101001010000100111111001111111111010010100001001111110011111111110100101000010011111100111111011011100111101101011110 f4a13f3ff4a13f3ff4a13f3f6e7df4a13f3ff4a13f3ff4a13f3f6e7b5e
UTF-8 堯붿뼼堯붷젾堯붾퍋n}堯붿뼼堯붷젾堯붾퍋n{^ 1110010110100000101011111110101110110110101111111110101110111100101111001110010110100000101011111110101110110110101101111110110010100000101111101110010110100000101011111110101110110110101111101110110110001101100010110110111001111101111001011010000010101111111010111011011010111111111010111011110010111100111001011010000010101111111010111011011010110111111011001010000010111110111001011010000010101111111010111011011010111110111011011000110110001011011011100111101101011110 e5a0afebb6bfebbcbce5a0afebb6b7eca0bee5a0afebb6beed8d8b6e7de5a0afebb6bfebbcbce5a0afebb6b7eca0bee5a0afebb6beed8d8b6e7b5e
UHC 堯붿뼼堯붷젾堯붾퍋n}堯붿뼼堯붷젾堯붾퍋n{^ 1110100011101011100101001110110010010110101111111110100011101011100101001110010110100000101100001110100011101011100101001110101110111011100000100110111001111101111010001110101110010100111011001001011010111111111010001110101110010100111001011010000010110000111010001110101110010100111010111011101110000010011011100111101101011110 e8eb94ec96bfe8eb94e5a0b0e8eb94ebbb826e7de8eb94ec96bfe8eb94e5a0b0e8eb94ebbb826e7b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win, i.e. CP932) and on UNIX (EUC-JP) respectively.
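
The conversion in the table above can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names (`latin-1`, `cp932`, `euc_jp`, `cp949`) are Python's nearest equivalents and are an assumption, since their mapping tables may differ in a few characters from the ones used here. The `CODECS` mapping and the `encode_table` function are hypothetical names introduced for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the runs of 3f in the table.

```python
# Hypothetical mapping from the charset labels used on this page to
# Python codec names (assumed nearest equivalents, not guaranteed identical).
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("堯"):
        print(label, bits, hexstr)
```

For the single character 堯, this gives e5a0af in UTF-8, ea9f in CP932 and f4a1 in EUC-JP, matching the leading bytes of the corresponding rows in the table.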