What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????r[?????????r[^ 0011111100111111001111110011111100111111001111110011111100111111001111110111001001011011001111110011111100111111001111110011111100111111001111110011111100111111011100100101101101011110 3f3f3f3f3f3f3f3f3f725b3f3f3f3f3f3f3f3f3f725b5e
SJIS-WIN 茹?????烏??r[茹?????烏??r[^ 111001001010010100111111001111110011111100111111001111111000100101000111001111110011111101110010010110111110010010100101001111110011111100111111001111110011111110001001010001110011111100111111011100100101101101011110 e4a53f3f3f3f3f89473f3f725be4a53f3f3f3f3f89473f3f725b5e
EUC-JP 茹?????烏??r[茹?????烏??r[^ 111010001010011100111111001111110011111100111111001111111011000110101000001111110011111101110010010110111110100010100111001111110011111100111111001111110011111110110001101010000011111100111111011100100101101101011110 e8a73f3f3f3f3fb1a83f3f725be8a73f3f3f3f3fb1a83f3f725b5e
UTF-8 茹됰젪歷쎈젣烏녿젍r[茹됰젪歷쎈젣烏녿젍r[^ 1110100010001100101110011110101110010000101100001110110010100000101010101110111110100110100011001110110010001110100010001110110010100000101000111110011110000011100011111110101110000101101111111110110010100000100011010111001001011011111010001000110010111001111010111001000010110000111011001010000010101010111011111010011010001100111011001000111010001000111011001010000010100011111001111000001110001111111010111000010110111111111011001010000010001101011100100101101101011110 e88cb9eb90b0eca0aaefa68cec8e88eca0a3e7838feb85bfeca08d725be88cb9eb90b0eca0aaefa68cec8e88eca0a3e7838feb85bfeca08d725b5e
UHC 茹됰젪歷쎈젣烏녿젍r[茹됰젪歷쎈젣烏녿젍r[^ 1110011010101010100010011110101110100000101000101110011010111000101111011110101110100000100111001110100010100001100001101110101110100000100011100111001001011011111001101010101010001001111010111010000010100010111001101011100010111101111010111010000010011100111010001010000110000110111010111010000010001110011100100101101101011110 e6aa89eba0a2e6b8bdeba09ce8a186eba08e725be6aa89eba0a2e6b8bdeba09ce8a186eba08e725b5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
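
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the implementation behind this page. The mapping from the table's charset labels to Python codec names (CODECS) and the helper names encode_row and encode_table are assumptions made for illustration. Python's cp932 and cp949 codecs are assumed to correspond to SJIS-WIN and UHC, and may differ from this tool in edge cases. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These names are illustrative choices, not taken from the original tool.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hexadecimal string) for text in the given codec."""
    # errors="replace" turns unencodable characters into "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("r[^").items():
        print(label, bits, hex_string)
```

For the ASCII input "r[^", every charset in this sketch yields the same bytes (72 5b 5e), which is why all rows of the table end in 725b5e. The rows differ only where the input contains non-ASCII characters.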