To which bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Byte string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 驗堤「∝ョヲ魄渙シ驗堤「∝ョヲ魄渙コ^ 1110100110000100100100101110011110100010100000011110010110101110101001101110100110101110100111111101000010111100111010011000010010010010111001111010001010000001111001011010111010100110111010011010111010011111110100001011101001011110 e98492e7a281e5aea6e9ae9fd0bce98492e7a281e5aea6e9ae9fd0ba5e
EUC-JP 驗堤「∝ョヲ魄渙シ驗堤「∝ョヲ魄渙コ^ 11110001111001001100010011101001100011101010001010100010111001111000111010101110100011101010011011110010101100001101111011010010100011101011110011110001111001001100010011101001100011101010001010100010111001111000111010101110100011101010011011110010101100001101111011010010100011101011101001011110 f1e4c4e98ea2a2e78eae8ea6f2b0ded28ebcf1e4c4e98ea2a2e78eae8ea6f2b0ded28eba5e
UTF-8 驗堤「∝ョヲ魄渙シ驗堤「∝ョヲ魄渙コ^ 11101001101010011001011111100101101000001010010011101111101111011010001011100010100010001001110111101111101111011010111011101111101111011010011011101001101011011000010011100110101110001001100111101111101111011011110011101001101010011001011111100101101000001010010011101111101111011010001011100010100010001001110111101111101111011010111011101111101111011010011011101001101011011000010011100110101110001001100111101111101111011011101001011110 e9a997e5a0a4efbda2e2889defbdaeefbda6e9ad84e6b899efbdbce9a997e5a0a4efbda2e2889defbdaeefbda6e9ad84e6b899efbdba5e
UHC 驗堤?∝??魄渙?驗堤?∝??魄渙?^ 1111101011010000111100001010011100111111101000011111000000111111001111111101101111011110111111001011100100111111111110101101000011110000101001110011111110100001111100000011111100111111110110111101111011111100101110010011111101011110 fad0f0a73fa1f03f3fdbdefcb93ffad0f0a73fa1f03f3fdbdefcb93f5e

SJIS-Win, EUC-JP: legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
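
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the tool's actual implementation: the function name `encode_table` and the charset-to-codec mapping are assumptions made for illustration (in particular, UHC is mapped to Python's `cp949` codec and SJIS-WIN to `cp932`). Characters that a charset cannot represent are replaced with `?` (hex `3f`), which appears to match the `3f` bytes in the ISO-8859-1 and UHC rows above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexa in encode_table("^"):
        print(label, bits, hexa)
```

For the input `^`, every row prints `01011110` and `5e`, because `^` is an ASCII character and all five charsets encode ASCII identically.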