To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淨?儀??肯??疑?∧淨?儀??肯??疑?∧^ 100111111100010000111111100010110101011000111111001111111000110101101101001111110011111110001011010111100011111110000001110010001001111111000100001111111000101101010110001111110011111110001101011011010011111100111111100010110101111000111111100000011100100001011110 9fc43f8b563f3f8d6d3f3f8b5e3f81c89fc43f8b563f3f8d6d3f3f8b5e3f81c85e
EUC-JP 淨?儀??肯??疑?∧淨?儀??肯??疑?∧^ 110111101100011000111111101101011011011100111111001111111011100111001110001111110011111110110101101111110011111110100010110010101101111011000110001111111011010110110111001111110011111110111001110011100011111100111111101101011011111100111111101000101100101001011110 dec63fb5b73f3fb9ce3f3fb5bf3fa2cadec63fb5b73f3fb9ce3f3fb5bf3fa2ca5e
UTF-8 淨렠儀븀렚肯렖렕疑얜∧淨렠儀븀렚肯렖렕疑얜∧^ 11100110101101111010100011101011101000001010000011100101100001001000000011101011101110001000000011101011101000001001101011101000100000101010111111101011101000001001011011101011101000001001010111100111100101101001000111101100100101101001110011100010100010001010011111100110101101111010100011101011101000001010000011100101100001001000000011101011101110001000000011101011101000001001101011101000100000101010111111101011101000001001011011101011101000001001010111100111100101101001000111101100100101101001110011100010100010001010011101011110 e6b7a8eba0a0e58480ebb880eba09ae882afeba096eba095e79691ec969ce288a7e6b7a8eba0a0e58480ebb880eba09ae882afeba096eba095e79691ec969ce288a75e
UHC 淨렠儀븀렚肯렖렕疑얜∧淨렠儀븀렚肯렖렕疑얜∧^ 111011111110010010001110101100011110101111110000101110101110011110001110101011011101000011101001100011101010101110001110101010101110101111110111101111101110101110100001111111001110111111100100100011101011000111101011111100001011101011100111100011101010110111010000111010011000111010101011100011101010101011101011111101111011111011101011101000011111110001011110 efe48eb1ebf0bae78eadd0e98eab8eaaebf7beeba1fcefe48eb1ebf0bae78eadd0e98eab8eaaebf7beeba1fc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)