To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淨???儀???虞?寃?淨???儀???虞?寃?^ 100111111100010000111111001111110011111110001011010101100011111100111111001111111000101111110001001111111001101110000011001111111001111111000100001111110011111100111111100010110101011000111111001111110011111110001011111100010011111110011011100000110011111101011110 9fc43f3f3f8b563f3f3f8bf13f9b833f9fc43f3f3f8b563f3f3f8bf13f9b833f5e
EUC-JP 淨???儀???虞?寃?淨???儀???虞?寃?^ 110111101100011000111111001111110011111110110101101101110011111100111111001111111011011011110011001111111101010111100011001111111101111011000110001111110011111100111111101101011011011100111111001111110011111110110110111100110011111111010101111000110011111101011110 dec63f3f3fb5b73f3f3fb6f33fd5e33fdec63f3f3fb5b73f3f3fb6f33fd5e33f5e
UTF-8 淨렠欌렪儀븀렏렕虞렧寃넸淨렠欌렪儀븀렏렕虞렧寃넵^ 11100110101101111010100011101011101000001010000011100110101011001000110011101011101000001010101011100101100001001000000011101011101110001000000011101011101000001000111111101011101000001001010111101000100110011001111011101011101000001010011111100101101011111000001111101011100001001011100011100110101101111010100011101011101000001010000011100110101011001000110011101011101000001010101011100101100001001000000011101011101110001000000011101011101000001000111111101011101000001001010111101000100110011001111011101011101000001010011111100101101011111000001111101011100001001011010101011110 e6b7a8eba0a0e6ac8ceba0aae58480ebb880eba08feba095e8999eeba0a7e5af83eb84b8e6b7a8eba0a0e6ac8ceba0aae58480ebb880eba08feba095e8999eeba0a7e5af83eb84b55e
UHC 淨렠欌렪儀븀렏렕虞렧寃넸淨렠欌렪儀븀렏렕虞렧寃넵^ 11101111111001001000111010110001111011011110101110001110101110001110101111110000101110101110011110001110101001011000111010101010111010011110010110001110101101101110101010110010101100111101111011101111111001001000111010110001111011011110101110001110101110001110101111110000101110101110011110001110101001011000111010101010111010011110010110001110101101101110101010110010101100111101110001011110 efe48eb1edeb8eb8ebf0bae78ea58eaae9e58eb6eab2b3deefe48eb1edeb8eb8ebf0bae78ea58eaae9e58eb6eab2b3dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)