Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 淨?弔???伊豆?矜???紆?寃??怨峯? 100111111100010000111111100100101010001000111111001111110011111110001000110010011001001110100100001111111110000111100000001111110011111100111111111000101111110000111111100110111000001100111111001111111000100110000101100101011111010100111111 9fc43f92a23f3f3f88c993a43fe1e03f3f3fe2fc3f9b833f3f898595f53f
EUC-JP 淨?弔???伊豆?矜???紆?寃??怨峯? 110111101100011000111111110001001010010000111111001111110011111110110000110010111100011010100110001111111110001011100010001111110011111100111111111001001111111000111111110101011110001100111111001111111011000111100101110010101111011100111111 dec63fc4a43f3f3fb0cbc6a63fe2e23f3f3fe4fe3fd5e33f3fb1e5caf73f
UTF-8 淨렠弔렟罹렗伊豆렚矜썬欌렪紆렣寃닿㉢怨峯렚 111001101011011110101000111010111010000010100000111001011011110010010100111010111010000010011111111011111010011110100110111010111010000010010111111001001011110010001010111010001011000110000110111010111010000010011010111001111001111110011100111011001000110110101100111001101010110010001100111010111010000010101010111001111011010010000110111010111010000010100011111001011010111110000011111010111000101110111111111000111000100110100010111001101000000010101000111001011011001110101111111010111010000010011010 e6b7a8eba0a0e5bc94eba09fefa7a6eba097e4bc8ae8b186eba09ae79f9cec8dace6ac8ceba0aae7b486eba0a3e5af83eb8bbfe389a2e680a8e5b3afeba09a
UHC 淨렠弔렟罹렗伊豆렚矜썬欌렪紆렣寃닿㉢怨峯렚 111011111110010010001110101100011111000011000000100011101011000011101100101110101000111010101100111011001010010111010100111001111000111010101101110100001110100010111101111000111110110111101011100011101011100011101001111000011000111010110100111010101011001010110100111010101010100010110011111010101011001111011100111001111000111010101101 efe48eb1f0c08eb0ecba8eaceca5d4e78eadd0e8bde3edeb8eb8e9e18eb4eab2b4eaa8b3eab3dce78ead

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
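
The conversion shown in the table can be approximated offline with a short script. The following is a minimal sketch, not the code behind this page: the function names encode_row and encode_table are hypothetical, and the mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is what the ISO-8859-1 row above appears to show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("伊豆").items():
        print(label, bits, hexa)
```

For the input "伊豆", this sketch yields e4bc8ae8b186 for UTF-8, 88c993a4 for SJIS-WIN, and b0cbc6a6 for EUC-JP, which match the byte sequences for those two characters in the rows above.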