Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 淨?弔???伊豆?矜???紆?寃???垣? 1001111111000100001111111001001010100010001111110011111100111111100010001100100110010011101001000011111111100001111000000011111100111111001111111110001011111100001111111001101110000011001111110011111100111111100010100101111100111111 9fc43f92a23f3f3f88c993a43fe1e03f3f3fe2fc3f9b833f3f3f8a5f3f
EUC-JP 淨?弔???伊豆?矜???紆?寃???垣? 1101111011000110001111111100010010100100001111110011111100111111101100001100101111000110101001100011111111100010111000100011111100111111001111111110010011111110001111111101010111100011001111110011111100111111101100111100000000111111 dec63fc4a43f3f3fb0cbc6a63fe2e23f3f3fe4fe3fd5e33f3f3fb3c03f
UTF-8 淨렠弔렟罹렗伊豆렚矜썬欌렪紆렣寃닿렱렟垣렖 111001101011011110101000111010111010000010100000111001011011110010010100111010111010000010011111111011111010011110100110111010111010000010010111111001001011110010001010111010001011000110000110111010111010000010011010111001111001111110011100111011001000110110101100111001101010110010001100111010111010000010101010111001111011010010000110111010111010000010100011111001011010111110000011111010111000101110111111111010111010000010110001111010111010000010011111111001011001111010100011111010111010000010010110 e6b7a8eba0a0e5bc94eba09fefa7a6eba097e4bc8ae8b186eba09ae79f9cec8dace6ac8ceba0aae7b486eba0a3e5af83eb8bbfeba0b1eba09fe59ea3eba096
UHC 淨렠弔렟罹렗伊豆렚矜썬欌렪紆렣寃닿렱렟垣렖 111011111110010010001110101100011111000011000000100011101011000011101100101110101000111010101100111011001010010111010100111001111000111010101101110100001110100010111101111000111110110111101011100011101011100011101001111000011000111010110100111010101011001010110100111010101000111010111110100011101011000011101010101011111000111010101011 efe48eb1f0c08eb0ecba8eaceca5d4e78eadd0e8bde3edeb8eb8e9e18eb4eab2b4ea8ebe8eb0eaaf8eab

SJIS-Win, EUC-JP: legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
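
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's `cp932` codec stands in for SJIS-Win and `cp949` for UHC, and that characters a charset cannot represent are replaced with `?` (0x3f), as the ISO-8859-1 row suggests. The names `CHARSETS`, `encode_row`, and `encode_table` are made up for this illustration.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
# cp932 for SJIS-WIN and cp949 for UHC are approximations, not confirmed equivalents.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" turns unencodable characters into "?" (byte 0x3f).
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("伊").items():
        print(label, bits, hex_string)
```

For the single character 伊, this sketch yields `88c9` for SJIS-WIN, `b0cb` for EUC-JP, and `e4bc8a` for UTF-8, which agrees with the corresponding bytes in the table above.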