To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 將?紆?虞?弔????將???衣???寃?? 1001101110010010001111111110001011111100001111111000101111110001001111111001001010100010001111110011111100111111001111111001101110010010001111110011111100111111100010001101111100111111001111110011111110011011100000110011111100111111 9b923fe2fc3f8bf13f92a23f3f3f3f9b923f3f3f88df3f3f3f9b833f3f
EUC-JP 將?紆?虞?弔?勖??將???衣???寃?? 11010101111100100011111111100100111111100011111110110110111100110011111111000100101001000011111110001111101100111110110100111111001111111101010111110010001111110011111100111111101100001110000100111111001111110011111111010101111000110011111100111111 d5f23fe4fe3fb6f33fc4a43f8fb3ed3f3fd5f23f3f3fb0e13f3f3fd5e33f3f
UTF-8 將렚紆렣虞렧弔렲勖쾌욱將렚罹렗衣쯔렱렲寃당렟 111001011011000010000111111010111010000010011010111001111011010010000110111010111010000010100011111010001001100110011110111010111010000010100111111001011011110010010100111010111010000010110010111001011000101110010110111011001011111010001100111011001001101010110001111001011011000010000111111010111010000010011010111011111010011110100110111010111010000010010111111010001010000110100011111011001010111110010100111010111010000010110001111010111010000010110010111001011010111110000011111010111000101110111001111010111010000010011111 e5b087eba09ae7b486eba0a3e8999eeba0a7e5bc94eba0b2e58b96ecbe8cec9ab1e5b087eba09aefa7a6eba097e8a1a3ecaf94eba0b1eba0b2e5af83eb8bb9eba09f
UHC 將렚紆렣虞렧弔렲勖쾌욱將렚罹렗衣쯔렱렲寃당렟 1110110111100010100011101010110111101001111000011000111010110100111010011110010110001110101101101111000011000000100011101011111111101001111011011100010011101000101111111110110111101101111000101000111010101101111011001011101010001110101011001110101111111101110000101110101010001110101111101000111010111111111010101011001010110100111001111000111010110000 ede28eade9e18eb4e9e58eb6f0c08ebfe9edc4e8bfedede28eadecba8eacebfdc2ea8ebe8ebfeab2b4e78eb0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)