To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 將?虞?鷹?????頭?虞?鷹?????淡 10011011100100100011111110001011111100010011111110010001111010010011111100111111001111110011111100111111100100111010101000111111100010111111000100111111100100011110100100111111001111110011111100111111001111111001001001010111 9b923f8bf13f91e93f3f3f3f3f93aa3f8bf13f91e93f3f3f3f3f9257
EUC-JP 將?虞?鷹??勖??頭?虞?鷹??勖??淡 1101010111110010001111111011011011110011001111111100001011101011001111110011111110001111101100111110110100111111001111111100011010101100001111111011011011110011001111111100001011101011001111110011111110001111101100111110110100111111001111111100001110111000 d5f23fb6f33fc2eb3f3f8fb3ed3f3fc6ac3fb6f33fc2eb3f3f8fb3ed3f3fc3b8
UTF-8 將렚虞렧鷹꿴ㄿ勖쾡렧頭ㄿ虞렧鷹꿴ㄿ勖쾡렧淡 111001011011000010000111111010111010000010011010111010001001100110011110111010111010000010100111111010011011011110111001111010101011111110110100111000111000010010111111111001011000101110010110111011001011111010100001111010111010000010100111111010011010000010101101111000111000010010111111111010001001100110011110111010111010000010100111111010011011011110111001111010101011111110110100111000111000010010111111111001011000101110010110111011001011111010100001111010111010000010100111111001101011011110100001 e5b087eba09ae8999eeba0a7e9b7b9eabfb4e384bfe58b96ecbea1eba0a7e9a0ade384bfe8999eeba0a7e9b7b9eabfb4e384bfe58b96ecbea1eba0a7e6b7a1
UHC 將렚虞렧鷹꿴ㄿ勖쾡렧頭ㄿ虞렧鷹꿴ㄿ勖쾡렧淡 111011011110001010001110101011011110100111100101100011101011011011101011111011011011001011101001101001001010111111101001111011011100010011101001100011101011011011010100111010011010010010101111111010011110010110001110101101101110101111101101101100101110100110100100101011111110100111101101110001001110100110001110101101101101001110111111 ede28eade9e58eb6ebedb2e9a4afe9edc4e98eb6d4e9a4afe9e58eb6ebedb2e9a4afe9edc4e98eb6d3bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)