To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 罌??揖??惟??悟??唯?ゥ怨??永 111000111010000000111111001111111001011101001011001111110011111110001000110100100011111100111111100011001110010100111111001111111001011101000010001111111000001101000100100010011000010100111111001111111000100101101001 e3a03f3f974b3f3f88d23f3f8ce53f3f97423f834489853f3f8969
EUC-JP 罌??揖??惟??悟??唯?ゥ怨??永 111001101010001000111111001111111100110110101100001111110011111110110000110101000011111100111111101110001110011100111111001111111100110110100011001111111010010110100101101100011110010100111111001111111011000111001010 e6a23f3fcdac3f3fb0d43f3fb8e73f3fcda33fa5a5b1e53f3fb1ca
UTF-8 罌삳냲揖욕컜惟듭뒳悟듽굨唯곲ゥ怨꿸굉永 111001111011110110001100111011001000001010110011111010111000001110110010111001101000111110010110111011001001101010010101111011001011101110011100111001101000001110011111111010111001001110101101111010111001001010110011111001101000001010011111111010111001001110111101111010101011010110101000111001011001010010101111111010101011001110110010111000111000001010100101111001101000000010101000111010101011111110111000111010101011010110001001111001101011000010111000 e7bd8cec82b3eb83b2e68f96ec9a95ecbb9ce6839feb93adeb92b3e6829feb93bdeab5a8e594afeab3b2e382a5e680a8eabfb8eab589e6b0b8
UHC 罌삳냲揖욕컜惟듭뒳悟듽굨唯곲ゥ怨꿸굉永 1110010110100010101110111110101110000110100000101110101111100111101111111110010110110000100001111110101011101110101101011110110010001010101011001110011111110110100010101110001110000010100011101110101011100110100000011110100110101011101001011110101010110011101100101110101010110001101100101110011110110101 e5a2bbeb8682ebe7bfe5b087eaeeb5ec8aace7f68ae3828eeae681e9aba5eab3b2eab1b2e7b5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)