To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???援??揄??孃る?而g?揄??? 0011111100111111001111111000100110000111001111110011111110011101100010010011111100111111100110110110111110000010111010010011111110001110101001111000001010000111001111111001110110001001001111110011111100111111 3f3f3f89873f3f9d893f3f9b6f82e93f8ea782873f9d893f3f3f
EUC-JP ???援??揄??孃る?而g?揄??孼 00111111001111110011111110110001111001110011111100111111110110011110100100111111001111111101010111010000101001001110101100111111101111001010100110100011111001110011111111011001111010010011111100111111100011111011101011000011 3f3f3fb1e73f3fd9e93f3fd5d0a4eb3fbca9a3e73fd9e93f3f8fbac3
UTF-8 嶺뚮뱪援㎪룚揄멥룋孃る슣而g춯揄먭덴孼 111011111010011010101011111010111001101010101110111010111011000110101010111001101000111110110100111000111000111010101010111010111010001110011010111001101000111110000100111010111010100110100101111010111010001110001011111001011010110110000011111000111000001010001011111011001000101010100011111010001000000010001100111011111011110110000111111011001011011010101111111001101000111110000100111010111010100010101101111010111000110110110100111001011010110110111100 efa6abeb9aaeebb1aae68fb4e38eaaeba39ae68f84eba9a5eba38be5ad83e3828bec8aa3e8808cefbd87ecb6afe68f84eba8adeb8db4e5adbc
UHC 嶺뚮뱪援㎪룚揄멥룋孃る슣而g춯揄먭덴孼 1110011110101101100011001110101110010011100100001110101010110101101001111110011010001111100101101110101011110001101110001110001110001111100010101110010110111110101010101110101110011010101011111110110010111011101000111110011110101101100011001110101011110001100100001110101010110101101001111110010111101101 e7ad8ceb9390eab5a7e68f96eaf1b8e38f8ae5beaaeb9aafecbba3e7ad8ceaf190eab5a7e5ed

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)