Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 外??穩??映ч?節??穩??潁 1000101001001111001111110011111111100010011100100011111100111111100010010110011010000100100010010011111110010000110111110011111100111111111000100111001000111111001111111001111111110001 8a4f3f3fe2723f3f896684893f90df3f3fe2723f3f9ff1
EUC-JP 外??穩??映ч?節??穩??潁 1011001110110000001111110011111111100011110100110011111100111111101100011100011110100111111010010011111111000000111000010011111100111111111000111101001100111111001111111101111011110011 b3b03f3fe3d33f3fb1c7a7e93fc0e13f3fe3d33f3fdef3
UTF-8 外믭숲穩뚳슈映ч냽節억숲穩뚳슈潁 1110010110100100100101101110101110101111101011011110110010001000101100101110011110101001101010011110101110011010101100111110110010001010100010001110011010011000101000001101000110000111111010111000001110111101111001111010111110000000111011001001011010110101111011001000100010110010111001111010100110101001111010111001101010110011111011001000101010001000111001101011110110000001 e5a496ebafadec88b2e7a9a9eb9ab3ec8a88e698a0d187eb83bde7af80ec96b5ec88b2e7a9a9eb9ab3ec8a88e6bd81
UHC 外믭숲穩뚳슈映ч냽節억숲穩뚳슈潁 1110100011100010100100101110111110111101101000111110100010110001100011001110111110111101101101001110011110110001101011001110100110000110100011011110111110111101101111101110111110111101101000111110100010110001100011001110111110111101101101001110011110111000 e8e292efbda3e8b18cefbdb4e7b1ace9868defbdbeefbda3e8b18cefbdb4e7b8

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
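
The conversion shown in the table above can be roughly reproduced offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the table's charset labels to Python codec names (in particular "cp932" for SJIS-WIN and "cp949" for UHC) and the helper name encode_bits are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f seen in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as CP949
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Hypothetical helper: unencodable characters become "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    # Print one row per charset for a sample character.
    for label, codec in CODECS.items():
        binary, hexed = encode_bits("外", codec)
        print(label, binary, hexed)
```

For the single character 外, this sketch gives e5a496 in UTF-8, 8a4f in CP932, and b3b0 in EUC-JP, which matches the leading bytes of the corresponding rows in the table.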