Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 厭レ?有??飮??掩??裕??音??掩 100010010111110110000011100011000011111110010111010011000011111100111111100111110101101000111111001111111000100110000110001111110011111110010111010101000011111100111111100010011011100100111111001111111000100110000110 897d838c3f974c3f3f9f5a3f3f89863f3f97543f3f89b93f3f8986
EUC-JP 厭レ?有??飮??掩??裕??音??掩 101100011101111010100101111011000011111111001101101011010011111100111111110111011011101100111111001111111011000111100110001111110011111111001101101101010011111100111111101100101011101100111111001111111011000111100110 b1dea5ec3fcdad3f3fddbb3f3fb1e63f3fcdb53f3fb2bb3f3fb1e6
UTF-8 厭レ슌有며땸飮곷츍掩뽯뎾裕덌쫿音쀫츍掩 111001011000111010101101111000111000001110101100111011001000101010001100111001101001110010001001111010111010100110110000111010111001010110111000111010011010001110101110111010101011001110110111111011001011100010001101111001101000111010101001111010111011110110101111111010111000111010111110111010001010001110010101111010111000110110001100111011001010101110111111111010011001111110110011111011001000000010101011111011001011100010001101111001101000111010101001 e58eade383acec8a8ce69c89eba9b0eb95b8e9a3aeeab3b7ecb88de68ea9ebbdafeb8ebee8a395eb8d8cecabbfe99fb3ec80abecb88de68ea9
UHC 厭レ슌有며땸飮곷츍掩뽯뎾裕덌쫿音쀫츍掩 1110011011110100101010111110110010011010100111001110101011110011101110001110011110001011100011101110101111100110100000011110101110101110100010001110010111110011100101101110101110001001100100011110101110101110100010001110111110100110100101101110101111100101100101111110101110101110100010001110010111110011 e6f4abec9a9ceaf3b8e78b8eebe681ebae88e5f396eb8991ebae88efa696ebe597ebae88e5f3

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
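
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the function name `encode_to_bits` and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which appears to be what the table does as well.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text: str, codec: str) -> tuple[str, str]:
    """Encode text with the given codec and return (binary, hex) strings.

    Unencodable characters are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, zero-padded.
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


if __name__ == "__main__":
    sample = "レ"  # one katakana character taken from the table above
    for label, codec in CHARSETS.items():
        bits, hexstr = encode_to_bits(sample, codec)
        print(label, bits, hexstr)
```

For the single character `レ`, this yields `838c` in CP932, `a5ec` in EUC-JP, and `e383ac` in UTF-8, matching the second character in the corresponding rows of the table.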