To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????a?????N?????a?????N^ 00111111001111110011111100111111001111110110000100111111001111110011111100111111001111110100111000111111001111110011111100111111001111110110000100111111001111110011111100111111001111110100111001011110 3f3f3f3f3f613f3f3f3f3f4e3f3f3f3f3f613f3f3f3f3f4e5e
SJIS-WIN 賊論┗窪?a賊論┗窪?N賊論┗窪?a賊論┗窪?N^ 1001000110101111100110000101111110000100101011111000110001000101001111110110000110010001101011111001100001011111100001001010111110001100010001010011111101001110100100011010111110011000010111111000010010101111100011000100010100111111011000011001000110101111100110000101111110000100101011111000110001000101001111110100111001011110 91af985f84af8c453f6191af985f84af8c453f4e91af985f84af8c453f6191af985f84af8c453f4e5e
EUC-JP 賊論┗窪?a賊論┗窪?N賊論┗窪?a賊論┗窪?N^ 1100001010110001110011111100000010101000101100011011011110100110001111110110000111000010101100011100111111000000101010001011000110110111101001100011111101001110110000101011000111001111110000001010100010110001101101111010011000111111011000011100001010110001110011111100000010101000101100011011011110100110001111110100111001011110 c2b1cfc0a8b1b7a63f61c2b1cfc0a8b1b7a63f4ec2b1cfc0a8b1b7a63f61c2b1cfc0a8b1b7a63f4e5e
UTF-8 賊論┗窪렜a賊論┗窪렜N賊論┗窪렜a賊論┗窪렜N^ 1110100010110011100010101110100010101011100101101110001010010100100101111110011110101010101010101110101110100000100111000110000111101000101100111000101011101000101010111001011011100010100101001001011111100111101010101010101011101011101000001001110001001110111010001011001110001010111010001010101110010110111000101001010010010111111001111010101010101010111010111010000010011100011000011110100010110011100010101110100010101011100101101110001010010100100101111110011110101010101010101110101110100000100111000100111001011110 e8b38ae8ab96e29497e7aaaaeba09c61e8b38ae8ab96e29497e7aaaaeba09c4ee8b38ae8ab96e29497e7aaaaeba09c61e8b38ae8ab96e29497e7aaaaeba09c4e5e
UHC 賊論┗窪렜a賊論┗窪렜N賊論┗窪렜a賊論┗窪렜N^ 111011101110010011010110111001011010011010110001111010001100000110001110101011100110000111101110111001001101011011100101101001101011000111101000110000011000111010101110010011101110111011100100110101101110010110100110101100011110100011000001100011101010111001100001111011101110010011010110111001011010011010110001111010001100000110001110101011100100111001011110 eee4d6e5a6b1e8c18eae61eee4d6e5a6b1e8c18eae4eeee4d6e5a6b1e8c18eae61eee4d6e5a6b1e8c18eae4e5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)