To which bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 庸?????蹂μ??????キ魏??筌??? 10010111011001100011111100111111001111110011111100111111111001101111100010000011110010100011111100111111001111110011111100111111001111111000001101001100111010011011000000111111001111111110001010100011001111110011111100111111 97663f3f3f3f3fe6f883ca3f3f3f3f3f3f834ce9b03f3fe2a33f3f3f
EUC-JP 庸?????蹂μ?濚?Ł??キ魏??筌??? 1100110111000111001111110011111100111111001111110011111111101100111110101010011011001100001111111000111111001001101000010011111110001111101010011010100000111111001111111010010110101101111100101011001000111111001111111110010010100101001111110011111100111111 cdc73f3f3f3f3fecfaa6cc3f8fc9a13f8fa9a83f3fa5adf2b23f3fe4a53f3f3f
UTF-8 庸뉕퇌李뉛쭓蹂μ젫濚밸Ł吏귟キ魏귥첎筌뗣렔泥 11100101101110101011100011101011100010011001010111101101100001111000110011101111101001111010000111101011100010011001101111101100101011011001001111101000101110011000001011001110101111001110110010100000101010111110011010111111100110101110101110110000101110001100010110000001111011111010011110011110111010101011011110011111111000111000001010101101111010011010110110001111111010101011011110100101111011001011001010001110111001111010110110001100111010111001011110100011111010111010000010010100111011111010011110100011 e5bab8eb8995ed878cefa7a1eb899becad93e8b982cebceca0abe6bf9aebb0b8c581efa79eeab79fe382ade9ad8feab7a5ecb28ee7ad8ceb97a3eba094efa7a3
UHC 庸뉕퇌李뉛쭓蹂μ젫濚밸Ł吏귟キ魏귥첎筌뗣렔泥 1110100110111100100001111110101010110111100111011110110010110000100001111110111110100111100010111110101110110011101001011110110010100000101000111110011110111001101110011110101110101000101010011110110010100111100000101110100010101011101011011110101011100000100000101110110010101010100110111110111110100111100010111110001110001110101010011110110010110010 e9bc87eab79decb087efa78bebb3a5eca0a3e7b9b9eba8a9eca782e8abadeae082ecaa9befa78be38ea9ecb2
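A row of this kind can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation: the mapping from the charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every assumed charset, keyed by the table label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexa) in encode_table("あ").items():
        print(label, binary, hexa)
```

The mojibake in the Character column suggests the tool also decodes the resulting bytes back for display; that step is left out here.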

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.