To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊??猷⑤???????濡レ??λ?苡 0011111100111111001111111110001010000110001111110011111110010111010100011000011101000100001111110011111100111111001111110011111100111111001111111001010001000111100000111000110000111111001111111000001111001001001111111110010010001111 3f3f3fe2863f3f975187443f3f3f3f3f3f3f9447838c3f3f83c93fe48f
EUC-JP ???竊??猷??孼?????濡レ??λ?苡 001111110011111100111111111000111110011000111111001111111100110110110010001111110011111110001111101110101100001100111111001111110011111100111111001111111100011110101000101001011110110000111111001111111010011011001011001111111110011111101111 3f3f3fe3e63f3fcdb23f3f8fbac33f3f3f3f3fc7a8a5ec3f3fa6cb3fe7ef
UTF-8 捻뀁뮆竊섇츦猷⑤럞孼꾩뮆吏뤻뼸濡レ굻嶪λ뎾苡 1110111110100110101001001110101110000000100000011110101110101110100001101110011110101011100010101110110010000100100001111110110010111000101001101110011110001100101101111110001010010001101001001110101110011111100111101110010110101101101111001110101010111110101010011110101110101110100001101110111110100111100111101110101110100100101110111110101110111100101110001110011010111111101000011110001110000011101011001110101010110101101110111110010110110110101010101100111010111011111010111000111010111110111010001000101110100001 efa6a4eb8081ebae86e7ab8aec8487ecb8a6e78cb7e291a4eb9f9ee5adbceabea9ebae86efa79eeba4bbebbcb8e6bfa1e383aceab5bbe5b6aacebbeb8ebee88ba1
UHC 捻뀁뮆竊섇츦猷⑤럞孼꾩뮆吏뤻뼸濡レ굻嶪λ뎾苡 1110011011110111101100101110110010010010100101011110111110111100100110001110010110101110100111001110101110100011101010001110101110001110100000011110010111101101100001001110110010010010100101011110110010100111100011111110100110010110101110111110101110100001101010111110110010110001101111111110010111110101101001011110101110001001100100011110110010111110 e6f7b2ec9295efbc98e5ae9ceba3a8eb8e81e5ed84ec9295eca78fe996bbeba1abecb1bfe5f5a5eb8991ecbe
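
The conversion shown in the table can be sketched in Python as follows. This is a minimal sketch, not the page's actual implementation: the function names `encode_to_bits` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. The runs of `3f` in the table are consistent with unencodable characters being replaced by `?`, so the sketch uses `errors="replace"`; whether the tool does exactly this is also an assumption.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced by '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_to_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("A").items():
        print(label, binary, hexadecimal)
```

For the ASCII letter `A`, every charset in the list yields the same single byte, `41`. Differences only appear for characters outside ASCII.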

SJIS-Win, EUC-JP: legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.
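
As a small illustration of how the two Japanese charsets differ, the following sketch encodes one hiragana character with Python's `cp932` and `euc_jp` codecs. The choice of character is arbitrary and only for illustration, and using `cp932` as the stand-in for SJIS-Win follows the note above.

```python
# HIRAGANA LETTER A (U+3042), chosen only as an example character.
text = "\u3042"

sjis_hex = text.encode("cp932").hex()    # SJIS-Win (CP932) bytes
eucjp_hex = text.encode("euc_jp").hex()  # EUC-JP bytes

print("SJIS-Win:", sjis_hex)  # 82a0
print("EUC-JP:  ", eucjp_hex)  # a4a2
```

Both encodings use two bytes for this character, but the byte values differ, so text written in one charset turns into garbled characters when it is read as the other.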