To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 筌??誼??釉??筌??應??邑??茹??誼?B 1110001010100011001111110011111110001011011000100011111100111111111001111101011000111111001111111110001010100011001111110011111110011100111001000011111100111111100101110101011100111111001111111110010010100101001111110011111110001011011000100011111101000010 e2a33f3f8b623f3fe7d63f3fe2a33f3f9ce43f3f97573f3fe4a53f3f8b623f42
EUC-JP 筌??誼??釉??筌??應??邑??茹??誼?B 1110010010100101001111110011111110110101110000110011111100111111111011101101100000111111001111111110010010100101001111110011111111011000111001100011111100111111110011011011100000111111001111111110100010100111001111110011111110110101110000110011111101000010 e4a53f3fb5c33f3feed83f3fe4a53f3fd8e63f3fcdb83f3fe8a73f3fb5c33f42
UTF-8 筌뗭궠誼쏉쭓釉띿젴筌뗫툝應믭쭓邑뀁죰茹띻퍓誼튛B 11100111101011011000110011101011100101111010110111101010101101101010000011101000101010101011110011101100100011111000100111101100101011011001001111101001100001111000100111101011100111011011111111101100101000001011010011100111101011011000110011101011100101111010101111101101100010001001110111100110100001111000100111101011101011111010110111101100101011011001001111101001100000101001000111101011100000001000000111101100101000111011000011101000100011001011100111101011100111011011101111101101100011011001001111101000101010101011110011101101100010101001101101000010 e7ad8ceb97adeab6a0e8aabcec8f89ecad93e98789eb9dbfeca0b4e7ad8ceb97abed889de68789ebafadecad93e98291eb8081eca3b0e88cb9eb9dbbed8d93e8aabced8a9b42
UHC 筌뗭궠誼쏉쭓釉띿젴筌뗫툝應믭쭓邑뀁죰茹띻퍓誼튛B 1110111110100111100010111110110010000010101100111110101111111110100110111110111110100111100010111110101110111000100011011110110010100000101010001110111110100111100010111110101110111000100101001110101111101011100100101110111110100111100010111110101111101001101100101110110010100001100010111110011010101010100011011110101010111011100010101110101111111110101110100100110001000010 efa78bec82b3ebfe9befa78bebb88deca0a8efa78bebb894ebeb92efa78bebe9b2eca18be6aa8deabb8aebfeba4c42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)