To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 縡?鬱??狡??趙貊??縡?鬱??狡??趙貊??B 1110001101110001001111111001111101010100001111110011111111100000110000100011111100111111111001101110001011100110101110110011111100111111111000110111000100111111100111110101010000111111001111111110000011000010001111110011111111100110111000101110011010111011001111110011111101000010 e3713f9f543f3fe0c23f3fe6e2e6bb3f3fe3713f9f543f3fe0c23f3fe6e2e6bb3f3f42
EUC-JP 縡?鬱??狡??趙貊??縡?鬱??狡??趙貊??B 1110010111010010001111111101110110110101001111110011111111100000110001000011111100111111111011001110010011101100101111010011111100111111111001011101001000111111110111011011010100111111001111111110000011000100001111110011111111101100111001001110110010111101001111110011111101000010 e5d23fddb53f3fe0c43f3fece4ecbd3f3fe5d23fddb53f3fe0c43f3fece4ecbd3f3f42
UTF-8 縡렕鬱讀렲狡렕렟趙貊렟끝縡렕鬱讀렲狡렕렟趙貊렟끝B 11100111101110001010000111101011101000001001010111101001101011001011000111101111101001011001101011101011101000001011001011100111100010111010000111101011101000001001010111101011101000001001111111101000101101101001100111101000101100101000101011101011101000001001111111101011100000011001110111100111101110001010000111101011101000001001010111101001101011001011000111101111101001011001101011101011101000001011001011100111100010111010000111101011101000001001010111101011101000001001111111101000101101101001100111101000101100101000101011101011101000001001111111101011100000011001110101000010 e7b8a1eba095e9acb1efa59aeba0b2e78ba1eba095eba09fe8b699e8b28aeba09feb819de7b8a1eba095e9acb1efa59aeba0b2e78ba1eba095eba09fe8b699e8b28aeba09feb819d42
UHC 縡렕鬱讀렲狡렕렟趙貊렟끝縡렕鬱讀렲狡렕렟趙貊렟끝B 11101110101011011000111010101010111010101010011011010100111001101000111010111111110011101110101010001110101010101000111010110000111100001110000111011000111001111000111010110000101100111010000111101110101011011000111010101010111010101010011011010100111001101000111010111111110011101110101010001110101010101000111010110000111100001110000111011000111001111000111010110000101100111010000101000010 eead8eaaeaa6d4e68ebfceea8eaa8eb0f0e1d8e78eb0b3a1eead8eaaeaa6d4e68ebfceea8eaa8eb0f0e1d8e78eb0b3a142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)