To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 縡?鬱??狡??畯貊?魄縡?鬱??狡??畯貊?白^ 11100011011100010011111110011111010101000011111100111111111000001100001000111111001111111111101101101111111001101011101100111111111010011010111011100011011100010011111110011111010101000011111100111111111000001100001000111111001111111111101101101111111001101011101100111111100101001001001001011110 e3713f9f543f3fe0c23f3ffb6fe6bb3fe9aee3713f9f543f3fe0c23f3ffb6fe6bb3f94925e
EUC-JP 縡?鬱??狡??畯貊?魄縡?鬱??狡??畯貊?白^ 111001011101001000111111110111011011010100111111001111111110000011000100001111110011111110001111110011011011101111101100101111010011111111110010101100001110010111010010001111111101110110110101001111110011111111100000110001000011111100111111100011111100110110111011111011001011110100111111110001111111001001011110 e5d23fddb53f3fe0c43f3f8fcdbbecbd3ff2b0e5d23fddb53f3fe0c43f3f8fcdbbecbd3fc7f25e
UTF-8 縡렕鬱讀렲狡렕렟畯貊렠魄縡렕鬱讀렲狡렕렟畯貊렠白^ 11100111101110001010000111101011101000001001010111101001101011001011000111101111101001011001101011101011101000001011001011100111100010111010000111101011101000001001010111101011101000001001111111100111100101011010111111101000101100101000101011101011101000001010000011101001101011011000010011100111101110001010000111101011101000001001010111101001101011001011000111101111101001011001101011101011101000001011001011100111100010111010000111101011101000001001010111101011101000001001111111100111100101011010111111101000101100101000101011101011101000001010000011100111100110011011110101011110 e7b8a1eba095e9acb1efa59aeba0b2e78ba1eba095eba09fe795afe8b28aeba0a0e9ad84e7b8a1eba095e9acb1efa59aeba0b2e78ba1eba095eba09fe795afe8b28aeba0a0e799bd5e
UHC 縡렕鬱讀렲狡렕렟畯貊렠魄縡렕鬱讀렲狡렕렟畯貊렠白^ 11101110101011011000111010101010111010101010011011010100111001101000111010111111110011101110101010001110101010101000111010110000111100011110000111011000111001111000111010110001110110111101111011101110101011011000111010101010111010101010011011010100111001101000111010111111110011101110101010001110101010101000111010110000111100011110000111011000111001111000111010110001110110111101110001011110 eead8eaaeaa6d4e68ebfceea8eaa8eb0f1e1d8e78eb1dbdeeead8eaaeaa6d4e68ebfceea8eaa8eb0f1e1d8e78eb1dbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)