What bit string is a character (or a sequence of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???靭?㎡?≪?齬??唯??惟?㎜? 0011111100111111001111111001000001111000001111111000011101110101001111111000000111100001001111111110101010010111001111110011111110010111010000100011111100111111100010001101001000111111100001110110111100111111 3f3f3f90783f87753f81e13fea973f3f97423f3f88d23f876f3f
EUC-JP ???靭??洹≪?齬??唯??惟??繇 00111111001111110011111110111111110110010011111100111111100011111100011110111010101000101110001100111111111100111111011100111111001111111100110110100011001111110011111110110000110101000011111100111111100011111101010011010001 3f3f3fbfd93f3f8fc7baa2e33ff3f73f3fcda33f3fb0d43f3f8fd4d1
UTF-8 麗몃쓷靭뚳㎡洹≪뒉齬잆굥唯뽬죲惟곕㎜繇 111011111010011010001000111010111010101010000011111011001001001110110111111010011001110110101101111010111001101010110011111000111000111010100001111001101011010010111001111000101000100110101010111010111001001010001001111010011011110110101100111011001001111010000110111010101011010110100101111001011001010010101111111010111011110110101100111011001010001110110010111001101000001110011111111010101011001110010101111000111000111010011100111001111011100110000111 efa688ebaa83ec93b7e99dadeb9ab3e38ea1e6b4b9e289aaeb9289e9bdacec9e86eab5a5e594afebbdaceca3b2e6839feab395e38e9ce7b987
UHC 麗몃쓷靭뚳㎡洹≪뒉齬잆굥唯뽬죲惟곕㎜繇 1110011010110000101110001110101110011101100101001110110011100101100011001110111110100111101100111110101010110111101000011110110010001010100001101110010111100001100111111110001110000010100010111110101011100110100101101110100010100001100011011110101011101110101100001110101110100111101011101110100110100011 e6b0b8eb9d94ece58cefa7b3eab7a1ec8a86e5e19fe3828beae696e8a18deaeeb0eba7aee9a3

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
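
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_report` is hypothetical, the codec names are Python's own, and mapping UHC to Python's `cp949` codec is an assumption. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which appears to match the `3f` runs in the rows above.

```python
# Map the charset labels used in the table to Python codec names.
# SJIS-WIN -> cp932 follows the note above; UHC -> cp949 is an assumption.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return {charset label: (binary string, hex string)} for text."""
    report = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        report[label] = (bits, data.hex())
    return report


if __name__ == "__main__":
    for label, (bits, hexa) in encode_report("あ").items():
        print(label, bits, hexa)
```

For the input `あ`, this yields `e38182` in UTF-8, `82a0` in SJIS-WIN, `a4a2` in EUC-JP, and `3f` in ISO-8859-1, since Latin-1 has no hiragana.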