To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?щぉ??????源?К??????晤??^ 001111111000010010001011100000101010011100111111001111110011111100111111001111110011111110001100101110010011111110000100010010110011111100111111001111110011111100111111001111111001110111101011001111110011111101011110 3f848b82a73f3f3f3f3f3f8cb93f844b3f3f3f3f3f3f9deb3f3f5e
EUC-JP ?щぉ??????源?К??????晤??^ 001111111010011111101011101001001010100100111111001111110011111100111111001111110011111110111000101110110011111110100111101011000011111100111111001111110011111100111111001111111101101011101101001111110011111101011110 3fa7eba4a93f3f3f3f3f3fb8bb3fa7ac3f3f3f3f3f3fdaed3f3f5e
UTF-8 寧щぉ溜곕젒女앲쓮源녺К紐⑸젾溜묉옗晤롪퉮^ 1110111110100110101010101101000110001001111000111000000110001001111011111010011110001011111010101011001110010101111011001010000010010010111011111010011010000001111011001001010110110010111011001001001110101110111001101011101010010000111010111000010110111010110100001001101011101111101001111000111111100010100100011011100011101100101000001011111011101111101001111000101111101011101011001000100111101100100110001001011111100110100110011010010011101011101000011010101011101101100010011010111001011110 efa6aad189e38189efa78beab395eca092efa681ec95b2ec93aee6ba90eb85bad09aefa78fe291b8eca0beefa78bebac89ec9897e699a4eba1aaed89ae5e
UHC 寧щぉ溜곕젒女앲쓮源녺К紐⑸젾溜묉옗晤롪퉮^ 11100111101011001010110011101011101010101010100111101010111111101011000011101011101000001001000111100101111111001001110111101000100111011000111011101010101110011000011011100111101011001010110011101011101010101010100111101011101000001011000011101010111111101001000111100110100111101001110111100111111110111000111011101010101110011000011001011110 e7acacebaaa9eafeb0eba091e5fc9de89d8eeab986e7acacebaaa9eba0b0eafe91e69e9de7fb8eeab9865e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)