To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 癰?????濡??訝????????猷??^ 111000011001111000111111001111110011111100111111001111111001010001000111001111111000000101001000111001100110001000111111001111110011111100111111001111110011111100111111001111111001011101010001001111110011111101011110 e19e3f3f3f3f3f94473f8148e6623f3f3f3f3f3f3f3f97513f3f5e
EUC-JP 癰??旿??濡??訝????????猷??^ 1110000111111110001111110011111110001111110000011111010000111111001111111100011110101000001111111010000110101001111010111100001100111111001111110011111100111111001111110011111100111111001111111100110110110010001111110011111101011110 e1fe3f3f8fc1f43f3fc7a83fa1a9ebc33f3f3f3f3f3f3f3fcdb23f3f5e
UTF-8 癰잙젿旿껊젍濡됰?訝덅튋溜김쐝溜롫죮猷욤졅^ 11100111100110011011000011101100100111101001100111101100101000001011111111100110100101111011111111101010101110111000101011101100101000001000110111100110101111111010000111101011100100001011000011101111101111001001111111101000101010001001110111101011100011011000010111101101100010101000101111101111101001111000101111101010101110011000000011101100100100001001110111101111101001111000101111101011101000011010101111101100101000111010111011100111100011001011011111101100100110101010010011101100101000011000010101011110 e799b0ec9e99eca0bfe697bfeabb8aeca08de6bfa1eb90b0efbc9fe8a89deb8d85ed8a8befa78beab980ec909defa78beba1abeca3aee78cb7ec9aa4eca1855e
UHC 癰잙젿旿껊젍濡됰?訝덅튋溜김쐝溜롫죮猷욤졅^ 11101000101110011001111111101011101000001011000111100111111110101000001111101011101000001000111011101011101000011000100111101011101000111011111111100100101110001000100011101000101110011001111111101010111111101011000111101000100111001000001111101010111111101000111011101011101000011000100111101011101000111011111111101000101000001011011001011110 e8b99feba0b1e7fa83eba08eeba189eba3bfe4b888e8b99feafeb1e89c83eafe8eeba189eba3bfe8a0b65e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)