To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 吾ч?永??潁??n}吾ч?永??潁??n{^ 10001100111000011000010010001001001111111000100101101001001111110011111110011111111100010011111100111111011011100111110110001100111000011000010010001001001111111000100101101001001111110011111110011111111100010011111100111111011011100111101101011110 8ce184893f89693f3f9ff13f3f6e7d8ce184893f89693f3f9ff13f3f6e7b5e
EUC-JP 吾ч?永??潁??n}吾ч?永??潁??n{^ 10111000111000111010011111101001001111111011000111001010001111110011111111011110111100110011111100111111011011100111110110111000111000111010011111101001001111111011000111001010001111110011111111011110111100110011111100111111011011100111101101011110 b8e3a7e93fb1ca3f3fdef33f3f6e7db8e3a7e93fb1ca3f3fdef33f3f6e7b5e
UTF-8 吾ч쭏永귟뙧潁곮뮁n}吾ч쭏永귟뙧潁곮뮁n{^ 111001011001000010111110110100011000011111101100101011011000111111100110101100001011100011101010101101111001111111101011100110011010011111100110101111011000000111101010101100111010111011101011101011101000000101101110011111011110010110010000101111101101000110000111111011001010110110001111111001101011000010111000111010101011011110011111111010111001100110100111111001101011110110000001111010101011001110101110111010111010111010000001011011100111101101011110 e590bed187ecad8fe6b0b8eab79feb99a7e6bd81eab3aeebae816e7de590bed187ecad8fe6b0b8eab79feb99a7e6bd81eab3aeebae816e7b5e
UHC 吾ч쭏永귟뙧潁곮뮁n}吾ч쭏永귟뙧潁곮뮁n{^ 1110011111101110101011001110100110100111100010001110011110110101100000101110100010001100101010111110011110111000100000011110100010010010100100000110111001111101111001111110111010101100111010011010011110001000111001111011010110000010111010001000110010101011111001111011100010000001111010001001001010010000011011100111101101011110 e7eeace9a788e7b582e88cabe7b881e892906e7de7eeace9a788e7b582e88cabe7b881e892906e7b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
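
The binary and hexadecimal columns in the table above can be reproduced, at least roughly, with a few lines of Python. This is a minimal sketch and not the converter's actual implementation: the helper name `encode_bits` is hypothetical, the codec names are Python's (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, and `cp949` assumed to correspond to UHC), and `errors="replace"` is an assumption chosen because it substitutes `?` (0x3f) for unencodable characters, which matches the runs of `3f` seen in the table.

```python
def encode_bits(text, charset, errors="replace"):
    """Encode text in the given charset and return (binary, hexadecimal) strings.

    With errors="replace", characters the charset cannot represent are
    encoded as "?" (byte 0x3f). This is an assumption about how the
    converter treats such characters.
    """
    raw = text.encode(charset, errors=errors)
    # Each byte becomes exactly eight binary digits, zero-padded on the left.
    binary = "".join(format(byte, "08b") for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    # Python codec names assumed to match the charsets listed in the table.
    for charset in ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949"):
        binary, hexadecimal = encode_bits("n{^", charset)
        print(charset, binary, hexadecimal)
```

Because `n{^` consists only of ASCII characters, all five charsets produce the same bytes, `6e7b5e`, which is the common tail of every row in the table. Differences appear only for non-ASCII input, where each charset either assigns its own byte sequence or falls back to `?`.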