To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???[???????????[????????E 00111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f5b3f3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f45
SJIS-WIN 猷??[猷??嗽?5??猷??[猷??嗽?5??E 100101110101000100111111001111110101101110010111010100010011111100111111100110100111010100111111100000100101010000111111001111111001011101010001001111110011111101011011100101110101000100111111001111111001101001110101001111111000001001010100001111110011111101000101 97513f3f5b97513f3f9a753f82543f3f97513f3f5b97513f3f9a753f82543f3f45
EUC-JP 猷??[猷??嗽?5蓀?猷??[猷??嗽?5蓀?E 11001101101100100011111100111111010110111100110110110010001111110011111111010011110101100011111110100011101101011000111111011000111110000011111111001101101100100011111100111111010110111100110110110010001111110011111111010011110101100011111110100011101101011000111111011000111110000011111101000101 cdb23f3f5bcdb23f3fd3d63fa3b58fd8f83fcdb23f3f5bcdb23f3fd3d63fa3b58fd8f83f45
UTF-8 猷듈굛[猷듭㉤嗽덈5蓀긛猷듈굛[猷듭㉤嗽덈5蓀긞E 111001111000110010110111111010111001001110001000111010101011010110011011010110111110011110001100101101111110101110010011101011011110001110001001101001001110010110010111101111011110101110001101100010001110111110111100100101011110100010010011100000001110101010111000100110111110011110001100101101111110101110010011100010001110101010110101100110110101101111100111100011001011011111101011100100111010110111100011100010011010010011100101100101111011110111101011100011011000100011101111101111001001010111101000100100111000000011101010101110001001111001000101 e78cb7eb9388eab59b5be78cb7eb93ade389a4e597bdeb8d88efbc95e89380eab89be78cb7eb9388eab59b5be78cb7eb93ade389a4e597bdeb8d88efbc95e89380eab89e45
UHC 猷듈굛[猷듭㉤嗽덈5蓀긛猷듈굛[猷듭㉤嗽덈5蓀긞E 1110101110100011101101011110001010000010100000110101101111101011101000111011010111101100101010001011010111100001111101011000100011101011101000111011010111100001111000001000001101011001111010111010001110110101111000101000001010000011010110111110101110100011101101011110110010101000101101011110000111110101100010001110101110100011101101011110000111100000100000110110001001000101 eba3b5e282835beba3b5eca8b5e1f588eba3b5e1e08359eba3b5e282835beba3b5eca8b5e1f588eba3b5e1e0836245

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
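
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name bit_strings and the mapping from the table's charset labels to Python codec names are assumptions made for illustration (SJIS-WIN is taken to be cp932, and UHC is taken to be cp949). Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the runs of 3f in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: SJIS-WIN corresponds to cp932 and UHC to cp949.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset
        # cannot encode instead of raising UnicodeEncodeError.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in bit_strings("E").items():
        print(label, binary, hexadecimal)
```

For the single letter "E", every charset listed gives the same byte, 01000101 (hex 45), which matches the final byte of each row in the table. The results differ only for characters outside ASCII.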