To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猷??恂?4而??嗽?6茵??猷??受 10010111010100010011111100111111100111001001011000111111100000100101001110001110101001110011111100111111100110100111010100111111100000100101010111100100100111110011111100111111100101110101000100111111001111111000111011110011 97513f3f9c963f82538ea73f3f9a753f8255e49f3f3f97513f3f8ef3
EUC-JP 猷??恂?4而??嗽?6茵??猷??受 11001101101100100011111100111111110101111111011000111111101000111011010010111100101010010011111100111111110100111101011000111111101000111011011011101000101000010011111100111111110011011011001000111111001111111011110011110101 cdb23f3fd7f63fa3b4bca93f3fd3d63fa3b6e8a13f3fcdb23f3fbcf5
UTF-8 猷띤뼳恂귣4而╉궙嗽뉖6茵껁깘猷뜯뀦受 111001111000110010110111111010111001110110100100111010111011110010110011111001101000000110000010111010101011011110100011111011111011110010010100111010001000000010001100111000101001010110001001111010101011011010011001111001011001011110111101111010111000100110010110111011111011110010010110111010001000110010110101111010101011101110000001111010101011100110011000111001111000110010110111111010111001110010101111111010111000000010100110111001011000111110010111 e78cb7eb9da4ebbcb3e68182eab7a3efbc94e8808ce29589eab699e597bdeb8996efbc96e88cb5eabb81eab998e78cb7eb9cafeb80a6e58f97
UHC 猷띤뼳恂귣4而╉궙嗽뉖6茵껁깘猷뜯뀦受 1110101110100011101101101110110110010110101101101110001011100001100000101110101110100011101101001110110010111011101001101110001110000010101011101110000111110101100001111110101110100011101101101110110011100000100000111110001110000011100100001110101110100011101101101110001010000101100111011110000111110100 eba3b6ed96b6e2e182eba3b4ecbba6e382aee1f587eba3b6ece083e38390eba3b6e2859de1f4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)