To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 壓??泣ゆ?宥??齬??愉??幽?????由 100110101101100000111111001111111000101110000011100000101110010000111111100101110100011100111111001111111110101010010111001111110011111110010110111110010011111100111111100101110100100000111111001111110011111100111111001111111001011101010010 9ad83f3f8b8382e43f97473f3fea973f3f96f93f3f97483f3f3f3f3f9752
EUC-JP 壓??泣ゆ?宥??齬??愉??幽??孼??由 1101010011011010001111110011111110110101111000111010010011100110001111111100110110101000001111110011111111110011111101110011111100111111110011001111101100111111001111111100110110101001001111110011111110001111101110101100001100111111001111111100110110110011 d4da3f3fb5e3a4e63fcda83f3ff3f73f3fccfb3f3fcda93f3f8fbac33f3fcdb3
UTF-8 壓쇰낄泣ゆ룚宥룰텫齬잙벊愉볢뎲幽덈쐣孼뽯떥由 111001011010001110010011111011001000011110110000111010111000001010000100111001101011001110100011111000111000001010000110111010111010001110011010111001011010111010100101111010111010001110110000111011011000010110101011111010011011110110101100111011001001111010011001111010111011001010001010111001101000010010001001111010111011001110100010111010111000111010110010111001011011100110111101111010111000110110001000111011001001000010100011111001011010110110111100111010111011110110101111111010111001011010100101111001111001010010110001 e5a393ec87b0eb8284e6b3a3e38286eba39ae5aea5eba3b0ed85abe9bdacec9e99ebb28ae68489ebb3a2eb8eb2e5b9bdeb8d88ec90a3e5adbcebbdafeb96a5e794b1
UHC 壓쇰낄泣ゆ룚宥룰텫齬잙벊愉볢뎲幽덈쐣孼뽯떥由 1110010011100010101111001110101110110011101001011110101111101000101010101110011010001111100101101110101011101001101101111110101010110110100111111110010111100001100111111110101110010011101011011110101011110000100100111110100010001001100001011110101011101011100010001110101110011100100010011110010111101101100101101110101110001011101110001110101110100110 e4e2bcebb3a5ebe8aae68f96eae9b7eab69fe5e19feb93adeaf093e88985eaeb88eb9c89e5ed96eb8bb8eba6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)