What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 猷?┰徇?4怏??D猷?┰徇?4怏??D^ 10010111010100010011111110000100101110111001110001101101001111111000001001010011100111001000100100111111001111110100010010010111010100010011111110000100101110111001110001101101001111111000001001010011100111001000100100111111001111110100010001011110 97513f84bb9c6d3f82539c893f3f4497513f84bb9c6d3f82539c893f3f445e
EUC-JP 猷?┰徇?4怏??D猷?┰徇?4怏??D^ 11001101101100100011111110101000101111011101011111001110001111111010001110110100110101111110100100111111001111110100010011001101101100100011111110101000101111011101011111001110001111111010001110110100110101111110100100111111001111110100010001011110 cdb23fa8bdd7ce3fa3b4d7e93f3f44cdb23fa8bdd7ce3fa3b4d7e93f3f445e
UTF-8 猷띠┰徇귣4怏잆걖D猷띠┰徇귣4怏잆걖D^ 111001111000110010110111111010111001110110100000111000101001010010110000111001011011111010000111111010101011011110100011111011111011110010010100111001101000000010001111111011001001111010000110111010101011000110010110010001001110011110001100101101111110101110011101101000001110001010010100101100001110010110111110100001111110101010110111101000111110111110111100100101001110011010000000100011111110110010011110100001101110101010110001100101100100010001011110 e78cb7eb9da0e294b0e5be87eab7a3efbc94e6808fec9e86eab19644e78cb7eb9da0e294b0e5be87eab7a3efbc94e6808fec9e86eab196445e
UHC 猷띠┰徇귣4怏잆걖D猷띠┰徇귣4怏잆걖D^ 111010111010001110110110111011001010011010111101111000101101111110000010111010111010001110110100111001001110100010011111111000111000000110000001010001001110101110100011101101101110110010100110101111011110001011011111100000101110101110100011101101001110010011101000100111111110001110000001100000010100010001011110 eba3b6eca6bde2df82eba3b4e4e89fe3818144eba3b6eca6bde2df82eba3b4e4e89fe38181445e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.
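
The page does not say how the conversion is implemented, so the following is only a minimal Python sketch of the same idea: encode the input in each charset and print the bytes as a binary string and as a hexadecimal string. The codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) are assumed Python equivalents of the charsets in the table, and the helper name encode_rows is hypothetical. Using errors="replace" is also an assumption; it substitutes "?" (0x3f) for characters a charset cannot represent, which would explain the 3f bytes in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the page states SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_rows(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Characters the charset cannot represent become "?" (0x3f).
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_rows("D^"):
        print(label, bits, hex_string)
```

ASCII characters such as "D" and "^" encode to the same bytes (44 and 5e) in all five charsets, which is why every row in the table ends with 445e.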